Inconsistent response from OpenAI Deployments API

Taylor Nelson 1 Reputation point
2025-06-04T04:18:18.6166667+00:00

According to the documentation, the Azure OpenAI Deployments API should return token limits for the deployment (if applicable).

    "capabilities": {
      "area": "EUR",
      "chatCompletion": "true",
      "jsonObjectResponse": "true",
      "maxContextToken": "128000",
      "maxOutputToken": "16834",
      "assistants": "true"
    },

However, in some cases this information is not included. In my case, the fields are missing for gpt-4o and gpt-4.1, but present for gpt-3.5, gpt-4, gpt-4-vision, and gpt-4o-mini.

Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

1 answer

  1. Azar 29,520 Reputation points MVP Volunteer Moderator
    2025-06-04T10:39:06.8266667+00:00

    Hi there Taylor Nelson,

    Thanks for using the Q&A platform.

    Yes, I have seen this too. The OpenAI Deployments API is supposed to return maxContextToken and maxOutputToken, but some deployments (such as gpt-4o or gpt-4.1) may not include those fields, even though they are present for gpt-3.5 or gpt-4. This is likely due to backend deployment schema differences, and will hopefully be addressed in a future rollout.

    The best workaround is to refer to the official model documentation for the token limits, or to test them programmatically.

    If this helps, kindly accept the answer. Thanks!
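One way to implement the documentation-fallback workaround from the answer above: keep a small local table of documented limits and use it only when the Deployments API omits the fields. The numbers in the table are illustrative placeholders, not authoritative; confirm them against the current Azure OpenAI models documentation before relying on them.

```python
# Fallback table of documented limits for deployments whose API response
# omits the fields. NOTE: these numbers are illustrative -- verify against
# the current Azure OpenAI models documentation before relying on them.
KNOWN_LIMITS = {
    "gpt-4o": {"maxContextToken": 128000, "maxOutputToken": 16384},
}


def token_limits(model: str, capabilities: dict) -> dict:
    """Prefer limits reported by the Deployments API (string-valued,
    as in the question's snippet); fall back to the local table when
    the fields are missing. Returns {} for an unknown model."""
    if "maxContextToken" in capabilities and "maxOutputToken" in capabilities:
        return {
            "maxContextToken": int(capabilities["maxContextToken"]),
            "maxOutputToken": int(capabilities["maxOutputToken"]),
        }
    return KNOWN_LIMITS.get(model, {})


# API omitted the fields -> fall back to the table:
print(token_limits("gpt-4o", {"chatCompletion": "true"}))
# API reported the fields -> use them directly:
print(token_limits("gpt-4", {"maxContextToken": "8192", "maxOutputToken": "4096"}))
```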

