Azure OpenAI Responses API x-ratelimit headers values -1, 0

Renganathan, Hemnaath 0 Reputation points
2025-11-17T20:19:19.9666667+00:00

The header values coming back from the Azure OpenAI Response API calls are coming back with 0 and -1 values.

x-ratelimit-limit-tokens: -1

x-ratelimit-remaining-tokens: -1

x-ratelimit-reset-tokens: 0

x-ratelimit-reset-tokens usually gives me an int in seconds that I wait for before retrying the call. I also noticed the retry-after header is missing. Any help would be appreciated. Thanks!

I'm using an Azure OpenAI resource, Responses API, GPT4.1.

I tried chat completion endpoint and that is working fine.

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
{count} votes

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.