Where to find metrics concerning rate limiting?

Stephen Cattaneo 25 Reputation points
2025-03-21T20:37:44.88+00:00

Hi,

I'm using GPT-4o with the OpenAI SDK and Assistants, and I'm hitting rate limits. I understand that limits are based on 1) Requests per Minute and 2) Tokens per Minute.

I'm trying to find the right metrics that will tell me what limits my API calls have hit so that I can detect issues in the future or decide what kind of quota I need to request.

The model deployment has metrics here -- https://ai.azure.com/resource/metrics

Viewing this dashboard I get 5 errors that are all some version of
"BadRequest: Failed to find metric configuration for provider: Microsoft.CognitiveServices, resource Type: accounts, metric: AzureOpenAIRequests"

Is this the right dashboard? If so, how do I fix the errors? Is there another dashboard I should be looking to for

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

1 answer

  1. kothapally Snigdha 3,020 Reputation points Microsoft External Staff Moderator
    2025-03-22T00:43:53.32+00:00

    Hi Stephen Cattaneo

    I'm glad that you were able to resolve your issue and thank you for posting your solution so that others experiencing the same thing can easily reference this!

    Since the Microsoft Q&A community has a policy that "The question author cannot accept their own answer. They can only accept answers by others", I'll repost your solution.

    Please click "Accept" on the answer. When original posters identify the correct answer, the community can find answers faster.

    Issue: Where to find metrics concerning rate limiting? 

    Resolution: Since posting this question, the above dashboard has started working.

    So, I guess this is answered now.

    Please don’t forget to close the thread by clicking "Accept the answer" wherever the information provided helps you, as this can be beneficial to other community members.
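
As a complement to the dashboard, the original goal of telling which limit (Requests per Minute or Tokens per Minute) was hit can also be approached from the client side. The following is a minimal sketch, not a confirmed method from this thread. It assumes the service returns `x-ratelimit-remaining-requests` and `x-ratelimit-remaining-tokens` response headers; check the headers your own deployment actually sends before relying on it. The function name `exhausted_limits` is hypothetical.

```python
from typing import List, Mapping, Optional

# Assumed header names; verify them against your deployment's real responses.
REMAINING_REQUESTS = "x-ratelimit-remaining-requests"
REMAINING_TOKENS = "x-ratelimit-remaining-tokens"


def _to_int(value: Optional[str]) -> Optional[int]:
    """Parse a header value as an int; return None if missing or malformed."""
    if value is None:
        return None
    try:
        return int(value)
    except ValueError:
        return None


def exhausted_limits(headers: Mapping[str, str]) -> List[str]:
    """Return which limits look exhausted: "requests", "tokens", both, or none.

    A limit is reported only when its header is present and its value is <= 0.
    Missing or unparseable headers are treated as "unknown", not as exhausted.
    """
    # Header names are case-insensitive, so normalize before looking them up.
    lowered = {key.lower(): value for key, value in headers.items()}
    remaining_requests = _to_int(lowered.get(REMAINING_REQUESTS))
    remaining_tokens = _to_int(lowered.get(REMAINING_TOKENS))

    hit: List[str] = []
    if remaining_requests is not None and remaining_requests <= 0:
        hit.append("requests")
    if remaining_tokens is not None and remaining_tokens <= 0:
        hit.append("tokens")
    return hit
```

With the OpenAI Python SDK, one way to use this would be to catch `openai.RateLimitError` and pass `err.response.headers` to `exhausted_limits`, then log the result so that recurring requests-versus-tokens exhaustion is visible when deciding what quota to request.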

