Why do I hit the OpenAI rate limit?

Sergey Markosyan 10 Reputation points
2023-03-24T18:06:49.74+00:00

The rate limit for the ChatGPT model is 300 requests per minute. However, our requests are hitting the rate limit at much lower rates. The "Metrics" report of the Azure OpenAI service shows a maximum of 200 requests per 5-minute interval, e.g. 188 total requests, of which 76 are blocked (we get a rate limit error). How is that possible?
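
To make the numbers concrete, here is a rough sketch of the arithmetic using only the figures quoted above. The variable and function names are made up for illustration. The per-second figure is plain division; whether the service actually enforces the limit over windows shorter than a minute is an assumption that the metrics shown here do not confirm.

```python
# Figures taken from the question; names are illustrative only.
RATE_LIMIT_PER_MINUTE = 300   # documented limit quoted in the question
WINDOW_MINUTES = 5            # width of one metrics interval
total_requests = 188          # requests seen in one 5-minute interval
blocked_requests = 76         # requests rejected with a rate limit error


def average_per_minute(total: int, window_minutes: int) -> float:
    """Average request rate over the metrics window."""
    return total / window_minutes


def blocked_share(blocked: int, total: int) -> float:
    """Fraction of requests in the window that were rejected."""
    return blocked / total


avg_rpm = average_per_minute(total_requests, WINDOW_MINUTES)
share = blocked_share(blocked_requests, total_requests)

# Pure arithmetic: what 300 requests/minute would mean per second IF the
# limit were applied evenly on a per-second basis (an assumption).
limit_per_second = RATE_LIMIT_PER_MINUTE / 60

print(f"average requests/minute: {avg_rpm:.1f}")
print(f"share of requests blocked: {share:.1%}")
print(f"limit expressed per second (assumed even split): {limit_per_second:.1f}")
```

The average works out to well under the 300 requests per minute limit, which is exactly the discrepancy being asked about. A 5-minute aggregate only gives an average, so it cannot show how the requests were distributed within the interval.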

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
