why do I hit openai rate limit
Sergey Markosyan
10
Reputation points
The rate limit for ChatGPT model is 300 requests per minute. However our requests are hitting rate limit at much lower rates. The "metrics" report of Azure OpenAI service shows maximum 200 requests in 5-minute intervals, e.g. 188 total requests, from which 76 are blocked (we get rate limit error). How that is possible?
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,081 questions
Sign in to answer