Azure OpenAI Availability rate down to 65%. 503 error

Tung Nguyen Xuan 70 Reputation points
2025-01-17T05:48:08.3+00:00

Today I frequently got service denial for chat completion requests with high token counts (~10K)

openai.InternalServerError: Error code: 503 - {'error': {'code': 'InternalServerError', 'message': 'The service is temporarily unable to process your request. Please try again later.'}}

Included is the availability chart from monitoring

User's image

Deployment info

gpt 4o

Deployment typeGlobal Standard

Rate limit (Tokens per minute)13,565,000

Rate limit (Requests per minute)81,390

Model version2024-11-20

Region eastus

Troubleshooting in the Portal did not help.

I need to explain to my customers that the latency and unavailability is caused by AzureOpenAI, not my production code.

I need pressing support right now.

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,098 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.