Azure AI - Error 429 GPT4o Chat Playground

Abigail Joyce Cuadra 20 Reputation points
2024-07-03T11:53:11.3866667+00:00

Dear Team,

I have configured in Azure AI Chat Playground the "Add your data" (Preview) with GPT4o deployment.

It is connected to my Azure AI Search Index which has 9 PDF documents.

Whenever I chat with the bot, it successfully respond to the first question, however if I ask another question related to the documents, its easily throws an error which states

"Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 27 seconds.'}"

I have tested it on the other Chat Playground where I do not configure enterprise data, and it seems to work quite well. Would like to know what could be the reason for this? does this have to do with TPM quota limits?

Azure AI Search
Azure AI Search
An Azure search service with built-in artificial intelligence capabilities that enrich information to help identify and explore relevant content at scale.
1,069 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,259 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,912 questions
{count} votes

Accepted answer
  1. Amira Bedhiafi 26,571 Reputation points
    2024-07-03T13:52:14.1633333+00:00

    The error 429 means that you have submitted too many tokens or requests in a short period of time and have exceeded the number of requests allowed.

    Azure services often enforce quotas on transactions per minute (TPM). If your application or service exceeds this quota, it will start receiving 429 errors.

    So you need to check your Azure portal for the specific TPM limits for your Azure AI service (in this case, Azure AI Chat Playground with GPT4o deployment and Azure AI Search). If you're hitting the TPM limits, you may need to adjust your application logic to spread out requests more evenly over time or consider increasing your TPM quota.

    https://help.openai.com/en/articles/6891829-error-code-429-rate-limit-reached-for-requests

    https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.