Azure AI - Error 429 GPT4o Chat Playground

Question

Abigail Joyce Cuadra 20

Dear Team,

I have configured in Azure AI Chat Playground the "Add your data" (Preview) with GPT4o deployment.

It is connected to my Azure AI Search Index which has 9 PDF documents.

Whenever I chat with the bot, it successfully responds to the first question. However, if I ask another question related to the documents, it throws an error that states:

"Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 27 seconds.'}"

I have tested it in another Chat Playground where I did not configure enterprise data, and it seems to work quite well. What could be the reason for this? Does it have to do with TPM quota limits?

ajkuma 28,036 Reputation points Microsoft Employee Moderator

2024-07-11T10:34:32.96+00:00

Abigail Joyce Cuadra, just checking in to see whether you had a chance to see the previous response. If the answer helped (pointed you in the right direction), please click Accept Answer. Otherwise, please share the requested or additional information so we can help you better.

Accepted answer

Answer 1

Amira Bedhiafi 33,631 Volunteer Moderator

The error 429 means that you have submitted too many tokens or requests in a short period of time and have exceeded the number of requests allowed.

Azure OpenAI deployments enforce a quota on tokens per minute (TPM), along with a related requests-per-minute limit. If your application or service exceeds this quota, it will start receiving 429 errors.

So you need to check your Azure portal for the specific TPM limits for your Azure AI service (in this case, Azure AI Chat Playground with GPT4o deployment and Azure AI Search). If you're hitting the TPM limits, you may need to adjust your application logic to spread out requests more evenly over time or consider increasing your TPM quota.
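As a rough illustration of spreading requests out over time, here is a minimal client-side sketch in Python. It is not part of any Azure SDK: the `TokenBudget` class, its method names, and the numbers in the usage note are all hypothetical, and the per-request token count is your own estimate. With "Add your data", that estimate should probably include the retrieved document text as well as the question, since both count toward the prompt.

```python
import time
from collections import deque


class TokenBudget:
    """Hypothetical helper: allow at most `tpm_limit` tokens per `window` seconds."""

    def __init__(self, tpm_limit, window=60.0, clock=time.monotonic):
        self.tpm_limit = tpm_limit
        self.window = window
        self.clock = clock
        self.events = deque()  # (timestamp, tokens) of recent requests

    def _used(self, now):
        # Forget requests that have aged out of the window, then sum the rest.
        while self.events and now - self.events[0][0] >= self.window:
            self.events.popleft()
        return sum(tokens for _, tokens in self.events)

    def wait_time(self, tokens):
        """Seconds to wait before a request of `tokens` fits in the budget."""
        if tokens > self.tpm_limit:
            raise ValueError("a single request exceeds the per-minute budget")
        now = self.clock()
        used = self._used(now)
        wait = 0.0
        # Walk the oldest requests until enough budget would be freed.
        for stamp, spent in self.events:
            if used + tokens <= self.tpm_limit:
                break
            used -= spent
            wait = stamp + self.window - now
        return max(wait, 0.0)

    def record(self, tokens):
        """Note that a request of `tokens` was just sent."""
        self.events.append((self.clock(), tokens))
```

A caller would do something like `time.sleep(budget.wait_time(estimate))` before each request and `budget.record(estimate)` after sending it. This only paces a single client; the service-side quota remains the authority, so 429 responses can still occur.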

https://help.openai.com/en/articles/6891829-error-code-429-rate-limit-reached-for-requests

https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits
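
If you call the deployment from your own code rather than the playground, one common way to handle 429 is to wait for the period given in the error message and then retry. The sketch below assumes the message has the same "Try again in N seconds" wording as the error quoted in the question. `RateLimited`, `retry_after_seconds`, and `call_with_retry` are hypothetical names for illustration, and `send` stands in for whatever function performs your request and raises `RateLimited` on a 429.

```python
import re
import time


class RateLimited(Exception):
    """Hypothetical exception your request function raises on HTTP 429."""


def retry_after_seconds(message, default=1.0):
    """Extract the wait time from a 'Try again in N seconds' message."""
    match = re.search(r"Try again in (\d+) seconds?", message)
    return float(match.group(1)) if match else default


def call_with_retry(send, max_attempts=5, sleep=time.sleep):
    """Call `send()`, waiting and retrying when it raises RateLimited."""
    for attempt in range(1, max_attempts + 1):
        try:
            return send()
        except RateLimited as err:
            if attempt == max_attempts:
                raise  # give up and surface the last 429 to the caller
            # Prefer the server's hint; otherwise back off exponentially.
            sleep(retry_after_seconds(str(err), default=2.0 ** attempt))
```

Retrying only helps when the limit is hit occasionally. If every follow-up question fails, the quota itself is probably too low for the amount of text being sent per request.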

Abigail Joyce Cuadra 20 Reputation points

2024-07-12T07:23:16.5733333+00:00

Thank you! The issue is now resolved. Indeed, there were too many tokens per request on my deployment, so I adjusted the TPM limits for my GPT4o model deployment. Appreciate your help!

Dvir Hanum 5 Reputation points

2024-12-26T06:27:18.8666667+00:00

Actually, I face the same issue. There is no way that I exceeded the limit. I configured the response to be 300 tokens, I have only one document, and I sent a very short prompt (75 tokens). Only the first prompt works; after that it always returns a 429 error. When the timeout passed, I tried again and it just continued to return 429.
