Rate Limit Exceeded Azure OpenAI Standard

Barry Briggs 110 Reputation points
2024-12-04T21:44:07.3933333+00:00

Connected Azure OpenAI to small nonvectorized data set in Azure AI Search. Responses in Azure OpenAI Chat Playground are set to be limited to the dataset. (GPT-4o, Standard S0)

When I ask one of the sample questions (in the Chat UI) which I know to not be in the dataset it returns "The requested information is not available in the retrieved data. Please try another query or topic" -- which is correct.

When I ask a question I know to be in the Azure AI Search index, it consistently returns "Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 59 seconds.'}}" -- no matter how long I wait.

I've only typed in a half-dozen or so very short (6 words-ish) prompts and updated the system prompt (~35-40 words). I've waited much longer than 59 seconds between prompts.

Azure OpenAI Service

Accepted answer
  1. Max Lacy 345 Reputation points
    2024-12-05T13:34:19.76+00:00

    I understand you are experiencing a rate-limit issue when using the "connect your data" feature in the Azure OpenAI Chat Playground, with responses limited to those derived from your dataset.

    When a deployment is created, its assigned tokens-per-minute (TPM) quota maps directly to the tokens-per-minute rate limit enforced on its inferencing requests. A requests-per-minute (RPM) rate limit is also enforced, set proportionally to the TPM assignment at the following ratio:

    6 RPM per 1,000 TPM.
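    The ratio above can be sketched as a small helper. This is an illustrative function (not part of any Azure SDK), assuming the 6-RPM-per-1,000-TPM ratio stated above:

    ```python
    def rpm_limit(tpm_quota: int) -> float:
        """Estimate the enforced requests-per-minute limit for a deployment.

        Assumes the documented ratio of 6 RPM per 1,000 TPM; the actual
        enforced value is shown in the deployment's quota settings.
        """
        return tpm_quota / 1000 * 6

    # A 30,000 TPM deployment allows roughly 180 requests per minute.
    print(rpm_limit(30_000))  # → 180.0
    ```

    So even a modest TPM quota can be exhausted quickly in RPM terms when each Playground turn triggers multiple backend calls.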

    Connecting to your data increases the number of calls per minute. The flow of an API call becomes Assistants API -> AI Search -> Assistants API, so each Playground turn consumes multiple requests. If you're returning a large dataset, that can also trigger the rate limit.

    To solve your problem, look at increasing your tokens-per-minute quota for the deployment in the Azure AI Foundry portal. This proportionally raises the allowed RPM, so you will hit rate limits less often.
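    Until the quota is raised, clients can also handle 429 responses by backing off and retrying. Below is a minimal, library-agnostic sketch: `send_request` is a hypothetical placeholder for your actual chat-completion call, and real code would prefer the `Retry-After` header value returned with the 429 over the fixed exponential backoff used here:

    ```python
    import time

    def call_with_retry(send_request, max_retries=5, base_delay=1.0):
        """Retry a callable returning (status_code, body) on HTTP 429.

        Uses capped exponential backoff between attempts; `send_request`
        stands in for whatever issues the Azure OpenAI request.
        """
        for attempt in range(max_retries):
            status, body = send_request()
            if status != 429:
                return body
            # Wait 1s, 2s, 4s, ... (capped at 60s) before retrying
            time.sleep(min(base_delay * 2 ** attempt, 60.0))
        raise RuntimeError("Rate limit still exceeded after retries")
    ```

    This doesn't remove the underlying quota pressure, but it smooths over the transient 429s that the doubled call pattern (chat plus search) produces.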

    Screenshot of the deployment UI of Azure AI Foundry

    1 person found this answer helpful.
