An error occurred when calling Azure OpenAI: Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 10 seconds.'}}

MATTHEW NG 30 Reputation points
2024-08-15T05:54:04.3833333+00:00

I added my own data, which was just a text document that says "The picture consists of a cat and a dog". That's all!

But when I ran my chatbot with that data and asked "What does my picture consists of?", I got "An error occurred when calling Azure OpenAI: Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 10 seconds.'}}"

What is this? I checked my TPM (1K) and rate limit (1000), and I see no sign of the rate limit ever being exceeded. Everything is fine when I use the regular chatbot function for general queries, but when I use any data of my own, the same error appears. How can I fix this?

Azure OpenAI Service

Accepted answer
  1. AshokPeddakotla-MSFT 35,971 Reputation points Moderator
    2024-08-16T05:44:30.36+00:00

    @MATTHEW NG, greetings and welcome to the Microsoft Q&A forum!

    Glad to hear your issue is resolved.

    Because Microsoft Q&A policy does not allow question authors to accept their own answers, only answers from others, I'm reposting your solution here in case you'd like to accept it.

    Error Message:

    I added my own data, which was just a text document that says "The picture consists of a cat and a dog". That's all! But when I ran my chatbot with that data and asked "What does my picture consists of?", I got "An error occurred when calling Azure OpenAI: Server responded with status 429. Error message: {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 10 seconds.'}}"

    Solution:

    Problem solved. I requested a higher quota; increasing the TPM and rate limit fixed the issue.

    The error message relates to rate limits, which APIs commonly use to prevent abuse and ensure fair usage.

    In your case, the error message indicates that you’ve exceeded the token rate limit of your current AI Services S0 pricing tier.
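
The error message itself states how long to wait ("Try again in 10 seconds"), so until a quota increase is in place a client can pause and retry. The following is only a minimal sketch: `RateLimitError`, `retry_delay_seconds`, and `call_with_retry` are hypothetical names for illustration, not part of any Azure SDK, and it assumes the 429 message text is available as the exception's string.

```python
import re
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for whatever 429 error your client raises."""


def retry_delay_seconds(message, default=10.0):
    """Pull the wait time out of a message such as
    'Rate limit is exceeded. Try again in 10 seconds.'
    Falls back to `default` when no hint is present."""
    match = re.search(r"Try again in (\d+(?:\.\d+)?) seconds?", message)
    return float(match.group(1)) if match else default


def call_with_retry(send_request, max_attempts=5, sleep=time.sleep):
    """Call `send_request` (any zero-argument callable), waiting and
    retrying when it raises RateLimitError."""
    for attempt in range(1, max_attempts + 1):
        try:
            return send_request()
        except RateLimitError as exc:
            if attempt == max_attempts:
                raise  # give up after the final attempt
            sleep(retry_delay_seconds(str(exc)))
```

Retrying only smooths over short bursts. If a single request already needs more tokens than the deployment allows per minute, a higher quota is still required.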

    Azure OpenAI’s quota feature enables assignment of rate limits to your deployments, up to a global limit called your “quota.” Quota is assigned to your subscription on a per-region, per-model basis in units of Tokens-per-Minute (TPM).

    You can check this documentation for more details.

    To give more context, each deployment has both Tokens-Per-Minute (TPM) and Requests-Per-Minute (RPM) rate limits.

    TPM rate limits are based on the maximum number of tokens that are estimated to be processed by a request at the time the request is received.
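
To make the estimate concrete, here is a rough sketch of why a request that uses your own data can exceed a 1K TPM deployment while a plain chat request does not. The formula (prompt tokens plus `max_tokens` times `best_of`) is an assumed simplification rather than the service's exact calculation, and the token counts are made-up illustrative numbers. Requests that use your own data typically carry retrieved text in the prompt, which makes the prompt far larger than the question alone.

```python
def estimated_request_tokens(prompt_tokens, max_tokens, best_of=1):
    """Assumed upper-bound estimate of the tokens one request may consume."""
    return prompt_tokens + max_tokens * best_of


def fits_in_tpm(request_tokens, tpm_limit, used_this_minute=0):
    """True if the request still fits in this minute's token budget."""
    return used_this_minute + request_tokens <= tpm_limit


TPM_LIMIT = 1000  # the 1K TPM deployment from the question

# Hypothetical plain chat request: short prompt, modest completion cap.
plain_chat = estimated_request_tokens(prompt_tokens=50, max_tokens=400)

# Hypothetical request with your own data: retrieved text and extra
# instructions inflate the prompt, and the completion cap is larger.
with_own_data = estimated_request_tokens(prompt_tokens=1500, max_tokens=800)

print(plain_chat, fits_in_tpm(plain_chat, TPM_LIMIT))        # 450 True
print(with_own_data, fits_in_tpm(with_own_data, TPM_LIMIT))  # 2300 False
```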

    RPM rate limits are based on the number of requests received over time. The rate limit expects that requests be evenly distributed over a one-minute period. If this average flow isn't maintained, then requests may receive a 429 response even though the limit isn't met when measured over the course of a minute.
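
The effect of uneven request flow can be illustrated with a small simulation. This is a sketch under stated assumptions: the ratio of 6 RPM per 1,000 TPM and the 10-second evaluation window are assumptions for illustration (check the quota documentation for your model), and `count_rejected` is a hypothetical helper, not how the service is actually implemented.

```python
def rpm_from_tpm(tpm, rpm_per_1000_tpm=6):
    """Assumed proportional RPM limit for a given TPM assignment."""
    return tpm * rpm_per_1000_tpm // 1000


def count_rejected(timestamps, rpm, window_seconds=10):
    """Count requests that would get a 429 if the per-minute limit were
    enforced as an even share per fixed window (an assumption)."""
    allowance = rpm * window_seconds / 60  # requests allowed per window
    accepted = {}
    rejected = 0
    for t in timestamps:
        window = int(t // window_seconds)
        if accepted.get(window, 0) + 1 > allowance:
            rejected += 1
        else:
            accepted[window] = accepted.get(window, 0) + 1
    return rejected


rpm = rpm_from_tpm(1000)                    # 6 requests per minute
print(count_rejected([0, 1, 2], rpm))       # burst of 3 requests: 2 rejected
print(count_rejected([0, 10, 20], rpm))     # same 3 requests, spaced: 0 rejected
```

Both runs send only three requests, well under six per minute, yet the burst is partly rejected. Spacing requests out, or raising the quota, avoids this.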

    Please see Manage Azure OpenAI Service quota for more details.

    To view your quota allocations across deployments in a given region, select Shared Resources > Quota in Azure OpenAI Studio, then click the link to request a quota increase.

    I hope this helps. If you have any further questions, please let me know. Thank you for your time and patience throughout this issue.

2 additional answers

  1. MATTHEW NG 30 Reputation points
    2024-08-15T15:25:05.87+00:00

    Problem solved. I requested a higher quota; increasing the TPM and rate limit fixed the issue.

  2. Usman Yousuf 0 Reputation points
    2025-02-26T15:39:02.85+00:00

    I am facing the same issue. Microsoft should increase the token limit.