OpenAI GPT-4 API rate limit error

highever highever 0 Reputation points
2023-11-27T02:58:50.8466667+00:00

I'm getting a ChatGPT error 429 message when calling the GPT-4 API for 10 consecutive requests. The error says that I exceeded the token rate limit of my current OpenAI S0 pricing tier. I tried to request a quota increase by submitting an application, but I didn't receive any response. Can someone suggest how to resolve this issue? Here's the exact error message:

ChatGPT error 429: {"error":{"code":"429","message": "Requests to the ChatCompletions_Create Operation under Azure OpenAI API version 2023-07-01-preview have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 50 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit."}}
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

1 answer

  1. AshokPeddakotla-MSFT 30,076 Reputation points
    2023-11-27T04:20:27.43+00:00

    highever highever Greetings & Welcome to Microsoft Q&A forum!

    I'm getting a ChatGPT error 429 message when calling the GPT-4 API for 10 consecutive requests. The error says that I exceeded the token rate limit of my current OpenAI S0 pricing tier.

    To give more context: as each request is received, Azure OpenAI computes an estimated max processed-token count that includes the following:

    • Prompt text and count
    • The max_tokens parameter setting
    • The best_of parameter setting

    As requests come into the deployment endpoint, the estimated max-processed-token count is added to a running token count of all requests that is reset each minute. If at any time during that minute, the TPM rate limit value is reached, then further requests will receive a 429 response code until the counter resets. For more details, see Understanding rate limits.
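    The accounting described above can be sketched as follows. This is an illustrative model only: the 4-characters-per-token heuristic and the class/function names are assumptions for the sketch, not the exact algorithm Azure OpenAI uses.

    ```python
    def estimated_max_tokens(prompt: str, max_tokens: int, best_of: int = 1) -> int:
        """Rough upper bound on tokens a request may consume:
        prompt tokens plus max_tokens for each best_of candidate."""
        prompt_tokens = len(prompt) // 4  # crude heuristic: ~4 characters per token
        return prompt_tokens + max_tokens * best_of

    class MinuteTokenBucket:
        """Running per-minute token counter; once the TPM limit is reached,
        further requests are rejected (the service returns 429) until the
        counter resets at the start of the next minute."""

        def __init__(self, tpm_limit: int):
            self.tpm_limit = tpm_limit
            self.used = 0

        def try_admit(self, request_tokens: int) -> bool:
            if self.used + request_tokens > self.tpm_limit:
                return False  # over the TPM limit -> caller sees a 429
            self.used += request_tokens
            return True

        def reset(self) -> None:
            # Invoked once per minute by the service.
            self.used = 0
    ```

    Note that the count is an estimate made before the response is generated, which is why keeping `max_tokens` and `best_of` small matters even when actual completions are short.
    
    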

    I tried to adjust the credit limit by submitting an application, but I didn't receive any response.

    I understand that you have already submitted an application for quota increase through https://aka.ms/oai/quotaincrease  

    As mentioned in the form, Priority will be given to customers who generate traffic that consumes the existing quota allocation, and your request may be denied if this condition is not met.

    We will make every effort to accommodate your request; however, allocation is based on our current capacity and future deployments, and is subject to availability.

    I would suggest that you kindly wait for the request to be approved.

    Can someone suggest how to resolve this issue?

    To minimize issues related to rate limits, it's a good idea to use the following techniques:

    • Set max_tokens and best_of to the minimum values that serve the needs of your scenario. For example, don’t set a large max_tokens value if you expect your responses to be small.
    • Use quota management to increase TPM on deployments with high traffic, and to reduce TPM on deployments with limited needs.
    • Implement retry logic in your application.
    • Avoid sharp changes in the workload. Increase the workload gradually.
    • Test different load increase patterns.
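    The retry-logic point above can be sketched as a small helper that backs off on 429 responses. This is a generic sketch, not code from the Azure OpenAI SDK: `send_request`, `RateLimitError`, and `call_with_retries` are hypothetical names, and a real handler should honor the Retry-After hint the service returns (50 seconds in the error message above).

    ```python
    import random
    import time

    class RateLimitError(Exception):
        """Hypothetical exception representing an HTTP 429 response."""
        def __init__(self, retry_after=None):
            super().__init__("429: rate limit exceeded")
            self.retry_after = retry_after  # seconds suggested by the server, if any

    def call_with_retries(send_request, max_retries=5, base_delay=1.0):
        """Call send_request(), retrying on 429 with the server's Retry-After
        hint when available, else exponential backoff with jitter."""
        for attempt in range(max_retries):
            try:
                return send_request()
            except RateLimitError as err:
                if attempt == max_retries - 1:
                    raise  # give up after the final attempt
                delay = err.retry_after
                if delay is None:
                    delay = base_delay * (2 ** attempt) + random.uniform(0, 1)
                time.sleep(delay)
    ```

    Combined with gradual load increases, this keeps short bursts from failing outright: a request that hits the per-minute cap simply waits for the counter to reset instead of surfacing an error to the user.
    
    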

    Hope this helps. Do let me know if you have any further queries.


    If the response helped, please click Accept Answer and Yes for "Was this answer helpful".

    Doing so would help other community members with a similar issue identify the solution. I highly appreciate your contribution to the community.
