Getting error about Rate Limit Exceeded

Linda Manganaro 40 Reputation points
2025-05-08T04:40:22.2366667+00:00

I am doing a simple Learning exercise on Generative AI in Azure Portal and when I try the chat exercise I keep getting the message "Rate Limit Exceeded. Adjust your tokens per minute rate limit in Models + endpoints or try again later." The initial prompt works fine, but all subsequent ones fail with that message. Currently the TPS shows as 6. Shouldn't that be enough for a simple Learning task? What do I need to do for this to work?

Azure | Azure Training
{count} votes

1 answer

Sort by: Most helpful
  1. VarunTha 14,880 Reputation points Microsoft External Staff Moderator
    2025-05-08T20:59:28.93+00:00

    Hi Linda Manganaro,

    Thank you for providing the additional details.

    We attempted to reproduce the issue using the exercise and obtained the following outcome, as shown in the screenshot below:

    User's image

    User's image

    Could you please confirm whether you have selected the GPT-4 model? Selecting a different model may lead to a "rate limit exceeded" error. We have followed the documentation using GPT-4 and were able to achieve successful results.

    If you have any further questions or need additional assistance, feel free to ask.


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.