Azure OpenAI, gpt-4o-mini have exceeded token rate limit of your current OpenAI S0 pricing tier.

santai 0 Reputation points
2024-11-15T05:35:45.5366667+00:00

User's image

When I use playgroud to chat. When I only type Text, it works well.

But when I upload an image. it prompts: Requests to the ChatCompletions_Create Operation under Azure OpenAI API version 2024-10-01-preview have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 86400 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit. | Apim-request-id: 5914fe68-c41b-4afe-8185-5b6fc55f0a4f

I caculated the uploaded image's tokens. The tokens value is 17650. I see my gpt-4o-mini TPM is 30K.

17,650 did not exceed 30k. so Why?

User's image

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,098 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Daniel Fang 1,060 Reputation points MVP
    2024-11-18T10:10:35.8166667+00:00

    HI @santai

    There is a token breakdown in the playground that you can check. see below.

    The main reason i am guess is that you might have chat history in the request which will be included in the token. i.e. multiple chat turns in the chat window.

    User's image

    on top of this break down, you can also

    • use the view code option and switch to curl. run the curl -i in the bash to see the actual quota headers that shows remaining token and request allowed
    • use the switch to JSON object to inspect the full actual api request payload (that you shall see all chat histories)
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.