AzureOpenAi remaining requests do not match the requests per minute (rpm) given on my deployment, any idea why?

Linus Nestler 20 Reputation points
2024-07-16T11:13:41.06+00:00

Hello I am working on a project that requieres Azure OpenAi Service.

I deployed the ada-002 embedding model in Azure (Eu-West) with the max tokens and requests per minute.
User's image The rpm should be 1440. But when sending a request and looking at the headers the numbers are very diffrent:

User's image

This has also caused some problems in my project as i call the api more than 240 times per minute resulting in "HTTP 429 (429)

Requests to the Embeddings_Create Operation under Azure OpenAI API version 2024-05-01-preview have exceeded call rate limit of your current OpenAI S0 pricing tier."

I don't understand why the rpm i get are so diffrent. Does anyone know why that could be?

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,987 questions
{count} votes

Accepted answer
  1. YutongTie-MSFT 51,256 Reputation points
    2024-07-17T06:41:35.51+00:00

    No problem, please share your Azure Subscription ID to me via private message, I will ping you there in case you can not find it. At the meantime, please accept the answer here, so that we can close this case.

    Regards,

    Yutong

    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.