RateLimitError: Requests to the Embeddings_Create Operation under Azure OpenAI API

Mauro Minella 5 Reputation points
2023-05-03T06:32:33.8133333+00:00

Hi, I activate an "S0 Standard" OpenAI service on my Azure subscription.

So far I just used the Completion service, even for requests with several hundred tokens, without any issue.

Yesterday I started exploring the Embeddings feature of the same OpenAI instance.

I created my first embedding and it worked, but as soon as I tried generating the second one, I go the error

"

RateLimitError: Requests to the Embeddings_Create Operation under Azure OpenAI API version 2022-12-01 have exceeded call rate limit of your current OpenAI S0 pricing tier. Please retry after 57 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.

"

In other words, I can make a single call per minute, which makes this service un-usable.

As said I'm not using a free version but the (only) paid version on Azure, and I can't believe I need to ask a quota increase for doing more than a call per minute.

Any hints, really appreciated. Thanks

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
{count} vote

2 answers

Sort by: Most helpful
  1. Sivachandran NKK 0 Reputation points
    2023-06-28T13:45:58.19+00:00

    Mauro Minella
    You can add delay in calling the embedding function so that it will by-pass the 4sec retry error

    adding sleep()

    0 comments No comments

  2. Ahmad Shafiq 190 Reputation points
    2023-12-06T08:51:01.45+00:00

    I am facing the same issue while creating an Index using the azure open ai Embedding Model!!
    How to resolve this issue??


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.