RateLimitError: Requests to the Embeddings_Create Operation under Azure OpenAI API

Mauro Minella 0 Reputation points
2023-05-03T06:32:33.8133333+00:00

Hi, I activate an "S0 Standard" OpenAI service on my Azure subscription.

So far I just used the Completion service, even for requests with several hundred tokens, without any issue.

Yesterday I started exploring the Embeddings feature of the same OpenAI instance.

I created my first embedding and it worked, but as soon as I tried generating the second one, I go the error

"

RateLimitError: Requests to the Embeddings_Create Operation under Azure OpenAI API version 2022-12-01 have exceeded call rate limit of your current OpenAI S0 pricing tier. Please retry after 57 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.

"

In other words, I can make a single call per minute, which makes this service un-usable.

As said I'm not using a free version but the (only) paid version on Azure, and I can't believe I need to ask a quota increase for doing more than a call per minute.

Any hints, really appreciated. Thanks

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
1,407 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Sivachandran NKK 0 Reputation points
    2023-06-28T13:45:58.19+00:00

    Mauro Minella
    You can add delay in calling the embedding function so that it will by-pass the 4sec retry error

    adding sleep()

    0 comments No comments