Why is the response unstable when I use text-embedding-ada-002?

zcc 20 Reputation points
2024-02-06T08:34:27.6766667+00:00

I am using the text-embedding-ada-002 model provided by Azure OpenAI to generate embeddings, but its response time is unstable: roughly 40% of requests are very slow. A request usually finishes within 2 seconds, but sometimes it takes 20 seconds or even longer. What could be the reason for this? Here is my code:

```python
response = client.embeddings.create(
    input=input_text,
    model=vector_engine,
)
```

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

1 answer

  1. 45711212 0 Reputation points
    2025-03-04T07:25:23.58+00:00

    Is there a solution to this problem? I'm facing the same issue. The texts being embedded are only a few words long. A call takes about 600 ms on average, but sometimes it takes up to 8 s for no apparent reason. We only make about 100 calls per day, so we are far below the rate limit.
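
Since both posts describe the slowdown only in rough terms, it may help to record per-call latency and look at the tail before digging further. Below is a minimal sketch using only the standard library. `timed_call` and `summarize` are hypothetical helper names, not part of any SDK, and the nearest-rank p95 is just one reasonable choice of percentile method.

```python
import statistics
import time
from typing import Callable, Dict, List, Tuple, TypeVar

T = TypeVar("T")


def timed_call(fn: Callable[[], T]) -> Tuple[T, float]:
    """Run fn once and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start


def summarize(latencies: List[float]) -> Dict[str, float]:
    """Return the median, nearest-rank p95, and max of latencies (seconds)."""
    if not latencies:
        raise ValueError("latencies must not be empty")
    ordered = sorted(latencies)
    # Nearest-rank p95: ceil(0.95 * n), computed with integer arithmetic.
    rank = max(1, (95 * len(ordered) + 99) // 100)
    return {
        "median": statistics.median(ordered),
        "p95": ordered[rank - 1],
        "max": ordered[-1],
    }
```

With the `client`, `input_text`, and `vector_engine` from the question, a call could be wrapped as `response, elapsed = timed_call(lambda: client.embeddings.create(input=input_text, model=vector_engine))`, collecting `elapsed` into a list and passing it to `summarize`. If the slow calls cluster near a fixed multiple of the fast ones, client-side retries may be involved. Assuming the 1.x `openai` Python package, the client's `timeout` and `max_retries` options would be worth checking.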
