Response times of chatgpt3.5 have increased

Emanuele 0 Reputation points
2023-08-28T08:01:59.99+00:00

Good morning,

I have noticed that, for the same inputs, the average response time of chatgpt3.5-16k had doubled by the end of July compared with the beginning of summer, and it has now roughly quadrupled. At times we also get more errors than usual in the responses, always on the same inputs that caused no problems at the beginning of the summer.

Is this a problem that Microsoft is aware of? Is there a plan to mitigate it? Is there anything I can do about it without changing the prompt, which at the beginning of the summer was perfect in terms of both quality and response time?

Thank you very much!

Azure OpenAI Service

1 answer

  1. romungi-MSFT 48,911 Reputation points Microsoft Employee Moderator
    2023-08-28T15:00:10.4833333+00:00

    @Emanuele You might want to enable monitoring on your Azure OpenAI resource to check latency. You can also report the behavior and include details about your resource, such as the model, region, version, scenario, and number of tokens used. This would help the team check whether any service issues could be increasing your deployment's response times. Thanks!
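
Alongside the resource-level monitoring suggested above, it may help to record client-side latency and error counts for the same inputs, so that a report can include concrete numbers. The following is only a minimal sketch: `measure`, `summarize`, and `placeholder_request` are hypothetical names introduced for illustration, and `placeholder_request` merely sleeps, so it would need to be replaced with your real request to the deployment.

```python
import math
import statistics
import time
from typing import Callable, Dict, List, Tuple


def measure(call: Callable[[], object], runs: int = 5) -> Tuple[List[float], int]:
    """Run `call` `runs` times.

    Returns the latencies (in seconds) of the successful calls and the
    number of calls that raised an exception.
    """
    latencies: List[float] = []
    errors = 0
    for _ in range(runs):
        start = time.perf_counter()
        try:
            call()
        except Exception:
            # Count the failure and skip its timing.
            errors += 1
            continue
        latencies.append(time.perf_counter() - start)
    return latencies, errors


def summarize(latencies: List[float]) -> Dict[str, float]:
    """Summarize latencies: count, mean, median, nearest-rank p95, and max."""
    if not latencies:
        raise ValueError("no successful calls to summarize")
    ordered = sorted(latencies)
    # Nearest-rank percentile: smallest value with at least 95% of samples at or below it.
    p95_index = max(0, math.ceil(0.95 * len(ordered)) - 1)
    return {
        "count": len(ordered),
        "mean": statistics.mean(ordered),
        "median": statistics.median(ordered),
        "p95": ordered[p95_index],
        "max": ordered[-1],
    }


def placeholder_request() -> None:
    """Hypothetical stand-in (assumption): replace with your real model call."""
    time.sleep(0.01)


if __name__ == "__main__":
    latencies, errors = measure(placeholder_request, runs=10)
    print(summarize(latencies), "errors:", errors)
```

Running the same fixed set of inputs at regular intervals and keeping the summaries would give a baseline to compare against, together with the model, region, version, and token counts requested above.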

    If this answers your question, please click Accept Answer and select Yes for "Was this answer helpful". If you have any further questions, let us know.
