gpt-4-1106-preview API is very very slow, waiting for more than 60 seconds

xiansen wang 100 Reputation points
2024-01-23T08:33:54.6333333+00:00

I am using the gpt-4-1106-preview API with regions set at Norway East, South India, and UK South. However, the response time in each region is extremely slow, even exceeding 120 seconds, rendering the system unusable. My Max tokens is set to 2048, and I am using stream streaming transmission. I have just tried 3 times, and all the results have timed out. Image 039

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,628 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.