gpt-4-1106-preview API is very very slow, waiting for more than 60 seconds

xiansen wang 80 Reputation points

I am using the gpt-4-1106-preview API with regions set at Norway East, South India, and UK South. However, the response time in each region is extremely slow, even exceeding 120 seconds, rendering the system unusable. My Max tokens is set to 2048, and I am using stream streaming transmission. I have just tried 3 times, and all the results have timed out. Image 039

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
1,843 questions
{count} votes