Azure OpenAI Service and OpenAI service have different response times that depend on several factors such as server load, and network latency. Also in Azure OpenAI, you have the option for a pay-as-you-go model or provisioned throughput Units options. You can improve Azure OpenAI performance by
- Make sure you deploy the OpenAI service to a region near your location
- You may use provisioned throughput to achieve minimal latency
Refer: https://azure.microsoft.com/en-us/pricing/details/cognitive-services/openai-service/
https://learn.microsoft.com/en-us/azure/ai-services/openai/overview
Hope this helps