Switched to Azure OpenAI from OpenAI's OpenAI. Using region francecentral. The measured response time went up, from an average of 2407 ms with OpenAI to 3032 ms with Azure, or an extra latency of +643 ms.
This is with model gpt-35-turbo version 0613
for both.
Has anyone tested this across region and models?
I'm wondering if:
- Some regions wouldn't be faster?
- Would 0301 be faster than 0613?