What is the reason for significant differences in latency between OpenAI and OpenAI Azure for GPT-4-vision?

Kirk Hadely 0 Reputation points
2024-05-21T20:37:35.0333333+00:00

Requests sent to the gpt-v vision preview in us-west through the OpenAI Azure integration have an order of magnitude higher latency than the same request(s) sent through the OpenAI platform directly. We would like to understand the reasons behind this divergence.

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,437 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,511 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Charlie Wei 3,305 Reputation points
    2024-05-22T00:22:51.34+00:00

    Hello Kirk Hadely,

    As a user, I have also noticed the differences between the two, but unfortunately, there is no official explanation available at this time. Additionally, I suggest reviewing the document for various methods to reduce latency.

    Best regards,
    Charlie


    If you find my response helpful, please consider accepting this answer and voting yes to support the community. Thank you!