Share via

How to get low latency on Azure central/south India server.

Aditya Purohit 5 Reputation points
2026-02-12T18:44:15.78+00:00

Have been facing high latency on the GPT-4.1-mini and other models, with every response exceeding 1000ms.

Azure OpenAI Service
Azure OpenAI Service

An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Anshika Varshney 7,970 Reputation points Microsoft External Staff Moderator
    2026-02-12T21:35:40.78+00:00

    Hi Aditya Purohit,

    Following up with an update from engineering.

    Based on our investigation, the elevated latency observed on GPT‑4.1‑mini and related models in the Central/South India region was due to a service-side issue, which is being tracked under INC 747008667.

    Issue:

    Requests were experiencing higher-than-expected end‑to‑end latency (>1000 ms) due to a backend service regression affecting request handling and routing in the region.

    Resolution:

    Engineering has applied a service-side mitigation to address the latency issue. The fix restores normal request handling behavior and improves response times. No configuration or code changes are required on the customer side. The improvements will be reflected automatically as the fix completes rollout.

    If you are still seeing higher latency after this update, please share:

    Deployment name

    Region

    Approximate request timestamps

    This will help us validate that traffic is hitting the updated path.

    If you have any remaining questions or additional details to share, feel free to let us know. We’ll be glad to provide further clarification or guidance.


    If this answers your query, please do click Accept Answer and Yes for was this answer helpful.

    Thank you!


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.