@Rajat Aggarwal, welcome to the Microsoft Q&A community.
You're running into a quota issue when trying to deploy the GPT-4o model in the South India region. Even though you haven't used any quota, Azure might have default regional limits or availability constraints that prevent deployment in certain locations.
Here’s what you can try:
- Check Azure OpenAI Quotas – You can review the quota limits for different regions in the Azure OpenAI quotas and limits documentation.
- Request a Quota Increase – In the Azure portal, navigate to your OpenAI resource, go to "Usage + quotas," and request an increase for the South India region. The Azure OpenAI quota management documentation has more details.
- Verify Model Availability – Some models, including GPT-4o, may have restricted availability in certain regions. You can check the latest model availability in the Azure OpenAI models documentation.
- Contact Azure Support – If the quota request doesn’t resolve the issue, reaching out to Azure support might help clarify whether South India is currently supported for GPT-4o deployments.
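If you prefer to check the numbers from a script rather than the portal, here is a minimal sketch of the first step. It assumes you have exported the regional usage list as JSON (for example from `az cognitiveservices usage list --location southindia`) and that each entry carries `name.value`, `limit`, and `currentValue` fields. The field names, the quota name strings, and the sample values below are my assumptions for illustration, so compare them with your own output. The helper `remaining_quota` is hypothetical, not part of any Azure SDK.

```python
import json


def remaining_quota(usages, model_fragment):
    """Return {quota_name: limit - currentValue} for entries mentioning the model.

    `usages` is a list of dicts in the assumed shape
    {"name": {"value": ...}, "limit": ..., "currentValue": ...}.
    """
    result = {}
    for entry in usages:
        name = entry.get("name", {}).get("value", "")
        if model_fragment.lower() in name.lower():
            result[name] = entry.get("limit", 0) - entry.get("currentValue", 0)
    return result


# Illustrative sample only, NOT real quota values for any region.
sample = json.loads("""
[
  {"name": {"value": "OpenAI.Standard.gpt-4o"}, "limit": 0, "currentValue": 0},
  {"name": {"value": "OpenAI.Standard.gpt-35-turbo"}, "limit": 300, "currentValue": 0}
]
""")

if __name__ == "__main__":
    for quota_name, left in remaining_quota(sample, "gpt-4o").items():
        print(f"{quota_name}: {left} remaining")
```

A limit of 0 with a current value of 0 would match what you are seeing: nothing has been used, but nothing is allocated to that region either, so the deployment is rejected until a quota increase is granted.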
Since your goal is to improve latency, deploying in a nearby region with lower network latency might be an alternative if South India remains unavailable.
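To compare candidate regions before committing to one, a rough sketch like the following can help. It times a TCP connection to each endpoint from the machine where your application runs and picks the lowest median. The hostnames are placeholders for resources you would create yourself, and the function names are my own. Note that connect time is only a proxy for network distance and does not include the model's response time.

```python
import socket
import statistics
import time


def tcp_connect_ms(host, port=443, timeout=3.0):
    """Time one TCP connection to host:port, in milliseconds."""
    start = time.perf_counter()
    with socket.create_connection((host, port), timeout=timeout):
        pass
    return (time.perf_counter() - start) * 1000.0


def median_latency_ms(host, attempts=5):
    """Median of several connection timings, to smooth out outliers."""
    return statistics.median(tcp_connect_ms(host) for _ in range(attempts))


def fastest_region(latencies):
    """Given {region: latency_ms}, return the region with the lowest latency."""
    return min(latencies, key=latencies.get)


if __name__ == "__main__":
    # Placeholder hostnames: replace with the endpoints of your own resources.
    candidates = {
        "region-a": "my-resource-a.openai.azure.com",
        "region-b": "my-resource-b.openai.azure.com",
    }
    measured = {region: median_latency_ms(host) for region, host in candidates.items()}
    print(measured)
    print("Lowest latency:", fastest_region(measured))
```

Run it from the same network as your production workload, since results from a laptop on a different network will not reflect what your users experience.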
I hope this helps. Let me know if you have any further questions or need additional assistance.
Also, if this answers your query, please click "Upvote" and "Accept the answer", which might be beneficial to other community members reading this thread.