Latency and Timeout Errors for Azure OpenAI o3-mini API Requests

Nicolas Narozniak 20 Reputation points
2025-05-19T19:35:23.55+00:00

The Azure OpenAI o3-mini API is no longer working. Requests result in timeout errors.

Region: francecentral

Azure OpenAI Service

Accepted answer
  1. SriLakshmi C 6,010 Reputation points Microsoft External Staff Moderator
    2025-05-26T17:35:38.68+00:00

    Hi @Nicolas Narozniak,

    The latency issue affecting the o3-mini model in the EU region has now been mitigated. Could you please check on your end and confirm if it's resolved? Let me know if you have any further questions or need additional assistance.

    Thank you!

    1 person found this answer helpful.

1 additional answer

  1. Jerald Felix 1,475 Reputation points
    2025-05-20T04:03:10.2133333+00:00

    Hello Nicolas Narozniak,

    The latency and timeout issues you're experiencing with the Azure OpenAI o3-mini model in the France Central region are part of a broader pattern observed across multiple regions, including East US 2 and Sweden Central. These challenges have been linked to capacity constraints and architectural limitations, particularly affecting the o1 and o3-mini models.

    Known Issues and Root Causes

    • Extended Response Times: Users have reported response times exceeding 10 minutes, regardless of the reasoning_effort parameter settings.
    • Service Health Dashboard: Despite these issues, the Azure Service Health dashboard may not always reflect ongoing problems, as was the case in France Central.
    • Capacity Limitations: In regions like East US 2, similar latency problems were attributed to capacity limits. Microsoft's product team addressed this by implementing dynamic routing to alleviate timeouts.
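
    While capacity issues persist, a client-side retry with exponential backoff and jitter can absorb transient timeouts. This is a generic sketch, not Azure-specific code: `call` and `TimeoutError` stand in for your actual SDK call and whatever timeout exception it raises.

```python
import random
import time

def call_with_retries(call, attempts=4, base_delay=1.0,
                      timeout_errors=(TimeoutError,)):
    """Retry a model call on timeout, with exponential backoff and jitter.

    `call` is any zero-argument function that performs the API request
    (e.g. a wrapper around your chat-completion call with a client timeout).
    """
    for attempt in range(attempts):
        try:
            return call()
        except timeout_errors:
            if attempt == attempts - 1:
                raise  # Out of attempts; surface the timeout to the caller.
            # Full jitter: sleep anywhere in [0, base_delay * 2^attempt).
            time.sleep(random.uniform(0, base_delay * 2 ** attempt))
```

    The full jitter (random delay rather than a fixed doubling) helps avoid many clients retrying in lockstep against an already-overloaded region.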

    Recommended Actions

    1. Monitor Service Health: Regularly check the Azure Service Health dashboard for updates on your region.
    2. Optimize Requests:
       • Reduce Prompt Complexity: Simplify prompts to decrease processing time.
       • Limit Token Usage: Lower the max_completion_tokens parameter (the o-series replacement for max_tokens) to cap response size.
    3. Implement Streaming: Use streaming responses to receive data incrementally rather than waiting for the full completion.
    4. Manage Request Rates: Even within quota limits, bursts of concurrent requests can trigger throttling. Implement client-side rate limiting to spread requests evenly over time.
    5. Consider Alternative Regions: If feasible, deploy your application in a region with better performance for this model.
    6. Explore Provisioned Throughput: For latency-sensitive applications, consider Provisioned Throughput Units (PTUs) to ensure consistent performance.
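
    The rate-limiting advice above can be sketched with a small, stdlib-only helper that enforces a minimum spacing between requests (hypothetical; no Azure SDK calls involved):

```python
import threading
import time

class RequestSpacer:
    """Allow at most one request every 1/requests_per_second seconds,
    so calls are spread evenly instead of arriving in bursts."""

    def __init__(self, requests_per_second):
        self.interval = 1.0 / requests_per_second
        self._lock = threading.Lock()
        self._next_slot = 0.0

    def wait(self):
        # Reserve the next available time slot under the lock, then
        # sleep outside the lock until that slot arrives.
        with self._lock:
            now = time.monotonic()
            slot = max(now, self._next_slot)
            self._next_slot = slot + self.interval
        delay = slot - now
        if delay > 0:
            time.sleep(delay)
```

    Calling `spacer.wait()` before each API request smooths out bursts from multiple threads, which is what matters for throttling, even when the total request count stays within quota.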

    Engage with Support: If issues persist, contact Azure Support to report the problem and receive assistance.
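
    The streaming recommendation can be illustrated with a minimal, self-contained sketch. Here `fake_stream` is a stand-in for a real streaming API response; the point is the consumption pattern, where each chunk is usable as soon as it arrives:

```python
def fake_stream(text):
    """Hypothetical stand-in for a streaming response: yields the
    completion in small chunks as they become available."""
    for word in text.split():
        yield word + " "

def consume_stream(chunks, on_chunk=print):
    """Hand each incremental chunk to a callback while accumulating
    the full text, so the first tokens are visible to the user long
    before the complete response finishes."""
    parts = []
    for chunk in chunks:
        parts.append(chunk)
        on_chunk(chunk)
    return "".join(parts)
```

    With streaming, a slow response degrades into slowly arriving text rather than a hard client-side timeout on the full completion.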

    By implementing these strategies, you can mitigate latency and timeout issues while Microsoft continues to enhance the Azure OpenAI service infrastructure.

    Best Regards,

    Jerald Felix
