Azure OpenAI service Intermittent 500 Internal server error
Hi, we are experimenting with Azure OpenAI Service. About half of the time, requests to gpt-35-turbo return a 500 Internal Server Error. Is this a known issue, or is something wrong with the Azure OpenAI Service?
Azure OpenAI Service
-
VasimTamboli • 5,110 Reputation points
2023-06-04T11:51:11.3266667+00:00 Here are a few steps you can take to troubleshoot the issue:
Retry the request: If you encounter a 500 Internal Server error, you can try retrying the request after a short period of time. Transient errors can occur, and retrying may result in a successful response.
Check service status: Visit the Azure status page (status.azure.com) to check if there are any ongoing issues or service disruptions with Azure OpenAI services. Look for any reported incidents or updates related to the service you're using.
Review error details: When you receive a 500 Internal Server error, check if the error response provides any additional details or error codes. This information can help in troubleshooting and understanding the specific issue.
Monitor error frequency: Keep track of how frequently you're encountering the 500 Internal Server errors. If the errors persist and occur consistently, it's recommended to report the issue to the Azure support team. They can investigate the problem and provide further assistance.
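The retry, error-detail, and frequency steps above can be sketched roughly as follows. This is only a minimal illustration, not an official Azure pattern: `call_with_retry` and its `send` argument are hypothetical names, and `send` is assumed to be a zero-argument callable that performs the request with whatever HTTP client you use and returns `(status_code, body)`. The delays and attempt count are arbitrary starting values.

```python
import random
import time
from collections import Counter


def call_with_retry(send, max_attempts=4, base_delay=1.0, sleep=time.sleep):
    """Call send() until it returns a non-5xx status or attempts run out.

    send is a hypothetical zero-argument callable returning (status_code, body).
    Returns (status_code, body, stats), where stats counts the status codes seen.
    """
    stats = Counter()
    status, body = None, None
    for attempt in range(max_attempts):
        status, body = send()
        stats[status] += 1  # track how often each status occurs
        if status < 500:
            break  # success or a client error; retrying will not help
        # Log the error body: it may carry details useful for a support ticket.
        print(f"attempt {attempt + 1}: HTTP {status}: {body!r}")
        if attempt < max_attempts - 1:
            # Exponential backoff with a little jitter: about 1s, 2s, 4s, ...
            jitter = random.uniform(0, 0.1 * base_delay)
            sleep(base_delay * (2 ** attempt) + jitter)
    return status, body, stats
```

If `stats` shows 5xx responses on a large share of calls over time, that count is concrete evidence to attach when reporting the issue to Azure support.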
-
VasaviLankipalle-MSFT • 18,386 Reputation points
2023-06-06T02:57:08.7266667+00:00 Hi @Zhenqi , Thanks for using Microsoft Q&A Platform.
Could you please share the region you were using? Have you checked for the service health in the Azure portal or found any logs related to this?
-
Zhenqi • 10 Reputation points
2023-06-06T23:22:32.7733333+00:00 Hi @VasaviLankipalle-MSFT, sure, I am using the East US region. Yes, I see many "Health event" logs (Activated -> In Progress -> Resolved) in the activity log, almost once every hour. When I look at the details, the properties show the following. Is there something wrong with the server?
"properties": {
    "title": "Unknown",
    "details": "Unknown",
    "currentHealthStatus": "Unavailable",
    "previousHealthStatus": "Degraded",
    "type": "Downtime",
    "cause": "Unknown"
},
-
Zhenqi • 10 Reputation points
2023-06-06T23:27:34.34+00:00 We also see "Azure OpenAI Service - East US - Mitigated, Service Issue, Azure OpenAI Service, East US, 6/2/2023 23:14:42" in the health history. It is saying already mitigated, but we are still seeing this Intermittent 500 Internal server error today.
-
Zhenqi • 10 Reputation points
2023-06-06T23:31:17.34+00:00 We also see "Azure OpenAI Service - East US - Mitigated, Service Issue, Azure OpenAI Service, East US, 6/2/2023, 23:14:" in the health history. It is saying already mitigated, but we are still facing this intermittent 500 internal server error today.
-
VasaviLankipalle-MSFT • 18,386 Reputation points
2023-06-07T00:45:50.51+00:00 Hello @Zhenqi , Thanks for sharing this with us. Let me check with the PG team on this.
-
Zhenqi • 10 Reputation points
2023-06-10T04:46:41.03+00:00 Hi @VasaviLankipalle-MSFT, thank you! Is there any update on this? It's still happening, and the service does not seem very stable...
-
01 iccth • 5 Reputation points
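2023-06-10T15:45:38.1766667+00:00 I run into this often too. I now just put the request in an automatic loop, and it feels like on average one in every three requests returns a 500 error, which is far too frequent.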
-
Zhenqi • 10 Reputation points
2023-06-15T20:27:32.36+00:00 Yeah, the quality of this service is a bit weak, and it's not very usable...
-
Zhenqi • 10 Reputation points
2023-06-15T20:36:27.7566667+00:00 Hi @VasaviLankipalle-MSFT, over the last couple of days the OpenAI service has started to return other errors frequently, such as "deployment not found" and "timeout". It seems very unstable, which hurts our user experience. I don't think it's a production-ready service for us right now, so we have canceled and disabled the Azure OpenAI service for our product. If the team fixes these issues and makes the service stable in the future, I could give it another try.
-
VasaviLankipalle-MSFT • 18,386 Reputation points
2023-06-16T03:02:02.4533333+00:00 Hello @Zhenqi , we are sorry to hear about your experience. We have forwarded this feedback to the product team already. Will reach out to you if we hear back from the team.