How to fix a broken Azure OpenAI model deployment?

Aleksandar Pavlovic 0 Reputation points
2025-02-28T10:24:16.87+00:00

Our deployment, gpt-4o-data-zone, was unavailable for over four hours (we first noticed it on Feb 27, 2025 at 10:28:30.825 AM CET), which significantly impacted our operations. All calls were failing with status 500 model_error. Do you know what could have caused this outage? More importantly, we would appreciate your guidance on how to handle such incidents in the future so that disruptions are minimized, as an outage like this can have a substantial negative impact on our customers. We tried re-creating the deployment, but that didn't help either. Is there anything we could do on our side to recover from it?

We also tried calling it from Postman and from your chat UI, but we got the same error (screenshots: image.png, image.png).

Kind regards,

Aleksandar

Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

1 answer

  1. Amira Bedhiafi 33,071 Reputation points Volunteer Moderator
    2025-03-02T23:00:49.8766667+00:00

    Start by checking the Azure Service Health dashboard to see whether there are any ongoing issues with Azure OpenAI or related services.

    Navigate to your specific resource in the Azure portal and check the Resource Health status.

    Sometimes simply deleting and recreating the deployment resolves the issue, although you mention that this did not help in your case.

    If the issue persists, try deploying the model in a different Azure region to see if it’s a region-specific problem.

    If none of the steps above resolves the issue, open a support ticket with Azure.
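To reduce the impact of a similar outage in the future, one option is to keep a second deployment in another region and fail over to it when the primary keeps returning 5xx errors. The following is only a minimal sketch of that idea, not an official Azure pattern. The helper `call_with_failover`, the exception `DeploymentUnavailable`, and the `send` callback are hypothetical names introduced here for illustration; `send` stands in for whatever code actually calls your deployment (SDK or plain HTTP) and is assumed to return a status code and a response body.

```python
import time
from typing import Callable, List, Sequence, Tuple


class DeploymentUnavailable(Exception):
    """Raised when every configured deployment kept returning server errors."""


def call_with_failover(
    deployments: Sequence[str],
    send: Callable[[str], Tuple[int, str]],
    retries_per_deployment: int = 2,
    backoff_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[str, str]:
    """Try each deployment in order and return (deployment_name, body).

    `send` is a hypothetical callback that performs the real request and
    returns (http_status, body). Only 5xx responses trigger a retry or a
    failover; anything below 500 is handed back to the caller unchanged.
    """
    errors: List[Tuple[str, int]] = []
    for name in deployments:
        for attempt in range(retries_per_deployment):
            status, body = send(name)
            if status < 500:
                return name, body
            errors.append((name, status))
            # Exponential backoff between retries on the same deployment.
            if attempt < retries_per_deployment - 1:
                sleep(backoff_seconds * 2 ** attempt)
    raise DeploymentUnavailable(f"all deployments failed: {errors}")
```

A call would pass the deployments in priority order, for example `call_with_failover(["gpt-4o-data-zone", "my-secondary-deployment"], send)`, where the second name is a placeholder for a deployment you would create in another region. Because the sketch only retries 5xx responses, throttling (429) and other client errors still need separate handling.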

