Azure ML real-time inference endpoint deloyment stuck - with deployment state as Transitioning for over 2 hours.

DaLi 61 Reputation points


I was deploying a real-time inference pipeline into an AKS compute in East US region today. The endpoint deployment state was stuck at Transitioning for over 2 hours and never finished and I had to delete it. A separate deployment to region East US 2 got stuck as well. I was able to deploy the same pipeline to East US the day before yesterday.

I wonder if this is likely an error related to my account/resources or a system wide issue? Did anyone else encounter the similar issue?

thanks in advance!

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,711 questions
{count} votes

Accepted answer
  1. romungi-MSFT 43,686 Reputation points Microsoft Employee

    Hello All,

    We have deployed a fix now to all regions and this should be fixed. Could you please retry and let us know if there are any issues.


    1 person found this answer helpful.

1 additional answer

Sort by: Most helpful
  1. SD-EVO 1 Reputation point

    A "real-time endpoint deployment" that "failed" cannot be deleted

    I also can't update the deployment to a working version.

    I tried deleting using "Azure CLI", "Azure Portal", "Azure ML Studio" and Python SDK.

    Now I'm trying to update the deployment with "bicep" but it's been running for 20 minutes now so I think it will fail again.

    I also can't delete the endpoint that contains the deployment.

    Now that this is day 3, my next try is to delete the entire "Azure ML"

    0 comments No comments