Hello @Technical Administration (Bee-Relevant),
When an Azure Kubernetes Service (AKS) cluster is stuck because a long-running operation never completes, you may see symptoms such as the az aks update command hanging, node pools stuck in the Updating or Canceled state, and Virtual Machine Scale Sets (VMSS) looping. A long-running operation that is stuck or failing can be aborted, provided it is the last running operation on the managed cluster or agent pool.
To address this, you can abort the operation with the Azure CLI. For example:
az aks operation-abort \
--name myAKSCluster \
--resource-group myResourceGroup
This command will terminate the operation and return an HTTP status code of 204 if successful.
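If the stuck operation belongs to a single node pool rather than the whole cluster, the CLI also offers a node pool variant of the abort command. The following is a minimal sketch wrapped in a helper function; the function name abort_nodepool_operation and the node pool name myNodePool are placeholders for illustration, not values from your environment.

```shell
# Abort the last running operation on one node pool (agent pool).
# Arguments: resource group, cluster name, node pool name.
# The function name is a hypothetical helper, not part of the Azure CLI.
abort_nodepool_operation() {
  az aks nodepool operation-abort \
    --resource-group "$1" \
    --cluster-name "$2" \
    --name "$3"
}
```

For example: abort_nodepool_operation myResourceGroup myAKSCluster myNodePool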
If the node pool is in a failed state, common causes include insufficient capacity, quota limits, and network problems. To troubleshoot, check the provisioning state of the node pool with az aks nodepool show and look for any error messages in the output.
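As a rough sketch of that check, the helper below prints only the provisioning state of one node pool. The function name nodepool_state and the node pool name myNodePool are assumptions for illustration; substitute your own names.

```shell
# Print only the provisioning state of a node pool (for example Succeeded or Failed).
# Arguments: resource group, cluster name, node pool name.
# nodepool_state is a hypothetical helper, not an Azure CLI command.
nodepool_state() {
  az aks nodepool show \
    --resource-group "$1" \
    --cluster-name "$2" \
    --name "$3" \
    --query provisioningState \
    --output tsv
}
```

For example: nodepool_state myResourceGroup myAKSCluster myNodePool. Dropping the --query and --output arguments returns the full node pool description, which is where any error details would appear.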
Additionally, reviewing the activity log and diagnostic settings can help identify the cause of the failure.
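One possible way to pull failed events from the activity log with the CLI is sketched below. The function name failed_activity, the default one-day look-back window, and the fields selected in the query are assumptions; adjust them to what your events actually contain.

```shell
# List failed activity-log events for a resource group.
# Arguments: resource group, optional look-back window (defaults to 1d).
# failed_activity is a hypothetical helper; the projected field names
# assume the usual activity-log event shape.
failed_activity() {
  az monitor activity-log list \
    --resource-group "$1" \
    --status Failed \
    --offset "${2:-1d}" \
    --query "[].{time:eventTimestamp, operation:operationName.localizedValue, status:status.value}" \
    --output table
}
```

For example: failed_activity myResourceGroup 2h. Depending on how the cluster was created, node-level failures may be logged under the managed node resource group rather than the cluster's own resource group.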
If this answers your question, please click Accept Answer and upvote it. If you have any further questions, let us know.