AKS node pool stuck in "starting" state after stop --> start of cluster

Markus Geiger 26 Reputation points
2022-07-13T11:58:06.923+00:00

Hi everybody,

in an AKS training, we observed the following behavior:
Several participants could not bring up their previously working AKS cluster with 3 nodes in 1 node pool after it has been stopped via the azure portal.
The node pool ist stuck in state "starting" for several hours.
Deleting the cluster took a very long time, but for some it ultimately worked so that they could recreate it.

Any thoughts on how this could happen and how to resolve the situation without deleting and recreating the cluster?

Thanks!
Markus

Azure Kubernetes Service (AKS)
Azure Kubernetes Service (AKS)
An Azure service that provides serverless Kubernetes, an integrated continuous integration and continuous delivery experience, and enterprise-grade security and governance.
2,127 questions
0 comments No comments
{count} votes

Accepted answer
  1. srbhatta-MSFT 8,561 Reputation points Microsoft Employee
    2022-07-13T12:20:14.353+00:00

    Hello @Markus Geiger ,
    Thanks for posting your query on Microsoft QnA.
    I would like to share a word of caution before answering the question for your specific scenario. It is important that you don't repeatedly start/stop your cluster. Repeatedly starting/stopping your cluster may result in errors. Once your cluster is stopped, you should wait 15-30 minutes before starting it up again.

    In your case, I understand that the cluster is stuck is "starting" state for hours when you start the cluster from Portal.
    I would recommend you to perform the stop as well as start of your cluster via cli using the below commands. If you still see the same issue persisting, then kindly drop a comment here and tag me so that i can look further into this. Please note, starting and stopping a cluster via cli also takes a few minutes. If the nodepool size is small then it should not take more than 5 minutes (this is what I have observed from experience).

    az aks stop --name <cluster-name> --resource-group <rg-name>  
    az aks start --name <cluster-name> --resource-group <rg-name>  
    

    Hope this helps. Looking forward to hear back from you. Please do tag me in your reply. Thank You.


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.