Aks A node cannot be automatically allocated

Yongchao Liu (Neusoft America Inc) 186 Reputation points Microsoft Vendor
2022-07-06T01:17:19.503+00:00

I have a total of 16 Gpus pods deployed, each pod occupies a node, but I don't know why one node can't deploy pod, I have to add a redundant node.

217942-image.png

217961-image.png

If someone can give us some guidance.
Thanks

Azure Kubernetes Service (AKS)
Azure Kubernetes Service (AKS)
An Azure service that provides serverless Kubernetes, an integrated continuous integration and continuous delivery experience, and enterprise-grade security and governance.
1,781 questions
{count} votes

1 answer

Sort by: Most helpful
  1. srbhatta-MSFT 8,536 Reputation points Microsoft Employee
    2022-07-12T10:34:07.233+00:00

    Hi @Yongchao Liu (Neusoft America Inc) ,
    Thanks for providing the additional information.

    Could you provide what is the status you are seeing in kubectl get pods/<affected-pod> and also provide 'Events' log for same by doing a kubectl describe pods/<affected-pod> ?

    You will see something under Events for the pod that states why the pod is not getting scheduled. For example, please see below -

    219881-2022-07-12-16-03-12-clipboard.png

    Looking forward to your reply.