ImagePullBackOff persists on AKS node despite confirmed AcrPull role assignment to kubelet identity
Azure Support Ticket: ImagePullBackOff Issue on AKS
Issue Summary
Title:
ImagePullBackOff persists on AKS node despite confirmed AcrPull role assignment to kubelet identity
Description:
We are experiencing an `ImagePullBackOff` error on one of our AKS pods pulling from our private Azure Container Registry (ACR), even though the AKS kubelet identity has been granted the `AcrPull` role. Other pods on different nodes pull the same image (`frontend:latest`) successfully. The affected node consistently fails with a 401 Unauthorized error when attempting to pull the image.
Environment Details
- Cluster Name: `apip-dev-aks-uaenorth`
- Resource Group: `apip-dev-rg-uaenorth`
- ACR Name: `apipdevacr`
- Region: `uaenorth`
- Image: `apipdevacr.azurecr.io/frontend:latest`
- Kubelet Object ID: `747ad783-9416-48a7-bcab-a1bc64898b45`
- ACR Scope: Correctly scoped to the ACR registry resource
- Assignment Timestamp: `2025-04-16T05:49:16Z`
Troubleshooting Performed
- Confirmed kubelet identity:
  - Retrieved the object ID via `az aks show --query identityProfile.kubeletidentity.objectId` (full command sketched after this list)
- Verified ACR role assignment:
  - Used `az role assignment list` to confirm the `AcrPull` role is correctly assigned to the kubelet identity for the ACR scope (sketch below)
- Image verified in ACR:
  - Pulled `frontend:latest` manually using Docker after `az acr login` (sketch below)
  - The same image pulls successfully on other AKS nodes
- Pod consistently fails on node `aks-agentpool-22692403-vmss000000`:
  - Output of `kubectl describe pod` confirms: `failed to fetch anonymous token: unexpected status from GET ... 401 Unauthorized` (sketch below)
- Deleted and recreated the pod:
  - The pod reschedules but fails again on the same node
- Confirmed the issue is isolated to a specific node:
  - Other pods using the same image run fine on different nodes
- Waited >30 minutes for potential role propagation:
  - The error persists well beyond the typical AAD propagation window
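For reference, the exact commands behind these checks are sketched below. Resource names come from the environment details above; anything in angle brackets is a placeholder we have not reproduced in this ticket. Kubelet identity lookup:

```bash
# Retrieve the object ID of the kubelet (agent pool) managed identity
az aks show \
  --resource-group apip-dev-rg-uaenorth \
  --name apip-dev-aks-uaenorth \
  --query identityProfile.kubeletidentity.objectId \
  --output tsv
# Returns: 747ad783-9416-48a7-bcab-a1bc64898b45
```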
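Role assignment check (the registry resource ID is built from the names above; the subscription ID is a placeholder):

```bash
# Confirm AcrPull is assigned to the kubelet identity at the registry scope
az role assignment list \
  --assignee 747ad783-9416-48a7-bcab-a1bc64898b45 \
  --scope "/subscriptions/<subscription-id>/resourceGroups/apip-dev-rg-uaenorth/providers/Microsoft.ContainerRegistry/registries/apipdevacr" \
  --output table
# Expected (and observed): a single AcrPull assignment scoped to apipdevacr
```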
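Manual image pull from a workstation, confirming the image exists and the registry accepts authenticated pulls:

```bash
# Log in to the registry with our own credentials and pull the tag directly
az acr login --name apipdevacr
docker pull apipdevacr.azurecr.io/frontend:latest
```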
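Pod inspection on the failing node (pod name and namespace are placeholders for our frontend deployment):

```bash
# Confirm which node the pod landed on and read the image pull events
kubectl get pod <frontend-pod-name> -n <namespace> -o wide
kubectl describe pod <frontend-pod-name> -n <namespace>
# Event message observed:
#   failed to fetch anonymous token: unexpected status from GET ... 401 Unauthorized
```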
Request
We request assistance from Azure support to:
- Investigate potential misconfiguration or delay in RBAC token propagation at the node or VMSS instance level
- Validate whether the kubelet on the specific node has successfully received the updated token permissions
- Suggest additional diagnostics or a workaround to refresh or reset the identity on the affected node (the node reset we are prepared to try, pending guidance, is sketched below)
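For context on the last point, the node-level reset we are prepared to try, pending your guidance, is sketched here: cordon and drain the node, then reimage the underlying VMSS instance so the kubelet re-bootstraps its registry credentials. The VMSS name and instance ID are inferred from the node name, and the node resource group follows the default MC_ naming convention, so both are assumptions.

```bash
# Move workloads off the affected node first
kubectl cordon aks-agentpool-22692403-vmss000000
kubectl drain aks-agentpool-22692403-vmss000000 --ignore-daemonsets --delete-emptydir-data

# Reimage the underlying VMSS instance (names/IDs inferred from the node name; MC_ resource group assumed)
az vmss reimage \
  --resource-group MC_apip-dev-rg-uaenorth_apip-dev-aks-uaenorth_uaenorth \
  --name aks-agentpool-22692403-vmss \
  --instance-ids 0

# Return the node to service once it reports Ready
kubectl uncordon aks-agentpool-22692403-vmss000000
```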