CoreDNS error in AKS - can't reach applicaiton

Joe White 0 Reputation points
2023-08-21T15:08:55.5266667+00:00

We have a Kubernetes cluster that suddenly stopped working. The Resource Health tab says the information below. I have rebooted coreDNS but it didn't resolve the issue. The application is completely unreachable at this time. Anything to try?

Degraded (Customer Initiated) : CoreDNS became unavailable (Customer Initiated)


At Monday, August 21, 2023 at 3:43:33 AM MDT, the Azure monitoring system received the following information regarding your Azure Kubernetes Service (AKS):

We've temporarily lost connection to Core DNS. If the AKS cluster is running as expected, the status of the resource will change to Available after a few minutes.

Recommended Steps

  • Check back here for status updates. AKS is trying to recover this issue
  • For coreDNS Troubleshooting please check the CoreDNS Troubleshooting documentation.
  • Problems with CoreDNS may be a symptom of wider issues with you cluster, visit Diagnose and solve problems to further diagnose and troubleshoot the issue.
  • If this error persists and you are having issues interacting with your cluster contact support
Azure Kubernetes Service (AKS)
Azure Kubernetes Service (AKS)
An Azure service that provides serverless Kubernetes, an integrated continuous integration and continuous delivery experience, and enterprise-grade security and governance.
2,101 questions
{count} votes

2 answers

Sort by: Most helpful
  1. Joe White 0 Reputation points
    2023-08-21T19:20:45.53+00:00

    We resolved this but were forced to upgrade our cluster to get it resolved. I'm not entirely sure what happened to the DNS pods. They were healthy. We then had to rebuild our ingress to match the new spec.


  2. Adeboye Famurewa 0 Reputation points Microsoft Employee
    2024-04-09T13:22:04.8+00:00

    Hello @joe white If you see the Error message below on the portal.

    • We've temporarily lost connection to Core DNS. If the AKS cluster is running as expected, the status of the resource will change to Available after a few minutes.

    User's image

    • Then, you do not need to do anything as the Coredns will resolve itself, just give it something for it to change it status, whether it is in a CrashLoopBackOff.

    Since it is not what you manage.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.