Azure Kubernetes Service - Inconsistent Private DNS Zone Resolution

devopsfj 176 Reputation points
2024-01-29T13:43:07.78+00:00

I am seeing very intermittent issues with AKS and Private DNS Zones/Private Endpoints. I have AKS Clusters which connects to multiple PaaS services (Service Bus & Azure Cosmos MongoDB) which have Private Endpoints, I keep seeing at complete random times, days apart from each other, DNS resolution fails causing our system to error and even crash, as I say, 95% of the time, everything operates as expected. For example, I have two AKS clusters (Prod & UAT), which both site in completely separate Virtual Networks with no peering's or relation to each other, both of these Virtual Network use the default DNS settings, however both have a Virtual Network link to Private DNS zone privatelink.servicebus.windows.net which sits in our Hub. At exactly the same time, both clusters lost connection to their respective Service Bus (Prod & UAT). This to me suggests there is not an issue with the PaaS resources but something is breaking DNS resolution temporarily. We have the exact same issue with a Database the other day which cause a P1 as the system did not recover itself. This is causing major issues for us and I am losing trust with private endpoints and considering switching back to public access with only allowing certain IP addresses, less safe but more reliable. Has anyone else experience this and what can I do to help, I have had no fix from Microsoft Support yet!

Azure DNS
Azure DNS
An Azure service that enables hosting Domain Name System (DNS) domains in Azure.
603 questions
Azure Kubernetes Service (AKS)
Azure Kubernetes Service (AKS)
An Azure service that provides serverless Kubernetes, an integrated continuous integration and continuous delivery experience, and enterprise-grade security and governance.
1,887 questions
{count} votes