Volume Snapshots Fail on AKS with 502 and/or timeout or HTML error page
Volume snapshots are failing repeatedly, sometimes with 502 or HTML-based error pages from Azure, here's an example: Warning Error 6m24s cloudnative-pg-backup snapshot backup failed: Failed to create snapshot: failed to take snapshot of…
Unable to deploy a AKS cluster in Azure
I'm trying to create a AKS cluster and the "Cluster configuration" is "Production Economy" but the deployment fails with this error message.. I also tried the default "Standard_D8ds_v4" for the nodes and tried changing the…
AKS performing automatic upgrades when turned off
Hello, We want to do manual upgrades for our AKS environment and have disabled the automatic upgrade scheduler. We had manually upgraded to 1.30.6 and it seems Azure has force upgraded to 1.30.9. Why is Azure performing the automatic upgrade if the…
AKS auto upgrading on its own when it should not be
Hello, We disabled the auto upgrade schedule for Kubernetes, however the system seems to have performed an upgrade regardless of the setting. How come the upgrade still happened? We manually upgraded from 1.29 to 1.30.6 and now we are seeing 1.30.9. The…
AKS/Docker: I am processing PDF's as large as 5,000 to 10,000 pages for OCR. I'm trying to run the pages concurrently in groups of 50 but running into a situtation where the memory goes to 10GB for a 200MB file.
I am processing PDF's as large as 5,000 to 10,000 pages for OCR with 200 MB files. I'm trying to run the pages concurrently in groups of 50 but running into a situtation where the memory goes to 10GB for a 200MB file. I've tried cleaning up the…
On AKS Backend Pool is empty
I have deployed application gateway as my Ingress, backend pool doesn't have any targets, its coming empty. My pods are getting IP's and accessible the application within the cluster. Backend pool is not getting updated.
Azure Kubernetes Service: Cluster Stuck in Stopping - Abort Operation Stuck
My AKS cluster was affected by a billing suspension. After payment my subscription was re-enabled and I attempted to stop and restart the cluster, but it is now stuck in 'Stopping' state. Stopping Operation Start Time: 2025-02-14T16:02:47Z Abort…
How can I use istio mtls stric mode with aks application routing add-on
Hi, I'm using "the application routing add-on" in my aks cluster and I installed istio add-on. I added sidecar injection to istio member pods. I want to use mtls mode in strict. Could you please advice me the best way of using the mtls in…
The VM size of standard_d4s_v3 is only allowed in zones [2] in your subscription in location 'westus2'.
Hello guys, I tried to create an AKS in westus2 location with all zones([1, 2, 3]), but it reported below error: Kubernetes Cluster Name: "azure-westus2-test-24dtt-snc"): performing CreateOrUpdate: unexpected status 400 (400 Bad Request) with…
Scheduled maintenance for Azure VMs
Dear support, we received a notification regarding scheduled maintenance for Azure Virtual Machines in set of regions. Currently our workloads live in the mentioned regions. We have several AKS clusters based on node pools built on Virtual Machine Scale…
What is the current market share comparison between Kubernetes and Azure Container Apps?
I am conducting research for my thesis on container orchestration platforms, specifically comparing Kubernetes with Azure Container Apps in terms of market share, adoption rates, and industry trends. Kubernetes has long been the dominant platform for…
AKS cluster cannot create succeed in own vnet/subnet
ProvisioningState/failed/VMExtensionProvisioningError
Recieved this message "This cluster is in a failed state. If you didn't do an operation, AKS may resolve the provisioning status automatically ..." but monitored for a few hours and pods are good and otherwise solution is good.
CONTEXT: -While using our current AKS solution we discover that some of the pods where down and the age of the nodes correlated with an update initiated by Azure Container Services. Or at least that was the only log that correlated with the age on the…
Cannot reach API-server of private AKS cluster
Hi! I am experiencing timeouts on kubectl commands towards my private AKS cluster. The Azure portal states that the cluster is healthy and that the nodepool is running with a single target node. The are however 0 ready nodes but the "Diagnose and…
AKS is not able to pull images from ACR
I am unable to get our AKS to be able to pull images from our ACR. I get the following error on the pod Failed to pull image "**REMOVE**.azurecr.io/**REMOVED**:dev": [rpc error: code = NotFound desc = failed to pull and unpack image…
[AKS] A pod running on a Windows container in AKS is frequently restarting without a clear cause.
Issue Description: A pod running on a Windows container in AKS is frequently restarting without a clear cause. Error Details: The last recorded state shows that the pod terminated with exit code -1073741819 (0xC0000005) The pod started at Tue, Feb 4,…
permissions for New-AzAKSCluster
Hello, I have 2 Azure environments (different Tennant / Subscription). In the first Tennant the script is running New-AzKeyVault ... New-AzAKSCluster ... without any problems... In the 2nd environment, New-AzKeyvault ... runs perfect and creates the…
Volume Snapshots Fail on AKS with 502 and/or timeout or HTML error page
Volume snapshots are failing repeatedly, sometimes with 502 or HTML-based error pages from Azure, here's an example: Warning Error 6m24s cloudnative-pg-backup snapshot backup failed: Failed to create snapshot: failed to take snapshot of…
Unable to delete AKS cluster
Dear Support, The deletion of the AKS cluster aks_test remains in a pending state and has no effect. Additionally, attempting to remove the resource group RisorseDiTest also fails. We have been stuck since this morning. Could you please investigate and…
How to integrate private registry with AKS
I need attache a private container registry (Nexus) outside of Azure to my AKS. The only documentation I was able to find was…