Azure Kubernetes Service cost analysis

An Azure Kubernetes Service (AKS) cluster is reliant on Azure resources like virtual machines, virtual disks, load-balancers, and public IP addresses. Multiple applications can use these resources, which might be maintained by different teams within your organization. Resource consumption patterns for those applications are often variable, so their contribution towards the total cluster resource cost can also vary. Some applications can also have footprints across multiple clusters, which can pose a challenge when performing cost attribution and cost management.

Previously, Microsoft Cost Management (MCM) aggregated cluster resource consumption under the cluster resource group. You could use MCM to analyze costs, but there were several challenges:

  • There was no Azure-native capability to display cluster resource usage at a level more granular than a cluster. There was no breakdown into discrete categories such as compute (including CPU cores and memory), storage, and networking.

  • There was no Azure-native functionality to distinguish between types of costs, for example between individual application costs and shared costs. MCM reported the cost of resources, but there was no insight into how much of the resource cost was used to run individual applications, how much was reserved for system processes required by the cluster, or what were the idle costs associated with the cluster.

  • There was no Azure-native mechanism to analyze costs across multiple clusters in the same subscription scope.

As a result, you might have used third-party solutions to gather and analyze resource consumption and costs by Kubernetes-specific levels of granularity, such as by namespace or pod. Third-party solutions, however, require effort to deploy, fine-tune, and maintain for each AKS cluster. In some cases, you even need to pay for advanced features, increasing the cluster's total cost of ownership.

To address this challenge, AKS has integrated with MCM to offer detailed cost drill-down scoped to Kubernetes constructs, such as cluster and namespace, in addition to Azure Compute, Network, and Storage categories.

The AKS cost analysis addon is built on top of OpenCost, an open-source Cloud Native Computing Foundation Sandbox project for usage data collection. The cost analysis is reconciled with your Azure invoice data. Post-processed data is visible directly in the MCM Cost Analysis portal experience.

Prerequisites and limitations

  • Your cluster must be either Standard or Premium tier, not the Free tier.

  • To view cost analysis information, you must have one of the following roles on the subscription hosting the cluster: Owner, Contributor, Reader, Cost management contributor, or Cost management reader.

  • Once you have enabled cost analysis, you can't downgrade your cluster to the Free tier without first disabling cost analysis.

  • Your cluster must be deployed with a Microsoft Entra Workload ID configured.

  • Kubernetes cost views are available only for the following Microsoft Azure Offer types. For more information on offer types, see Supported Microsoft Azure offers.

    • Enterprise Agreement
    • Microsoft Customer Agreement
  • Access to the Azure API including Azure Resource Manager (ARM) API. For a list of fully qualified domain names (FQDNs) required, see AKS Cost Analysis required FQDN.

  • Virtual nodes aren't supported at this time.

  • AKS Automatic is not supported at this time.

  • If using the Azure CLI, you must have version 2.61.0 or later installed.

Enable cost analysis on your AKS cluster

You can enable the cost analysis with the --enable-cost-analysis flag during one of the following operations:

  • Create a Standard or Premium tier AKS cluster.

  • Update an AKS cluster that is already in Standard or Premium tier.

  • Upgrade a Free cluster to Standard or Premium.

  • Upgrade a Standard cluster to Premium.

  • Downgrade a Premium cluster to Standard tier.

The following example creates a new AKS cluster in the Standard tier with cost analysis enabled:

az aks create --resource-group <resource-group> --name <cluster-name> --location <location> --enable-managed-identity --generate-ssh-keys --tier standard --enable-cost-analysis

The following example updates an existing AKS cluster in the Standard tier to enable cost analysis:

az aks update --resource-group <resource-group> --name <cluster-name> --enable-cost-analysis

Warning

The AKS cost analysis add-on Memory usage is dependent on the number of containers deployed. Memory consumption can be roughly approximated by 200 MB + 0.5 MB per container. The current memory limit is set to 4 GB which will support approximately 7000 containers per cluster. These estimates could be more or less depending on various factors and are subject to change.

If you are experiencing issues such as the add-on pod getting OOMKilled or stuck in a Pending state, refer to the AKS cost analysis add-on issues troubleshooting guide.

Disable cost analysis

You can disable cost analysis at any time using az aks update.

az aks update --name myAKSCluster --resource-group myResourceGroup --disable-cost-analysis

Note

If you intend to downgrade your cluster from the Standard or Premium tiers to the Free tier while cost analysis is enabled, you must first explicitly disable cost analysis.

View the cost data

You can view cost allocation data in the Azure portal. To learn more about how to navigate the cost analysis UI view, see the Cost Management documentation.

Cost definitions

In the Kubernetes namespaces and assets views you'll see the following charges:

  • Idle charges: Represents the cost of available resource capacity that wasn't used by any workloads.
  • Service charges: Represents the charges associated with the service like Uptime SLA, Microsoft Defender for Containers etc.
  • System charges: Represents the cost of capacity reserved by AKS on each node to run system processes required by the cluster, including the kubelet and container runtime. Learn more.
  • Unallocated charges: Represents the cost of resources that couldn't be allocated to namespaces.

Note

It might take up to one day for data to finalize. After 24 hours, any fluctuations in costs for the previous day will have stabilized.

Troubleshooting

See the following guide to troubleshoot AKS cost analysis add-on issues.

Learn more

Visibility is one element of cost management. Refer to Optimize Costs in Azure Kubernetes Service (AKS) for other best practices on how to gain control over your kubernetes cost.