The alert is triggered when Prometheus detects errors or unavailability in aggregated API endpoints, averaged over a specific period (e.g., 10 minutes).
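To illustrate why a short blip can trip an alert that averages over a window, here is a minimal sketch of that kind of calculation. It assumes the rule resembles the `KubeAggregatedAPIDown` alert from kubernetes-mixin, which is an assumption since the exact rule in your cluster is not shown here. The function names and the 85% threshold are illustrative, not taken from your configuration.

```python
def availability_percent(unavailable_samples):
    """Return availability in percent over a window of scrapes.

    unavailable_samples: one value per scrape, 1 if the aggregated API
    was reported unavailable at that scrape, 0 otherwise.
    """
    if not unavailable_samples:
        raise ValueError("need at least one sample")
    avg_unavailable = sum(unavailable_samples) / len(unavailable_samples)
    return (1 - avg_unavailable) * 100


def should_fire(unavailable_samples, threshold_percent=85.0):
    """Hypothetical alert condition: fire when availability drops below the threshold.

    The default threshold is an assumption for illustration only.
    """
    return availability_percent(unavailable_samples) < threshold_percent
```

With a rule like this, a few failed scrapes inside the window are enough to push the average below the threshold, even if the API is healthy again by the time you look.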
In your case, the API v1beta1.metrics.k8s.io is provided by the Kubernetes metrics-server, which is crucial for collecting resource metrics (CPU/memory) across the cluster. If the metrics-server pod is restarting or under-resourced, it can temporarily become unavailable, triggering this alert. However, you've noted that this is not happening in your clusters.
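If you want to confirm the state of the aggregated API the next time the alert fires, one option is to read the `Available` condition on the APIService object. The sketch below assumes `kubectl` is installed and pointed at the affected cluster. The helper names `fetch_apiservice` and `available_condition` are hypothetical, chosen only for this example.

```python
import json
import subprocess


def fetch_apiservice(name="v1beta1.metrics.k8s.io"):
    """Return the APIService object as a dict, using `kubectl get apiservice`.

    Assumes kubectl is on PATH and the current context is the affected cluster.
    """
    result = subprocess.run(
        ["kubectl", "get", "apiservice", name, "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)


def available_condition(apiservice):
    """Return (is_available, reason, message) from the 'Available' condition."""
    conditions = apiservice.get("status", {}).get("conditions", [])
    for cond in conditions:
        if cond.get("type") == "Available":
            is_available = cond.get("status") == "True"
            return is_available, cond.get("reason", ""), cond.get("message", "")
    # No 'Available' condition reported; treat as not available.
    return False, "NoAvailableCondition", "APIService has no Available condition"
```

Calling `available_condition(fetch_apiservice())` while the alert is active should show whether the API server itself considers the metrics API unavailable, along with the reason it reports.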
If the alert resolves on its own and there are no visible impacts on workload performance or metrics collection, it is generally considered a transient or benign issue. This is especially true in managed environments like AKS, where the control plane components are not under direct user control. For background, see the Kubernetes documentation on the API aggregation layer (apiserver-aggregation).
If you found this information helpful, please click "Upvote" on the post to let us know.
If you have any further questions, feel free to ask; we are happy to assist.
Thank You.