About the pod scale issue 【autoscale_target_utilization】

Question

About the pod scale issue 【autoscale_target_utilization】

Yongchao Liu (Neusoft America Inc) 191 Microsoft External Staff

We deployed pod that automatically scale

autoscale_target_utilization: 55

https://github.com/Azure/aml-deploy/blob/master/README.md#aks-deployment
autoscale_target_utilizationint: [1, 100]70The target utilization (in percent out of 100) the autoscaler should attempt to maintain for this Webservice.

May I ask how this parameter is calculated?
Originally, it was thought that when the node cpu reached 55%, it would trigger automatic scale,

but later it was found that the node cpu would also scale if it did not reach 55%

Our pod only has request without limit

Is it based on request? Is it based on request 55%?
User's image

In this example, the cpu Max 209 and then there's a pod that scales
thanks

vipullag-MSFT 26,492 Reputation points Moderator

2023-04-10T06:47:36.51+00:00

Hello Yongchao Liu (Neusoft America Inc) Just checking in to see if you got a chance to see previous response. If the suggested response helped you resolve your issue, please 'Accept as answer', so that it can help others in the community looking for help on similar topics.

Accepted answer

0 additional answers

Your answer

vipullag-MSFT 26,492 Reputation points Moderator

2023-04-10T06:47:36.51+00:00

Hello Yongchao Liu (Neusoft America Inc) Just checking in to see if you got a chance to see previous response. If the suggested response helped you resolve your issue, please 'Accept as answer', so that it can help others in the community looking for help on similar topics.

Answer 1

vipullag-MSFT 26,492 Moderator

Hello Yongchao Liu (Neusoft America Inc)

Welcome to Microsoft Q&A Platform, thanks for posting your query here.

Based on your ask looks like your query is about the behavior of Horizontal Pod Autoscalers.

If so you can find the detailed explanation here: https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/

Basically HPAs measure utilization against pod requests, in order to decide if scale-in or scale-out is needed. Please check the document for more details.

Hope this helps.

Share via

About the pod scale issue 【autoscale_target_utilization】

0 additional answers

Your answer