Hello Yongchao Liu (Neusoft America Inc)
Welcome to Microsoft Q&A Platform, thanks for posting your query here.
Based on your ask looks like your query is about the behavior of Horizontal Pod Autoscalers.
If so you can find the detailed explanation here: https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/
Basically HPAs measure utilization against pod requests, in order to decide if scale-in or scale-out is needed. Please check the document for more details.
Hope this helps.