Hi Manakkal. Subash,
Thank you for replying back with further information.
As there are other critical workloads running on the cluster, removing CPU limits entirely could jeopardize other workloads during contention because Kubernetes allows unlimited CPU usage when limits are not set.
Instead of removing CPU limits, try adjusting them properly by setting a slightly higher limit. This ensures consistent CPU allocation without throttling. This also allows occasional bursts while still preventing excessive resource consumption.
Checking the throttling rate of your pods:
Just login to the pod and run cat /sys/fs/cgroup/cpu/cpu.stat
.
-
nr_periods
— Total schedule period -
nr_throttled
— Total throttled period out of nr_periods -
throttled_time
— Total throttled time in ns Ifnr_throttled
is high, it means your pod is hitting the CPU limit frequently. You can try considering a Larger Node (D16s_v3) → More room to distribute workloads. https://kubernetes.io/docs/concepts/configuration/manage-resources-containers/#troubleshooting
Please feel free to tag me in the comments for further assistance.