HPC Pack 2019 AutoGrowShrink Parameters

Sven 21 Reputation points
2022-06-30T09:03:45.95+00:00

Hi,
our environment consists of about 30 on-prem compute nodes and about 30 ready set up (unapproved) Azure IaaS Nodes.
Those Azure nodes can easily be started by right-clicking on the resource -> Start and they start their work immediately.
All Azure nodes have 5 subscribed sockets and 10 cores. All jobs are submitted on socket basis.
But to automize it a bit, I'm struggling with the AutoGrowShrink parameters: shrinking is working, but not growing.
My current parameterization is:
PS C:\Users... [06/30/2022 08:38:17]> Get-HpcClusterProperty -scheduler:lsas... -autogrowshrink

Name Value
---- -----
EnableGrowShrink True
TasksPerResourceUnit 5
GrowThreshold 50
GrowInterval 2
GrowOnlyFullySatisfied False
IdleNodePool 0
GrowTimeout 10
ShrinkInterval 3
ShrinkIdleTimes 3
ExtraNodesGrowRatio 0
GrowByMin False
SoaJobGrowThreshold 50000
SoaRequestsPerCore 20000
SoaJobTimeToComplete 0
ExcludeNodeGroups GPUNodes

I understood to be able to auto grow one node at a time, if 50 tasks at minimum are in queued state along 20 minutes.
as said, somehow either misunderstanding or misconfiguration or bug :)...

Azure Virtual Machines
Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.
7,585 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Prrudram-MSFT 23,211 Reputation points
    2022-06-30T19:02:23.727+00:00

    Hello @Sven ,

    Thank you for reaching out to the Microsoft Q&A platform. Happy to answer your question.

    HPC auto grow shrink cannot auto add compute nodes to cluster, you need add the nodes to your HPC IaaS cluster first, then if you enable auto grow shrink, it can stop the nodes when no job, and start the nodes when new job is coming.

    An explicit note added in the document about the workflow of autogrowshrink in HPC here hpcpack-auto-grow-shrink
    216661-image.png

    Hope this helps!
    --please don't forget to upvote and accept as answer if the reply is helpful--