Is it possible (sensible?) to run Docker containers on Azure CycleCloud using Slurm?
I have been successfully running Azure CycleCloud & Slurm scheduler for running our HPC (CFD & CAE) Analysis Solving jobs from a /shared/apps loadpoint in a regular manner. I demo'd our HPC Solving capabilities to our Climate modelling team and…
Azure CycleCloud Web UI stops responding every few days
Hi, I am running a few Slurm Clusters on Azure CycleCloud 8.6, using the fully updated CentOS 7.9 (I know it drops off support soon) platform image from the Marketplace. Every few days, the CycleCloud Web UI stops responding and I seem to have to restart…
DeletingCloudOnlyObjectNotAllowed
Hello, I received this error "DeletingCloudOnlyObjectNotAllowed" multiple times a day. I'm not sure how to resolve it. I've been looking all over the places but still can't find the solution. Our on-premise Active Directory syncs with Azure…
How to set permissions on a cluster node's scratch disk filesystem so that users can write to it?
Hi, I have created a custom cluster that builds a nodearray with an attached local disk and mounts it to the node successfully on node startup and formats it (it is not persistent). How do I now set permissions on the /scratch filesystem - so that my…
Problem getting GPU solving to work with our Azure CycleCloud / Slurm HPC cluster System
I am using the Azure CycleCloud 8.4 Marketplace image and it is fully updated, along with Slurm version 22.05.8-1. I have configured a GPU Enabled Slurm Partition consisting of some NC24sv3 VMs (which have 4x Nvidia Tesla V100 GPUS in each), but the…
How to preconfigure new users using Active Directory at Cyclecloud
I configured the Azure Active Directory Domain Services as the Authentication method on Cyclecloud. When a new user logs into the server a new user is created without permissions. Can this process be configured somehow to add permissions to the new user…
CycleCloud/Cluster Configuration/Cluster Operations
600 cores (120 available) when I tried to add node I got this message: "Regional quota exceeded: Cannot add any more nodes."
Regional quota exceeded: Cannot add any more nodes.
Quota limit: 600 cores, 480 (4 nodes) already used and running, however to add an additional node (5th node) I got this message in CycleCloud GUI: Regional quota exceeded: Cannot add any more nodes. Also, I increased the quota in my subscription (Portal)…
Azure ML - Notebook - Jupyter Kernel Error - No Kernel connection
In ML Studio, when I create a notebook the top of my screen says "Jupyter kernel error" in red. I have a compute instance running (it's green), but it also says "No Kernel connected". To correct this matter, can you please…
Can CycleCloud create VM scale sets with multiple network interfaces?
The tutorials and public CycleCloud examples on github only include entries for [[[network-interface eth0]]]. Adding a configuration section to the template for [[[network-interface eth1]]] does not cause a second network interface to be added to the…
What if the attribute is blank? is that True or False when sync'd to the cloud?
Sorry wrong post. Please delete
When distributing jobs using HPC Pack 2019, is there a criterion for selecting nodes in HPC Pack?
Hi team. We were using 2 Head Nodes configured as HA and 5 Compute Nodes. Recently we add 5 more Compute Nodes. The curious thing here is that the task is first assigned to the added Compute node. For Example We use job template set to unit type is…
Azure CycleCloud - terrible HPC CFD performance and scaling vs on-prem benchmark?
Hi, I have setup a PoC Azure CycleCloud Slurm Cluster to evaluate cost & performance vs on-prem. Our nearest comparison cluster on-prem has an older CPU but slightly faster clockspeed and slower infiniband performance - so they should be in…
Detected corrupt RPM database, rebuilding
With a custom image I get a "Detected corrupted RPM database, rebuilding..." error when starting up a node. For the custom image I use OpenLogic HPC Centos 7.7 on which I ran yum update, that’s all. I tried to use the custom image prior…
Azure CycleCloud Slurm template VMs - some OS's don't startup sucessfully
Hi, If I use the CentOS 7 OS choice for HPC VMs then they start OK, but I have also tried AlmaLinux and SLES15 HPC - and neither of these work. The error with both is (and I have left it waiting for some time now). Otherwise, most of the other…
CycleCloud Slurm Cluster cannot seem to communicate with the nodes
This is my first stab at setting up Azure HPC using CycleCloud and Slurm, so forgive me for stupid mistakes... I have built a simple (default) Slurm cluster using CycleCloud and the nodes start/stop OK, but when I run a simple (hostname) job it just…
How to modify the CycleCloud default Slurm template to allow spot VMs for HPC Clusters?
The default Azure CycleCloud Slurm template only allows Azure Spot VMs to be configured for HTC partitions. I would like to know how to modify the default slurm template to allow the selection of Spot VMs for HPC Partitions? Thanks for any help…
Why is the subscription disabled?
I recently signed up. I wanted to check how Microsoft Azure works. My subscription was suddenly disabled. Even without warning. I think it was unacceptable
Am I stupid or is the CycleCloud QuickStart completely wrong?
Hi, CycleCloud looks fantastic, but I've been going out of my mind trying to follow the seemingly simple instructions for installing it on my Azure subscription. Either there seem to be some gross deficiencies in its documentation, or I am missing…