An Azure machine learning service for building and deploying models.
Hello Ferrero Alberto,
Thank you for sharing the details. I understand you’re deploying an LLM on an Azure Machine Learning managed online endpoint using the Standard_NC40ads_H100_v5 SKU, but you only see ~128 GB disk available, even though the VM-size documentation lists 3576 GiB local storage.
Why you see this difference
- The 3576 GiB shown for Standard_NC40ads_H100_v5 is the VM’s local temporary (host) storage when you deploy that SKU as an IaaS VM. This is documented under Local (temp) storage for the NCads H100 v5 series.
Link - (NCads_H100_v5 size series - Azure Virtual Machines) - In Azure ML managed online endpoints, the model is hosted inside a managed containerized environment, and the service does not expose the full VM local NVMe/temp disk to your container.
**Can the disk be increased or can an external volume be mounted?
** For managed online endpoints, there is no supported option today to increase that local disk size or attach/mount an additional Azure managed disk into the managed endpoint container environment.
Recommended options (supported paths)
- If your model artifacts/weights fit within the ~128 GB limit:
- Keep large artifacts in Azure Storage (e.g., Blob) and download required files at container startup, ensuring the total footprint on local disk remains under the limit. (General managed online endpoint concept:
Link - https://learn.microsoft.com/en-us/azure/machine-learning/concept-endpoints-online?view=azureml-api-2
- Keep large artifacts in Azure Storage (e.g., Blob) and download required files at container startup, ensuring the total footprint on local disk remains under the limit. (General managed online endpoint concept:
- If your model weights are larger than ~128 GB (or you need persistent >128 GB storage):
- A managed online endpoint is not the right hosting option for that storage requirement. Use one of the following instead:
- Kubernetes online endpoint (AKS) so you control the cluster and can use persistent storage patterns, or
- Deploy directly on an H100 VM / VMSS where you can attach/size disks as needed and leverage the SKU’s capabilities. (Endpoint type choices,
VM local storage spec for the SKU:
Link - https://learn.microsoft.com/en-us/azure/virtual-machines/sizes/gpu-accelerated/ncadsh100v5-series?tabs=sizebasic
Thanks,
Manish