Share via

Extra disk space in Azure Machine Learning Online Endpoint

Ferrero Alberto 20 Reputation points
2026-02-10T09:21:48.73+00:00

I need to deploy an LLM via AML Managed Online Endpoint but the machine that i want to use (standard_nc40ads_h100_v5) apparently has only 128 GB of disk space
User's image

even if the documentation says that it should have 3576 GiB https://learn.microsoft.com/en-us/azure/virtual-machines/ncads-h100-v5

How can i increase the disk space, maybe via mounting some external volume?

Azure Machine Learning
0 comments No comments
{count} votes

Answer accepted by question author
  1. Manish Deshpande 4,225 Reputation points Microsoft External Staff Moderator
    2026-02-13T15:36:57.9266667+00:00

    Hello Ferrero Alberto,

    Thank you for sharing the details. I understand you’re deploying an LLM on an Azure Machine Learning managed online endpoint using the Standard_NC40ads_H100_v5 SKU, but you only see ~128 GB disk available, even though the VM-size documentation lists 3576 GiB local storage.

    Why you see this difference

    • The 3576 GiB shown for Standard_NC40ads_H100_v5 is the VM’s local temporary (host) storage when you deploy that SKU as an IaaS VM. This is documented under Local (temp) storage for the NCads H100 v5 series.
      Link - (NCads_H100_v5 size series - Azure Virtual Machines)
    • In Azure ML managed online endpoints, the model is hosted inside a managed containerized environment, and the service does not expose the full VM local NVMe/temp disk to your container.

    **Can the disk be increased or can an external volume be mounted?

    ** For managed online endpoints, there is no supported option today to increase that local disk size or attach/mount an additional Azure managed disk into the managed endpoint container environment.

    Recommended options (supported paths)

    1. If your model artifacts/weights fit within the ~128 GB limit:
    2. If your model weights are larger than ~128 GB (or you need persistent >128 GB storage):

    Thanks,
    Manish


1 additional answer

Sort by: Most helpful
  1. Alex Burlachenko 19,530 Reputation points Volunteer Moderator
    2026-02-10T11:05:51.12+00:00

    hi Ferrero Alberto,

    In a managed online endpoint in Azure Machine Learning you always get only 128 GB of disk. That limit is fixed. The 3.5 TB shown in the VM documentation is local NVMe on the physical host, but the managed service does not expose it to your container. You cannot increase the disk size and you cannot mount an extra volume.

    If your model is larger than 128 GB, store the weights in Azure Blob Storage and download them when the container starts. This is the standard and supported approach. If the model is very large and you do not want to pay the cold start cost, then a managed online endpoint is the wrong tool. In that case use a Kubernetes online endpoint with a persistent volume, or run the model directly on a regular H100 VM where you control the disks.

    lmk if its works for you,

    rgds,

    Alex


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.