NCads H100 v5-series

Applies to: ✔️ Linux VMs ✔️ Windows VMs ✔️ Flexible scale sets ✔️ Uniform scale sets

The NCads H100 v5-series virtual machines (VMs) are a new addition to the Azure GPU family, intended for real-world Azure Applied AI training and batch inference workloads. They are powered by NVIDIA H100 NVL GPUs and 4th-generation AMD EPYC™ Genoa processors. The VMs feature up to 2 NVIDIA H100 NVL GPUs with 94 GB of memory each, up to 96 non-multithreaded AMD EPYC Genoa processor cores, and 640 GiB of system memory. These VMs are well suited to Applied AI workloads such as:

  • GPU-accelerated analytics and databases
  • Batch inferencing with heavy pre- and post-processing
  • Autonomy model training
  • Oil and gas reservoir simulation
  • Machine learning (ML) development
  • Video processing
  • AI/ML web services

Supported features

To get started with NCads H100 v5 VMs, refer to HPC Workload Configuration and Optimization for steps including driver and network configuration.

Because of their larger GPU memory I/O footprint, NCads H100 v5 VMs require Generation 2 VMs and marketplace images. Follow the instructions in Azure HPC images for configuration.

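Size                      | vCPU | Memory (GiB) | Temp Disk NVMe (GiB) ² | GPU ¹ | GPU Memory (GiB) | Max data disks | Max uncached disk throughput (IOPS / MBps) | Max NICs / network bandwidth (MBps)
--------------------------|------|--------------|------------------------|-------|------------------|----------------|--------------------------------------------|------------------------------------
Standard_NC40ads_H100_v5  | 40   | 320          | 3576                   | 1     | 94               | 8              | 100000 / 3000                              | 2 / 40,000
Standard_NC80adis_H100_v5 | 80   | 640          | 7152                   | 2     | 188              | 16             | 240000 / 7000                              | 4 / 80,000

¹ 1 GPU = one H100 card.
² Local NVMe disks are ephemeral. Data on these disks is lost if you stop/deallocate your VM. Local NVMe disks aren't encrypted by Azure Storage encryption, even if you enable encryption at host.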
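As an illustration of how the size table can guide sizing decisions, the following minimal Python sketch encodes the two rows above and picks the smallest size that satisfies a set of requirements. The `VmSize` class and the `smallest_fit` helper are hypothetical names introduced for this example only; they are not part of any Azure SDK, and the figures are copied from the table rather than queried from Azure.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VmSize:
    """One row of the size table (hypothetical helper type)."""
    name: str
    vcpus: int
    memory_gib: int
    nvme_gib: int
    gpus: int
    gpu_memory_gib: int
    max_data_disks: int


# Values copied by hand from the size table in this article.
SIZES = [
    VmSize("Standard_NC40ads_H100_v5", 40, 320, 3576, 1, 94, 8),
    VmSize("Standard_NC80adis_H100_v5", 80, 640, 7152, 2, 188, 16),
]


def smallest_fit(
    min_gpu_memory_gib: int = 0,
    min_memory_gib: int = 0,
    min_vcpus: int = 0,
) -> Optional[VmSize]:
    """Return the size with the fewest vCPUs that meets every minimum,
    or None if no size in the table is large enough."""
    candidates = [
        s for s in SIZES
        if s.gpu_memory_gib >= min_gpu_memory_gib
        and s.memory_gib >= min_memory_gib
        and s.vcpus >= min_vcpus
    ]
    return min(candidates, key=lambda s: s.vcpus, default=None)


if __name__ == "__main__":
    choice = smallest_fit(min_gpu_memory_gib=120, min_memory_gib=256)
    print(choice.name if choice else "No size in this series fits")
```

A requirement that exceeds both rows (for example, more than 188 GiB of total GPU memory) returns `None`, which signals that a different VM series is needed.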

Size table definitions

  • Storage capacity is shown in units of GiB, or 1024^3 bytes. When you compare disks measured in GB (1000^3 bytes) to disks measured in GiB (1024^3 bytes), remember that capacity numbers given in GiB may appear smaller. For example, 1023 GiB = 1098.4 GB.

  • Disk throughput is measured in input/output operations per second (IOPS) and MBps where MBps = 10^6 bytes/sec.

  • Data disks can operate in cached or uncached modes. For cached data disk operation, the host cache mode is set to ReadOnly or ReadWrite. For uncached data disk operation, the host cache mode is set to None.

  • To learn how to get the best storage performance for your VMs, see Virtual machine and disk performance.

  • Expected network bandwidth is the maximum aggregated bandwidth allocated per VM type across all NICs, for all destinations. For more information, see Virtual machine network bandwidth.

    Upper limits aren't guaranteed. Limits offer guidance for selecting the right VM type for the intended application. Actual network performance will depend on several factors including network congestion, application loads, and network settings. For information on optimizing network throughput, see Optimize network throughput for Azure virtual machines. To achieve the expected network performance on Linux or Windows, you may need to select a specific version or optimize your VM. For more information, see Bandwidth/Throughput testing (NTTTCP).
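The GiB-to-GB relationship in the definitions above can be checked with a short calculation. This is a minimal sketch; the function names `gib_to_gb` and `gb_to_gib` are hypothetical and chosen for this example only.

```python
GIB = 1024 ** 3  # bytes in one gibibyte (GiB)
GB = 1000 ** 3   # bytes in one gigabyte (GB)


def gib_to_gb(gib: float) -> float:
    """Convert a capacity in GiB to the equivalent capacity in GB."""
    return gib * GIB / GB


def gb_to_gib(gb: float) -> float:
    """Convert a capacity in GB to the equivalent capacity in GiB."""
    return gb * GB / GIB


if __name__ == "__main__":
    # The example from the definitions: 1023 GiB expressed in GB.
    print(f"1023 GiB = {gib_to_gb(1023):.1f} GB")
    # The local NVMe capacity of Standard_NC40ads_H100_v5 from the size table.
    print(f"3576 GiB = {gib_to_gb(3576):.1f} GB")
```

The same capacity therefore reads about 7.4% larger when stated in GB than in GiB, which is why a GiB figure can look smaller than a vendor's GB figure for an identical disk.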

Other sizes and information

You can use the pricing calculator to estimate your Azure VM costs.

For more information on disk types, see What disk types are available in Azure?

Next step