The ND-GB200-v6 series virtual machine (VM) is a flagship addition to the Azure GPU family, delivering unmatched performance for deep learning training, generative AI, and HPC workloads. These VMs use NVIDIA GB200 Tensor Core GPUs, built on the Blackwell architecture, which offer significant advances in computational power, memory bandwidth, and scalability over previous generations.

Each ND-GB200-v6 VM is powered by two NVIDIA Grace CPUs and four NVIDIA Blackwell GPUs. The GPUs are interconnected via fifth-generation NVLink, providing a total of 4× 1.8 TB/s of NVLink bandwidth per VM. This robust scale-up interconnect enables seamless, high-speed communication between GPUs within the VM. In addition, each VM offers a scale-out backend network with 4× 400 Gb/s NVIDIA Quantum-2 CX7 InfiniBand connections, ensuring high-throughput, low-latency communication when interconnecting multiple VMs. NVIDIA GB200 NVL72 connects up to 72 GPUs per rack, enabling the system to operate as a single computer. This 72-GPU rack-scale system, composed of groups of 18 ND GB200 v6 VMs, delivers up to 1.4 exaFLOPS of FP4 Tensor Core throughput, 13.5 TB of shared high-bandwidth memory, 130 TB/s of cross-sectional NVLink bandwidth, and 28.8 Tb/s of scale-out networking.
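The rack-level aggregates follow directly from the per-VM figures. As a quick sanity-check sketch (assuming 18 VMs × 4 GPUs per rack, as stated above):

```python
# Back-of-the-envelope check of the NVL72 rack-level aggregates,
# derived from the per-VM figures stated in this article.

vms_per_rack = 18            # 18 ND GB200 v6 VMs per NVL72 rack
gpus_per_vm = 4              # 4 Blackwell GPUs per VM
nvlink_tbps_per_gpu = 1.8    # TB/s of NVLink bandwidth per GPU
ib_gbps_per_gpu = 400        # Gb/s of InfiniBand per GPU

gpus_per_rack = vms_per_rack * gpus_per_vm
print(gpus_per_rack)                            # 72 GPUs per rack
print(gpus_per_rack * nvlink_tbps_per_gpu)      # ~130 TB/s cross-sectional NVLink
print(gpus_per_rack * ib_gbps_per_gpu / 1000)   # 28.8 Tb/s scale-out networking
```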
With 128 vCPUs per VM supporting the overall system, the architecture is optimized to distribute workload and memory demands efficiently for AI and scientific applications. This design enables seamless multi-GPU scaling and robust handling of large-scale models. These instances deliver best-in-class performance for AI, ML, and analytics workloads with out-of-the-box support for frameworks such as TensorFlow, PyTorch, JAX, and RAPIDS. The scale-out InfiniBand interconnect is optimized for existing AI and HPC tools built on NVIDIA's NCCL communication library, ensuring efficient distributed computing across large clusters.
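As an illustrative sketch (not a prescribed workflow), a multi-node PyTorch job on these VMs is typically launched with `torchrun`, with NCCL carrying inter-VM traffic over the InfiniBand fabric. The node address, port, and script name below are placeholders:

```shell
# Hypothetical launch of a 2-VM (8-GPU) PyTorch job with torchrun.
# The rendezvous endpoint and train.py are placeholders for illustration.
# Set NCCL_DEBUG=INFO to confirm which transport NCCL selects.
torchrun --nnodes=2 --nproc-per-node=4 \
    --rdzv-backend=c10d --rdzv-endpoint=10.0.0.4:29500 \
    train.py
```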
Host specifications
| Part | Quantity (count, units) | Specs (SKU ID, performance units, etc.) |
|---|---|---|
| Processor | 128 vCPUs | NVIDIA Grace CPU |
| Memory | 900 GB | LPDDR |
| Local Storage | 4 disks | 16 TB NVMe Direct |
| Remote Storage | 16 disks | 80,000 IOPS / 1,200 MBps |
| Network | 1 NIC | 160 Gb/s Ethernet |
| Accelerators | 4 GPUs | NVIDIA Blackwell GPU (192 GB) |
Feature support
Premium Storage: Supported
Premium Storage caching: Supported
Live Migration: Not Supported
Memory Preserving Updates: Not Supported
Generation 2 VMs: Supported
Generation 1 VMs: Not Supported
Accelerated Networking: Supported
Ephemeral OS Disk: Supported
Nested Virtualization: Not Supported
Sizes in series
vCPUs (Qty.) and Memory for each size

| Size Name | vCPUs (Qty.) | Memory (GB) |
|---|---|---|
| Standard_ND128isr_NDR_GB200_v6 | 128 | 900 |
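As a hypothetical provisioning sketch, the size string from the table above can be requested through the Azure CLI. The resource group, VM name, image, and admin user below are placeholders, not recommendations:

```shell
# Hypothetical example: request the ND-GB200-v6 size with the Azure CLI.
# Resource group, VM name, image alias, and admin user are placeholders;
# the --size value comes from the table above.
az vm create \
    --resource-group my-rg \
    --name my-gb200-vm \
    --size Standard_ND128isr_NDR_GB200_v6 \
    --image Ubuntu2204 \
    --admin-username azureuser \
    --generate-ssh-keys
```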
VM Basics resources
Other size information
List of all available sizes: Sizes
Pricing Calculator: Pricing Calculator
Information on Disk Types: Disk Types
Next steps
Take advantage of the latest performance and features available for your workloads by changing the size of a virtual machine.
Utilize Microsoft's in-house designed ARM processors with Azure Cobalt VMs.
Learn how to Monitor Azure virtual machines.