An Azure service that is used to provision Windows and Linux virtual machines.
Hello Peter,
Thank you for sharing the details and the error information.
The issue you’re encountering on the NVads A10 v5 VM with Ubuntu 24.04 is typically related to a driver and kernel compatibility mismatch, which prevents the NVIDIA driver from initializing correctly during boot.
The GRID drivers redistributed by Azure don't work on most non-NV series VMs like NC, NCv2, NCv3, ND, and NDv2-series VMs but works on NCasT4v3 series.
For more information on the specific vGPU and driver branch versions, visit the NVIDIA websit
For NVads A10 v5 VMs, it is important to stay on a supported and up-to-date driver version. NVIDIA driver branches follow a compatibility model where each major version is generally backward compatible only with the immediately previous version (for example, vGPU 17.x supports 16.x, but not older versions). Using unsupported or mismatched driver versions can lead to driver initialization failures.
Please also note the following:
- Ensure Secure Boot is disabled, as it can block driver loading
- GRID drivers may have issues with newer kernels (such as kernel 6.11). In such cases, downgrading to a supported version like 6.8 can help
- Avoid mixing different driver types (CUDA, open drivers, GRID), and use only the Azure-supported GRID driver for this VM series
- GRID Driver 17.3 currently supports only NCasT4_v3 series of VMs. To use this driver, download and install GRID Driver 17.3 manually
Please follow the steps below to reinstall the driver cleanly:
- Remove any manually installed NVIDIA drivers
sudo apt purge "*nvidia*"
sudo apt autoremove
sudo reboot
2.Install Azure-supported NVIDIA GRID driver Download the validated driver:
wget https://download.microsoft.com/download/c5319e92-672e-4067-8d85-ab66a7a64db3/NVIDIA-Linux-x86_64-550.144.06-grid-azure.run
Make it executable and install:
chmod +x NVIDIA-Linux-x86_64-550.144.06-grid-azure.run
sudo ./NVIDIA-Linux-x86_64-550.144.06-grid-azure.run --silent
Check the CPU detection and reboot the VM.
The below document gives clear information and details on how to install the GRID Drivers!
https://learn.microsoft.com/en-us/azure/virtual-machines/linux/n-series-driver-setup
If the issue persists on Ubuntu 24.04, we recommend testing with Ubuntu 22.04 LTS, which is currently more stable and widely validated for this VM series.
Hope this helps! Please let me know if you have any queries