Hi,
The VM was created with NvidiaGpuDriverLinux, and it has the NVIDIA driver and CUDA toolkit running. We have tried setting CUDA_LAUNCH_BLOCKING=1 and rebooting the machine, with no success.
nvidia-smi
Wed Mar 12 10:18:20 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.144.03 Driver Version: 550.144.03 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
How can we check whether this is a hardware problem? Should we recreate the VM?
Regards,
Fadi