Nvidia Graphics card disconnects from Ubuntu OS after restart

Simranjeet Singh 6 Reputation points
2021-10-11T11:36:30.843+00:00

Hi,

I deployed a VM image from the Marketplace with PyTorch and CUDA preinstalled on Ubuntu. On first setup everything works fine: the NVIDIA GPU is detected from within the torch package and my code runs.
However, when I restart the VM, the connection between the OS and the GPU seems to break and the VM can no longer detect the graphics card. The only solution so far is to redeploy the VM and start from scratch.
Has anyone faced a similar issue, or can anyone help me solve this problem?

Thanks

Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.

1 answer

  1. kobulloc-MSFT 23,646 Reputation points Microsoft Employee
    2021-10-13T01:10:33.637+00:00

    Hello, @Simranjeet Singh !

    I haven't been able to reproduce the issue you are describing. This is the setup I used:

    • Image: NVIDIA GPU-Optimized PyTorch Image - v21.06.0 - Gen2 (Azure Marketplace link)
    • VM SKU: Standard_NV12s_v3

    After restarting the VM, I'm still able to see the GPU:

    140021-image.png

    I would try using the same image and see if you still encounter this issue.
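
    To compare the state before and after a restart, a small check along these lines may help. It is only a sketch and was not part of the test above: the function name gpu_status is made up for illustration, and it assumes nvidia-smi is on the PATH and torch is importable, which should hold on the preinstalled image but is not guaranteed. It asks the driver which GPUs it sees (nvidia-smi -L) and then asks PyTorch the same question.

```python
import shutil
import subprocess


def gpu_status():
    """Collect basic GPU visibility info from the NVIDIA driver and from PyTorch.

    Hypothetical helper for illustration; values stay None when a check cannot run.
    """
    status = {
        "nvidia_smi": None,
        "torch_cuda_available": None,
        "torch_device_count": None,
    }

    # Driver-level check: `nvidia-smi -L` lists the GPUs the driver can see.
    smi = shutil.which("nvidia-smi")
    if smi is not None:
        try:
            result = subprocess.run(
                [smi, "-L"], capture_output=True, text=True, timeout=30
            )
            # On failure nvidia-smi usually explains itself on stdout or stderr.
            status["nvidia_smi"] = (result.stdout or result.stderr).strip()
        except (OSError, subprocess.TimeoutExpired) as exc:
            status["nvidia_smi"] = f"failed to run: {exc}"

    # Framework-level check: what PyTorch itself reports.
    try:
        import torch
    except ImportError:
        return status  # torch is not installed in this environment

    status["torch_cuda_available"] = torch.cuda.is_available()
    status["torch_device_count"] = torch.cuda.device_count()
    return status


if __name__ == "__main__":
    for key, value in gpu_status().items():
        print(f"{key}: {value}")
```

    If nvidia-smi lists the GPU but torch reports no CUDA device, the problem is more likely in the CUDA/PyTorch layer; if nvidia-smi itself fails after the restart, the driver is the more likely place to look.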
