Azure Tesla V100 Driver Problem

DevAzure 6 Reputation points
2021-04-02T20:48:12.947+00:00

Hello

I wish you a happy day.

There is a driver problem in virtual machine instances on Azure. Tesla v100 graphics cards cannot be assigned by default. Detailed information is attached.

84146-image.png

84108-image.png

Azure Virtual Machines
Azure Virtual Machines
An Azure service that is used to provision Windows and Linux virtual machines.
7,129 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. Olga Os - MSFT 5,831 Reputation points Microsoft Employee
    2021-04-05T05:50:13.783+00:00

    @DevAzure Apologies for delay in response and all the inconvenience caused because of the issue.

    Please correct me if I am wrong. You are not able to install NVIDIA Tesla V100 driver on the N-Series Azure VM. Could you please share your VM SKU/OS? What steps are you following in your set up?

    I have looked into several documents, the driver will be installed after installing the NVIDIA GPU Driver Extension. Not sure if you have gone through these documents before:

    As example, NCv3-series:

    To take advantage of the GPU capabilities of Azure N-series VMs, NVIDIA GPU drivers must be installed.

    The NVIDIA GPU Driver Extension installs appropriate NVIDIA CUDA or GRID drivers on an N-series VM. Install or manage the extension using the Azure portal or tools such as Azure PowerShell or Azure Resource Manager templates. See the NVIDIA GPU Driver Extension documentation for supported operating systems and deployment steps. For general information about VM extensions, see Azure virtual machine extensions and features.

    If you choose to install NVIDIA GPU drivers manually, see N-series GPU driver setup for Windows or N-series GPU driver setup for Linux for supported operating systems, drivers, installation, and verification steps.

    NVIDIA GPU Driver Extension for Windows
    NVIDIA GPU Driver Extension for Linux

    Hope it helps!!!

    0 comments No comments

  2. DevAzure 6 Reputation points
    2021-04-05T08:30:12.907+00:00

    Hello @olgaoos thank you very much, how are you?

    The series I use are as follows ...

    NC24s v3 (4X Tesla V100 GPUS) OS: Windows 10

    I have tried installing the drivers two ways.

    1) I downloaded the drivers from the NVIDIA official site. The result is negative.

    2) There is a "Extensions" menu on the Azure portal. I installed the drivers through this menu. The result is negative.

    The devices are showing up fine. But when I want to set it as the default graphics device I get the above error. So you can understand I cannot use Tesla V100 devices at full performance.

    I will be grateful if you help me.

    Yours sincerely.