Is it possible to use CUDA v12 on NVadsA10 v5-series instances in AKS?

Patrick Deubel 50

Hi, I have a "Standard_NV12ads_A10_v5" node pool in my AKS cluster. The application I deploy there crashes because it requires CUDA v12, while the image used on the node provides CUDA v11.6. I have also used other instances, such as the single 80 GB A100, which to my surprise uses CUDA v12. Why does the A10 instance use CUDA 11.6? Is that because it is a slice of a full GPU? Do I have to use a full GPU, or can the CUDA version be updated? I also want to autoscale this node pool, so any update would have to be automated. Is that a roadblock?

Thanks for any answers!

Accepted answer

Prrudram-MSFT 28,286 Microsoft Employee Moderator

Hello @Patrick Deubel

Thank you for reaching out to the Microsoft Q&A platform.

The reason why the "Standard_NV12ads_A10_v5" node pool in your AKS cluster is using CUDA v11.6 is because it is based on the NVIDIA Tesla V100 GPU, which supports CUDA 11.6. The A10 GPU is a slice of the V100 GPU and has a smaller number of CUDA cores, which is why it is using the same version of CUDA.

To use CUDA v12, you will need to use a GPU that supports it, such as the NVIDIA A100 GPU. You can create a new node pool in your AKS cluster that uses the A100 GPU and deploy your application to that node pool. You can also use a custom image that has CUDA v12 installed on it.
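Before moving the application to another node pool, it can help to confirm which CUDA version a node's driver actually reports. The following is a minimal sketch, not an official tool: it assumes the banner printed by `nvidia-smi` contains a `CUDA Version: X.Y` field, and the function names and the sample banner string are illustrative assumptions.

```python
import re
import subprocess

# Illustrative sample of an nvidia-smi banner line (an assumption, not captured from a real node).
SAMPLE_BANNER = "| NVIDIA-SMI 510.73.08    Driver Version: 510.73.08    CUDA Version: 11.6     |"


def parse_cuda_version(smi_output):
    """Return (major, minor) from the 'CUDA Version: X.Y' field, or None if absent."""
    match = re.search(r"CUDA Version:\s*(\d+)\.(\d+)", smi_output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def meets_requirement(smi_output, required_major=12):
    """True if the reported CUDA major version is at least required_major."""
    version = parse_cuda_version(smi_output)
    return version is not None and version[0] >= required_major


def read_nvidia_smi():
    """Run nvidia-smi and return its text output; raises if the tool is missing or fails."""
    result = subprocess.run(["nvidia-smi"], capture_output=True, text=True, check=True)
    return result.stdout


if __name__ == "__main__":
    try:
        banner = read_nvidia_smi()
    except (FileNotFoundError, subprocess.CalledProcessError):
        banner = SAMPLE_BANNER  # no usable driver on this machine; fall back to the sample
    print(parse_cuda_version(banner), meets_requirement(banner))
```

Run inside a GPU pod on each node pool, a check like this shows whether the pool reports CUDA 12 or later before the application is scheduled there.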

To automate the autoscaling of the node pool, you can use the Kubernetes Horizontal Pod Autoscaler (HPA) to automatically scale the number of pods based on CPU or memory utilization. You can also use the Kubernetes Cluster Autoscaler (CA) to automatically scale the number of nodes in the node pool based on the demand for resources. The HPA and CA can work together to ensure that your application has the resources it needs to run efficiently.
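To make the HPA part concrete, the scaling rule described in the Kubernetes documentation can be sketched in a few lines. This is a simplified illustration, not the controller's actual implementation: the function name and the default bounds are assumptions, and details such as stabilization windows and pod readiness are ignored.

```python
import math


def desired_replicas(current_replicas, current_metric, target_metric,
                     min_replicas=1, max_replicas=10, tolerance=0.1):
    """Simplified HPA rule: ceil(current_replicas * current_metric / target_metric).

    No change is made while the ratio stays within the tolerance band around 1.0,
    and the result is clamped to [min_replicas, max_replicas].
    """
    ratio = current_metric / target_metric
    if abs(ratio - 1.0) <= tolerance:
        desired = current_replicas  # close enough to the target: keep the replica count
    else:
        desired = math.ceil(current_replicas * ratio)
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # Two pods averaging 90% CPU against a 60% target.
    print(desired_replicas(2, 90, 60))
```

When the HPA adds pods that cannot be scheduled on the existing nodes, for example because each pod requests a GPU, the Cluster Autoscaler is the component that adds nodes to the pool, within the pool's configured minimum and maximum node counts.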

Patrick Deubel 50 Reputation points

2023-11-10T07:49:46.08+00:00

Hi @Prrudram-MSFT

Thanks for your fast answer! I find that highly confusing. Why is the instance type based on a V100 when the documentation for the instance says:

"The NVadsA10v5-series virtual machines are powered by NVIDIA A10 GPUs [...]"

It does not mention the V100 at all. From reading through the documentation, I clearly get the impression that you get a third of an actual A10 GPU, and therefore its features as well. Maybe you could raise an issue internally about updating the documentation. I think that would help a lot.

Prrudram-MSFT 28,286 Reputation points Microsoft Employee Moderator

2023-11-15T03:59:44.0633333+00:00

@Patrick Deubel, I will review this internally. Thanks for pointing that out.
