@TanulBhasin-0866 Apologies for delay in response and all the inconvenience caused because of the issue.
Graphical processing units (GPUs) are often used for compute-intensive workloads such as graphics and visualization workloads. AKS supports the creation of GPU-enabled node pools to run these compute-intensive workloads in Kubernetes
Currently AKS does not allow pods to share GPUs, you can have only as many replicas of a GPU-enabled web service as there are GPUs in the cluster.
You can find the same information here as well.
You can refer to this article as well which might be helpful to understand the use of GPU in AKS.
Hope it helps!!!
Do let me know in case of any more queries.
Please 'Accept as answer' if it helped, so that it can help others in the community looking for help on similar topics