Hello, @rantash68 !
How do I create a VM with an NVIDIA GPU?
This is a frequent question as AI has hit unprecedented levels of popularity with ChatGPT, GPT powered search, and different image generation models. There are more people using GPU VMs than ever before and that has meant new challenges when keeping up with demands.
With this new surge in GPU use, the best practical answer is to keep checking less popular regions like South Central US as new capacity is constantly being added and understand that many others are doing the same thing:
Ensuring capacity for our customers is a top priority for Microsoft and we are working around the clock to deliver on this. The increasing demand for Azure services is evidence of the popularity of Azure, and emphasizes the need to scale up our infrastructure even more rapidly. With that in mind, we are expediting expansions and are improving our resource deployment process to respond to this strong customer demand. In fact, we are adding a significant amount of compute infrastructure monthly. We have identified several improvements on how we load-balance under a high resource usage situation, and how to trigger the timely deployment of needed resources. Furthermore, we are increasing our capacity significantly – and will continue to plan for strong customer demand across all of our regions. This September 2021 blog post covers improvements towards delivering a resilient cloud supply chain. Before implementing any restrictions on customers, we first pull every lever available to expedite new capacity, and to preserve existing capacity – this includes shaping internal customers to non-constrained regions whenever possible. If these steps are not sufficient to avoid a capacity shortfall, we then implement “offer restrictions” which prevent newly-created subscriptions from being able to deploy to the constrained region. Should you be experiencing an allocation failure, there are many ways to troubleshoot. We also work closely with our impacted customers to offer alternate solutions, should a capacity constraint affect their deployment.