An Azure service that is used to provision Windows and Linux virtual machines.
Hello tek,
Thank you for providing the detailed information and screenshots. I appreciate your patience while we reviewed the issue.
Based on the behavior observed, this issue does not appear to be related to the AMD GPU hardware or the VM provisioning itself. The error is occurring before the actual GPU driver installation starts.
The message ErrorLoadingExtensionAndDefinition associated with Microsoft_Azure_CreateUIDef generally indicates that the Azure Portal is unable to properly load the extension configuration UI required for deploying the AMD GPU driver extension. In these scenarios, the issue is typically related to the Azure Portal experience, extension metadata loading, browser cache/session issues, or compatibility validation with newer operating system builds.
As an initial step, please try clearing the browser cache and cookies, using an InPrivate/Incognito session, or switching to Microsoft Edge before retrying the operation. In many similar cases, this resolved the issue by forcing the Azure Portal to reload the extension UI components correctly.
We also recommend stopping (deallocating) the VM and starting it again once. This refreshes the Azure VM Guest Agent, which is responsible for handling extension deployments inside the VM.
Additionally, please verify that the Microsoft.Compute resource provider is properly registered in the subscription. We also recommend confirming that the Azure VM Guest Agent service inside the VM is running and set to Automatic, since VM extensions depend on the Guest Agent service for installation and execution.
If there are any partially failed or stale VM extensions already associated with the VM, please remove the existing extension entry before retrying the deployment. In similar reported cases, removing the failed extension and redeploying it resolved the issue successfully.
You may use the following command to remove the existing extension:
Remove-AzVMExtension `
-ResourceGroupName "<ResourceGroupName>" `
-VMName "<VMName>" `
-Name "AmdGpuDriverWindows"
Since the issue appears to be limited to the Azure Portal UI experience, the recommended approach is to deploy the AMD GPU driver extension using Azure PowerShell or Azure CLI, which bypasses the Portal completely and invokes the extension deployment directly through the backend.
You may use the following PowerShell command after replacing the VM name and resource group details:
Set-AzVMExtension `
-ResourceGroupName "<ResourceGroupName>" `
-VMName "<VMName>" `
-Location "uksouth" `
-Publisher "Microsoft.HpcCompute" `
-ExtensionName "AmdGpuDriverWindows" `
-ExtensionType "AmdGpuDriverWindows" `
-TypeHandlerVersion "1.1"
For additional troubleshooting, you can also review the VM extension logs inside the VM at the following location:
C:\WindowsAzure\Logs\Plugins
If the extension deployment still does not succeed, you can proceed with manual driver installation by connecting to the VM and following the Microsoft documentation below:
Install AMD GPU drivers on N-series VMs running Windows
NVIDIA GPU Driver Extension for Windows