Hello, @Egbertn !
How do I use GPU resources (preview) with Azure Container Instances (ACI)?
Using GPU resources with Azure Container Instances (ACI) is still in preview, so the recommendation is to follow the documentation as closely as possible while keeping in mind that some features may not be implemented and that performance may not yet be at the level of production SLAs.
Documentation:
https://learn.microsoft.com/en-us/azure/container-instances/container-instances-gpu
Following the documentation above, I was able to get both V100 and K80 deployed although it did take a couple attempts due to availability and some intermittent execution errors/timeouts.
az deployment group create --resource-group myResourceGroup --template-file gpudeploy.json
K80 modified ARM template:
{
"$schema": "https://schema.management.azure.com/schemas/2015-01-01/deploymentTemplate.json#",
"contentVersion": "1.0.0.0",
"parameters": {
"containerGroupName": {
"type": "string",
"defaultValue": "gpucontainergrouprm",
"metadata": {
"description": "Container Group name."
}
}
},
"variables": {
"containername": "gpucontainer",
"containerimage": "mcr.microsoft.com/azuredocs/samples-tf-mnist-demo:gpu"
},
"resources": [
{
"name": "[parameters('containerGroupName')]",
"type": "Microsoft.ContainerInstance/containerGroups",
"apiVersion": "2021-09-01",
"location": "[resourceGroup().location]",
"properties": {
"containers": [
{
"name": "[variables('containername')]",
"properties": {
"image": "[variables('containerimage')]",
"resources": {
"requests": {
"cpu": 4.0,
"memoryInGb": 12.0,
"gpu": {
"count": 1,
"sku": "K80"
}
}
}
}
}
],
"osType": "Linux",
"restartPolicy": "OnFailure"
}
}
]
}
Successful deployment of K80 in West Europe after navigating to the directory with the ARM template