Formerly known as Azure AI Services or Azure Cognitive Services is a unified collection of prebuilt AI capabilities within the Microsoft Foundry platform
Hi Kai Neuffer
Each model asks specific VM type for deploying to endpoint.
It is asking NC24ADSA100_V4 compute for deployment in this case.
You might need to raise quota if you want to serve the model as endpoint.
Recommendation
For POC Proposes, we should be able to call it via foundry client and run predictions.
There are other varieties of Aurora mentioned here to register via Mlflow and deploy.
Attached relevant document for reference
https://github.com/microsoft/aurora/blob/main/package_mlflow.py
https://mlflow.org/docs/latest/ml/deep-learning/transformers/large-models/
Thank you.