@A-4824 Thanks for the question. Azure Machine Learning offers a way to run these Foundation Models. AOAI you pay per use (tokens), where with other foundation models you need to host them yourselves and thus pay based on the compute required.
Foundation Models in Azure Machine Learning - Azure Machine Learning | Microsoft Learn.
Blogs LLAMA and Flacon on Azure ML environment: https://blogs.microsoft.com/blog/2023/07/18/microsoft-and-meta-expand-their-ai-partnership-with-llama-2-on-azure-and-windows/
Kindly accept the answer if it is helpful.