Select Azure platform services (PaaS) for AI

2025-07-02

This article offers resource selection recommendations for Azure AI platform-as-a-service (PaaS) solutions. The following table provides an overview of the primary Azure AI PaaS solutions and important decision criteria.

AI services	AI type	Description	Skills required
Azure AI Foundry	Generative and nongenerative AI	A platform for building and deploying generative and nongenerative AI applications	Developer and data science skills
Azure AI services	Generative and nongenerative AI	Various services that provide prebuilt generative and nongenerative AI models	Developer skills
Azure OpenAI	Generative AI	A service for accessing OpenAI models	Developer and data science skills
Azure Machine Learning	Machine learning	A service for training and deploying machine learning models	Developer skills and advanced data science skills
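
As a rough illustration, the table above can be encoded as data and filtered by AI type and available skills. This is a minimal sketch, not an official selection tool: the `Service` class, the `candidates` function, and the single `needs_data_science` flag are hypothetical simplifications of the "Skills required" column.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Service:
    name: str
    ai_types: frozenset          # AI types the service covers, per the table
    needs_data_science: bool     # assumption: collapses "Skills required" to one flag


# Rows copied from the overview table; the boolean flag is a simplification.
SERVICES = [
    Service("Azure AI Foundry", frozenset({"generative", "nongenerative"}), True),
    Service("Azure AI services", frozenset({"generative", "nongenerative"}), False),
    Service("Azure OpenAI", frozenset({"generative"}), True),
    Service("Azure Machine Learning", frozenset({"machine learning"}), True),
]


def candidates(ai_type: str, has_data_science_skills: bool) -> list:
    """Return the names of services that match the AI type and the team's skills."""
    return [
        s.name
        for s in SERVICES
        if ai_type in s.ai_types
        and (has_data_science_skills or not s.needs_data_science)
    ]


if __name__ == "__main__":
    print(candidates("generative", has_data_science_skills=False))
```

A team with developer skills only and a generative AI requirement is left with the prebuilt models in Azure AI services, which matches the intent of the table. Real selection also depends on criteria the table doesn't capture, such as cost and model availability.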

Select resources for generative AI workloads

Generative AI combines various resources to process input data and generate meaningful outputs. Selecting the right resources ensures that applications, such as those using retrieval augmented generation (RAG), deliver accurate results by grounding AI models effectively.

In a typical RAG workload: (1) The workload receives a user query. (2) An orchestrator, such as Azure AI Foundry Agent Service, Semantic Kernel, or LangChain, manages the data flow. (3) A search and retrieval mechanism identifies the relevant content in (4) the grounding data and sends it to the generative AI endpoint. (5) A generative AI model endpoint generates a response based on the user query and grounding data. Use the following recommendations to build generative RAG workloads:

Choose a generative AI platform. Select Azure AI Foundry or Azure OpenAI to deploy and manage generative AI models. Azure AI Foundry provides a code-first platform with built-in tools for developing, deploying, and orchestrating applications. Use Azure OpenAI if you need access only to OpenAI models.
Choose the appropriate AI compute type. Azure AI Foundry requires compute instances for certain capabilities. Select a compute type that meets your performance and budget needs.
Pick an orchestrator. Use popular orchestrators like Azure AI Foundry Agent Service, Semantic Kernel, or LangChain to manage data flow and interactions.
Pick a search and knowledge retrieval mechanism. Ground generative AI models by creating an index or vector database for relevant data retrieval. Use Azure AI Search to build traditional and vector indexes from various data sources, apply data chunking, and use multiple query types. For structured databases, consider Azure Cosmos DB, Azure Database for PostgreSQL, or Azure Cache for Redis.
Choose a data source for grounding data. Store grounding data in Azure Blob Storage for images, audio, video, or large datasets. Alternatively, use databases supported by Azure AI Search or vector databases.
Pick a compute platform. Use the Azure compute decision tree to select the right platform for your workload.
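
The RAG flow described in this section can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not a reference implementation: the in-memory `GROUNDING_DOCS` list and the word-overlap `retrieve` function are hypothetical stand-ins for a data source and a retrieval service such as Azure AI Search, and `stub_model` stands in for a generative AI model endpoint. No Azure SDK calls are used.

```python
import re
from typing import Callable, List

# (4) Hypothetical grounding data. In a real workload this would live in a
# data source such as Azure Blob Storage and be indexed by a retrieval service.
GROUNDING_DOCS = [
    "Azure AI Search builds traditional and vector indexes.",
    "Azure Blob Storage stores images, audio, video, and large datasets.",
    "Semantic Kernel and LangChain are orchestrators.",
]


def tokenize(text: str) -> set:
    """Lowercase the text and split it into a set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, docs: List[str], top_k: int = 2) -> List[str]:
    """(3) Toy retrieval: rank documents by words shared with the query."""
    query_words = tokenize(query)
    scored = [(len(query_words & tokenize(doc)), doc) for doc in docs]
    ranked = sorted(scored, key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in ranked if score > 0][:top_k]


def build_prompt(query: str, grounding: List[str]) -> str:
    """Combine the user query with the retrieved grounding data."""
    context = "\n".join(f"- {doc}" for doc in grounding)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


def orchestrate(query: str, generate: Callable[[str], str],
                docs: List[str] = GROUNDING_DOCS) -> str:
    """(2) Orchestrator: retrieve grounding data, then call the model (5)."""
    grounding = retrieve(query, docs)
    return generate(build_prompt(query, grounding))


def stub_model(prompt: str) -> str:
    """Stand-in for a generative AI endpoint; it only reports the prompt size."""
    return f"[stub response to a {len(prompt)}-character prompt]"


if __name__ == "__main__":
    # (1) The workload receives a user query.
    print(orchestrate("Where do I store video datasets?", stub_model))
```

Passing the model call in as a function keeps the orchestration logic independent of the platform choice, so the stub can be swapped for a call to whichever endpoint you deploy.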

Select resources for nongenerative AI workloads

Nongenerative AI workloads use platforms, compute resources, data sources, and data processing tools to support machine learning tasks. Selecting the right resources ensures you can build AI workloads using prebuilt or custom solutions effectively.

In a nongenerative AI workload: (1) The workload ingests data. (2) An optional data processing mechanism extracts or manipulates incoming data. (3) An AI model endpoint analyzes the data. (4) The data can also support training or fine-tuning of AI models. Use the following recommendations to build nongenerative AI workloads:

Choose a nongenerative AI platform. Select Azure AI services or Azure Machine Learning based on your requirements. Azure AI services provide prebuilt AI models that simplify deployment and reduce the need for deep data science expertise. Use Azure Machine Learning if you need a platform to develop custom machine learning models using your data and integrate those models into your workloads.
Choose the appropriate AI compute type. Azure Machine Learning requires compute resources to run jobs or host endpoints. Select a compute type that meets your performance and budget needs. Azure AI services don't require compute resources.
Pick a data source. Use supported data sources to host training data for Azure Machine Learning. Many Azure AI services don't require fine-tuning data, and some, like Azure AI Custom Vision, let you upload local files to a managed data storage solution.
Pick a compute platform. Use the Azure compute decision tree to select the right platform for your workload.
Pick a data processing service (optional). Use Azure Functions for serverless data processing or Azure Event Grid to trigger data processing pipelines.
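
The nongenerative flow described in this section (ingest, optionally process, then analyze) can be sketched as a small pipeline. This is a minimal sketch under stated assumptions: `run_pipeline`, `normalize`, and `keyword_classifier` are hypothetical names, and the keyword check is a stand-in for a call to a prebuilt or custom model endpoint.

```python
from typing import Callable, Iterable, List, Optional


def run_pipeline(records: Iterable[str],
                 analyze: Callable[[str], str],
                 preprocess: Optional[Callable[[str], str]] = None) -> List[str]:
    """Ingest records (1), optionally process them (2), then analyze them (3)."""
    results = []
    for record in records:
        item = preprocess(record) if preprocess else record
        results.append(analyze(item))
    return results


def normalize(text: str) -> str:
    """Example processing step: collapse whitespace and lowercase the text."""
    return " ".join(text.split()).lower()


def keyword_classifier(text: str) -> str:
    """Stand-in for an AI model endpoint; a real workload would call a service."""
    return "invoice" if "invoice" in text else "other"


if __name__ == "__main__":
    incoming = ["  INVOICE   #42 ", "Hello there"]
    print(run_pipeline(incoming, keyword_classifier, preprocess=normalize))
```

Keeping the processing step optional mirrors the recommendation above: some workloads send data straight to the model endpoint, while others need a step such as a serverless function to clean or extract data first.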

Next step

Networking
