Exercise - Deploy an AI inference API to Azure Kubernetes Service

Completed

In this exercise, you deploy Azure resources including a Microsoft Foundry AI model, Azure Container Registry (ACR), and Azure Kubernetes Service (AKS) cluster. You then complete Kubernetes manifest files to define container specifications, health probes, resource limits, and load balancing. After deploying the containerized API to AKS, you use a Python client application to test the deployed API endpoints including health checks, readiness validation, and AI model inference requests.

Tasks performed in this exercise:

  • Download the project starter files
  • Deploy resources to Azure
  • Complete the deployment.yaml and service.yaml files and deploy the container to AKS
  • Run the client app to test the API

This exercise takes approximately 30-40 minutes to complete.

Before you start

To complete the exercise, you need:

Get started

Select the Launch Exercise button to open the exercise instructions in a new browser window. When you're finished with the exercise, return here to:

  • Complete the module
  • Earn a badge for completing this module

Button to launch exercise.