Monitor your generative AI application

Module
9 Units

Intermediate

AI Engineer

Microsoft Foundry

Learn how to monitor the performance of your generative AI application using Microsoft Foundry. This module teaches you to track key metrics like latency and token usage to make informed, cost-effective deployment decisions.

Learning objectives

By the end of this module, you'll be able to:

Understand why monitoring is essential when moving Gen AI apps toward production readiness.
Identify and interpret key performance metrics: latency, throughput, token usage, and error rates.
Use Azure Monitor together with Microsoft Foundry to observe and analyze app behavior.
Apply insights to optimize performance, cost, and user experience in Gen AI solutions.

Prerequisites

Before starting this module, you should be familiar with:

Basic software development concepts
Basic AI concepts
Basic Azure concepts

Introduction min
Why do you need to monitor? min
Understand key metrics to monitor min
Explore how to monitor with Azure min
Integrate monitoring into your app min
Interpret monitoring results min
Exercise - Enable monitoring for a generative AI application min
Knowledge check min
Summary min

Start