Monitor your generative AI application

Intermediate
AI Engineer
Microsoft Foundry

Learn how to monitor the performance of your generative AI application using Microsoft Foundry. This module teaches you to track key metrics like latency and token usage to make informed, cost-effective deployment decisions.

Learning objectives

By the end of this module, you'll be able to:

  • Understand why monitoring is essential when moving Gen AI apps toward production readiness.
  • Identify and interpret key performance metrics: latency, throughput, token usage, and error rates.
  • Use Azure Monitor together with Microsoft Foundry to observe and analyze app behavior.
  • Apply insights to optimize performance, cost, and user experience in Gen AI solutions.

Prerequisites

Before starting this module, you should be familiar with:

  • Basic software development concepts
  • Basic AI concepts
  • Basic Azure concepts