Develop a speech agent with the Azure Speech MCP server

Module
6 Units

Intermediate

AI Engineer

Developer

Microsoft Foundry

Azure AI Speech

Learn how to build an AI agent that uses the Azure Speech MCP server to perform speech-to-text and text-to-speech tasks.

Learning objectives

After completing this module, you'll be able to:

Describe the Azure Speech MCP server and the speech capabilities it exposes.
Explain how MCP enables dynamic tool discovery and selection by AI agents.
Set up Azure Blob Storage for audio file input and output.
Connect the Azure Speech MCP server to an agent in Microsoft Foundry.
Build a Python client application that invokes an agent to perform speech tasks.

Prerequisites

Before starting this module, you should:

Be familiar with Azure services and the Microsoft Foundry portal.
Have experience deploying generative AI models in Microsoft Foundry.
Have some familiarity with Python programming.

Introduction
Understand the Azure Speech MCP server
Connect and use the Speech MCP server with an agent
Exercise - Use Azure Speech in an agent
Knowledge check
Summary