Develop a speech agent with the Azure Speech MCP server

Intermediate
AI Engineer
Developer
Microsoft Foundry
Azure AI Speech

Learn how to build an AI agent that uses the Azure Speech MCP server to perform speech-to-text and text-to-speech tasks.

Learning objectives

After completing this module, you'll be able to:

  • Describe the Azure Speech MCP server and the speech capabilities it exposes.
  • Explain how MCP enables dynamic tool discovery and selection by AI agents.
  • Set up Azure Blob Storage for audio file input and output.
  • Connect the Azure Speech MCP server to an agent in Microsoft Foundry.
  • Build a Python client application that invokes an agent to perform speech tasks.

Prerequisites

Before starting this module, you should:

  • Be familiar with Azure services and the Microsoft Foundry portal.
  • Have experience deploying generative AI models in Microsoft Foundry.
  • Have some familiarity with Python programming.