Summary

2 minutes

The Azure Speech MCP server connects AI agents to speech-to-text and text-to-speech capabilities through the Model Context Protocol. In this module, you learned how to use this server to build an agent that can transcribe audio and generate speech.

In this module, you learned how to:

Describe the Azure Speech MCP server and the speech capabilities it exposes.
Explain how MCP enables dynamic tool discovery and selection by AI agents.
Set up Azure Blob Storage for audio file input and output.
Connect the Azure Speech MCP server to an agent in Microsoft Foundry.
Build a Python client application that invokes an agent with speech tools using the Foundry SDK.

Learn more

Azure Speech in Foundry Tools for the Azure MCP Server
Connect to Model Context Protocol servers
Azure AI Projects SDK for Python
Azure Speech service overview

Feedback

Was this page helpful?