Summary
The Azure Speech MCP server connects AI agents to speech-to-text and text-to-speech capabilities through the Model Context Protocol. In this module, you learned how to use this server to build an agent that can transcribe audio and generate speech.
In this module, you learned how to:
- Describe the Azure Speech MCP server and the speech capabilities it exposes.
- Explain how MCP enables dynamic tool discovery and selection by AI agents.
- Set up Azure Blob Storage for audio file input and output.
- Connect the Azure Speech MCP server to an agent in Microsoft Foundry.
- Build a Python client application that invokes an agent with speech tools using the Foundry SDK.