Introduction

1 minute

Speech is one of the most natural ways humans communicate, and bringing speech capabilities to AI applications creates more intuitive, accessible, and engaging user experiences. Whether you're building a voice assistant, creating accessible applications, or developing conversational AI agents, understanding speech technologies is essential for modern AI solutions.

In this module, you'll explore the two fundamental speech capabilities that power voice-enabled applications: speech recognition (converting spoken words to text) and speech synthesis (converting text to natural-sounding speech). You'll discover how these technologies work together to create seamless voice interactions and learn about the real-world scenarios where speech can transform user experiences.

Note

We recognize that different people like to learn in different ways. You can choose to complete this module in video-based format or you can read the content as text and images. The text contains greater detail than the videos, so in some cases you might want to refer to it as supplemental material to the video presentation.

Feedback

Was this page helpful?