Language learning with Azure AI Speech

One of the most important aspects of learning a new language is speaking and listening. Azure AI Speech provides features that can be used to help language learners.

Pronunciation Assessment

The Pronunciation Assessment feature is designed to provide instant and comprehensive feedback to users on the accuracy, fluency, prosody, vocabulary usage, grammar correctness, and topic understanding of their speech when learning a new language, so that they can speak and present in a new language with confidence. For information about availability of pronunciation assessment, see supported languages and available regions.

The Pronunciation Assessment feature offers several benefits for educators, service providers, and students.

  • For educators, it provides instant feedback, eliminates the need for time-consuming oral language assessments, and offers consistent and comprehensive assessments.
  • For service providers, it offers high real-time capabilities, worldwide Azure AI Speech and supports growing global business.
  • For students and learners, it provides a convenient way to practice and receive feedback, authoritative scoring to compare with native pronunciation and helps to follow the exact text order for long sentences or full documents.

Speech to text

Azure Speech to text supports real-time language identification for multilingual language learning scenarios, help human-human interaction with better understanding and readable context.

Text to speech

Text to speech prebuilt neural voices can read out learning materials natively and empower self-served learning. A broad portfolio of languages and voices are supported for AI teacher, content read aloud capabilities, and more. Microsoft is continuously working on bringing new languages to the world.

Custom Neural Voice is available for you to create a customized synthetic voice for your applications. Education companies are using this technology to personalize language learning, by creating unique characters with distinct voices that match the culture and background of their target audience.

Next steps