Note
Please see Azure Cognitive Services for Speech documentation for the latest supported speech solutions.
Microsoft Speech Programming Guide
This programming guide provides an introduction to developing speech applications using the Microsoft.Speech namespaces in the Microsoft Speech Platform SDK 11.
The concepts and examples presented in this documentation will show you how to incorporate speech recognition and speech synthesis (text-to-speech or TTS) into applications running on Windows Server or other platforms that can leverage Microsoft's redistributable speech engines. We recommend that you read the sections titled Speech Synthesis and Speech Recognition before beginning a speech application.
In the section titled Speech Recognition (Microsoft.Speech), you will find discussion and examples that will help you accomplish the following:
Create grammars
Initialize and manage a speech recognition engine
Interpret the results returned by a speech recognition engine
Work with speech recognition events
The section titled Speech Synthesis (Microsoft.Speech) includes concepts and examples that will help you accomplish the following:
Author content for TTS
Generate speech in your application
Select and modify a voice
Work with speech synthesis events
In addition, you will find reference sections to industry-standard XML markup languages including speech recognition, speech synthesis, semantics, and pronunciation lexicons, as supported by the Speech Platform SDK 11.
In This Section
Reference
This programming guide includes links to documentation for specific classes and members of the Microsoft.Speech namespaces. You can find detailed documentation of all the classes and members of the Microsoft.Speech namespaces at the links listed below.
-
Gives applications control over loading resources.
-
Contains the types that specify the audio format used in speech recognition.
-
Contains the types for implementing speech recognition in an application.
Microsoft.Speech.Recognition.SrgsGrammar
Contains managed code types that correspond to the elements and attributes of the W3C Speech Recognition Grammar Specification Version 1.0.
-
Contains the types for implementing speech synthesis in an application.
-
Represents the results of text normalization.