Microsoft.CognitiveServices.Speech Namespace

Classes

AudioDataStream

Provides audio data as a stream. Added in 1.4.0

AutoDetectSourceLanguageConfig

Configures options for automatic detection of languages. Updated in 1.13.0

AutoDetectSourceLanguageResult

Contains languages detected by the Speech service. Added in 1.9.0

CancellationDetails

Contains detailed information about why a result was canceled.

ClassLanguageModel

Represents a list of grammars for dynamic grammar scenarios. Added in 1.7.0

Connection

A proxy class for managing connection to the speech service of the specified Recognizer. Added in 1.2.0

ConnectionEventArgs

Contains payload for Connected/Disconnected events Added in 1.2.0

ConnectionMessage

Represents implementation-specific messages sent to and received from the speech service. For debugging only. Added in 1.10.0

ConnectionMessageEventArgs

Contains payload for MessageReceived events of a Connection instance. Added in 1.10.0

ContentAssessmentResult

Content assessment results Added in 1.33.0

DetailedSpeechRecognitionResult

Contains recognition details including confidence score, recognized text, raw lexical form, normalized form, and normalized form with masked profanity. Changed in 1.7.0

EmbeddedSpeechConfig

Class that defines embedded (offline) speech configuration.

Grammar

Represents base class grammar for customizing speech recognition. Added in 1.5.0

GrammarList

Represents a list of grammars for dynamic grammar scenarios. Added in 1.7.0

GrammarPhrase

Represents a phrase that can be spoken by the user. Added in 1.5.0

HybridSpeechConfig

Class that defines hybrid (cloud and embedded) configurations for speech recognition and speech synthesis.

KeywordRecognitionEventArgs

Class for the events emitted by the KeywordRecognizer.

KeywordRecognitionModel

Represents keyword recognition model that can trigger an event when pre-defined keywords are spoken.

KeywordRecognitionResult

Contains the results emitted by the KeywordRecognizer.

KeywordRecognizer

Recognizes a word or short phrase using a keyword model.

NoMatchDetails

Contains detailed information for NoMatch recognition results.

PhonemeLevelTimingResult

Phoneme level timing result. Added in 1.14.0

PhraseListGrammar

Identifies known phrases in audio data. Added in 1.5.0

PronunciationAssessmentNBestPhoneme

Pronunciation assessment nbest phoneme result Added in 1.20.0

PropertyCollection

Class to retrieve or set a property value from a property collection.

RecognitionEventArgs

Contains payload for recognition events like Speech Start/End Detected.

RecognitionResult

Contains detailed information about result of a recognition operation.

Recognizer

Base class that mostly contains common event handlers.

SessionEventArgs

Contains payload for SessionStarted and SessionStopped events.

SourceLanguageConfig

Source Language configuration. Added in 1.17.0

SourceLanguageRecognizer

Detects the spoken language on the input audio. Added in version 1.17.0

SpeechConfig

Information about your subscription, including your key and region, endpoint, host, or authorization token.

SpeechRecognitionCanceledEventArgs

Contains payload of speech recognition canceled result events.

SpeechRecognitionEventArgs

Contains payload of speech recognizing/recognized events.

SpeechRecognitionModel

Speech recognition model information.

SpeechRecognitionResult

Contains result of speech recognition.

SpeechRecognitionResultExtensions

Extension methods for speech recognition result

SpeechRecognizer

Transcribes speech into text. Speech can arrive via microphone, audio file, or other audio input stream.

SpeechSynthesisBookmarkEventArgs

Contains bookmark event in synthesized speech. Added in 1.16.0

SpeechSynthesisCancellationDetails

Contains detailed information about why a speech synthesis result was canceled. Added in 1.4.0

SpeechSynthesisEventArgs

Contains payload of speech synthesis events. Added in 1.4.0

SpeechSynthesisResult

Contains detailed information about result of a speech synthesis operation. Added in 1.4.0

SpeechSynthesisVisemeEventArgs

Contains facial pose events that correspond to time-based offsets in synthesized speech. Added in 1.16.0

SpeechSynthesisWordBoundaryEventArgs

Contains location and length details about words in synthesized speech. Added in 1.7.0

SpeechSynthesizer

Performs speech synthesis to speaker, file, or other audio output streams, and gets synthesized audio as result. Updated in 1.16.0

SpeechTranslationConfig

Speech translation configuration.

SpeechTranslationModel

Speech translation model information.

SyllableLevelTimingResult

Syllable level timing result. Added in 1.20.0

SynthesisVoicesResult

Contains detailed information about the retrieved synthesis voices list. Added in 1.16.0

VoiceInfo

Contains detailed information about the synthesis voice. Updated in 1.17.0

WordLevelTimingResult

For a recognized word in speech audio, contains the offset to the start and the duration, in ticks. 1 tick = 100 ns. Added in 1.7.0

Enums

CancellationErrorCode

Lists error codes possible when CancellationReason is Error. Added in 1.1.0

CancellationReason

Lists the possible reasons a recognition result might be canceled.

NoMatchReason

Lists the possible reasons a recognition result was not recognized.

OutputFormat

Output format.

ProfanityOption

Removes profanity (swearing), or replaces letters of profane words with stars. Added in 1.5.0

PropertyId

Lists speech property IDs.

RecognitionFactorScope

Lists the scope that a recognition factor applies to.

ResultReason

Describes a recognition result.

ServicePropertyChannel

Lists channels used to pass property settings to service. Added in 1.5.0

SpeechSynthesisBoundaryType

Defines the boundary type of speech synthesis boundary event Added in version 1.21.0

SpeechSynthesisOutputFormat

Lists synthesis output audio formats.

StreamStatus

Lists possible status values of an audio data stream. Added in 1.4.0

SynthesisVoiceGender

Lists synthesis voice gender. Added in version 1.17.0

SynthesisVoiceType

Lists synthesis voice types.