com.microsoft.cognitiveservices.speech

Reference

Package:: com.microsoft.cognitiveservices.speech

Maven Artifact:: com.microsoft.cognitiveservices.speech:client-sdk:1.38.0

Classes

AudioDataStream	Represents audio data stream used for operating audio data as a stream.
AutoDetectSourceLanguageConfig	Represents auto detect source language configuration used for specifying the possible source language candidates Note: close() must be called in order to release underlying resources held by the object.
AutoDetectSourceLanguageResult	Represents the result of auto detecting source languages Added in version 1.8.0
CancellationDetails	Contains detailed information about why a result was canceled.
ClassLanguageModel	Represents a ClassLanguageModel.
Connection	Connection is a proxy class for managing connection to the speech service of the specified Recognizer.
ConnectionEventArgs	Defines payload for connection events like Connected/Disconnected.
ConnectionMessage	ConnectionMessage represents implementation specific messages sent to and received from the speech service.
ConnectionMessageEventArgs	Defines payload for Connection's MessageReceived events.
ContentAssessmentResult	Represents the result of pronunciation assessment.
Diagnostics	Native logging and other diagnostics
EmbeddedSpeechConfig	Class that defines embedded (offline) speech configuration.
Grammar	Represents a generic grammar used to assist in improving speech recogniton accuracy.
GrammarList	Allows adding multiple grammars to a SpeechRecognizer to improve the accuracy of speech recognition.
HybridSpeechConfig	Class that defines hybrid (cloud and embedded) configurations for speech recognition and speech synthesis.
KeywordRecognitionEventArgs	Defines content of an keyword recognizing/recognized events.
KeywordRecognitionModel	Represents a keyword recognition model for recognizing when the user says a keyword to initiate further speech recognition.
KeywordRecognitionResult	Defines result of keyword recognition.
KeywordRecognizer	Performs keyword recognition on the speech input.
NoMatchDetails	Contains detailed information for NoMatch recognition results.
PhraseListGrammar	Allows additions of new phrases to improve speech recognition.
PronunciationAssessmentConfig	Represents pronunciation assessment configuration.
PronunciationAssessmentResult	Represents the result of pronunciation assessment.
PropertyCollection	Represents collection of properties and their values.
RecognitionEventArgs	Defines payload for recognition events like Speech Start/End Detected
RecognitionResult	Contains detailed information about result of a recognition operation.
Recognizer	Defines the base class Recognizer which mainly contains common event handlers.
SessionEventArgs	Defines payload for SessionStarted/Stopped events.
SourceLanguageConfig	Represents source language configuration used for specifying recognition source language.
SpeechConfig	Speech configuration.
SpeechRecognitionCanceledEventArgs	Defines payload of speech recognition canceled events.
SpeechRecognitionEventArgs	Defines contents of speech recognizing/recognized event.
SpeechRecognitionModel	Contains detailed speech recognition model information.
SpeechRecognitionResult	Defines result of speech recognition.
SpeechRecognizer	Performs speech recognition from microphone, file, or other audio input streams, and gets transcribed text as result.
SpeechSynthesisBookmarkEventArgs	Defines contents of speech synthesis bookmark event.
SpeechSynthesisCancellationDetails	Contains detailed information about why a speech synthesis was canceled.
SpeechSynthesisEventArgs	Defines contents of speech synthesis related event.
SpeechSynthesisResult	Contains detailed information about result of a speech synthesis operation.
SpeechSynthesisVisemeEventArgs	Defines contents of speech synthesis viseme event.
SpeechSynthesisWordBoundaryEventArgs	Defines contents of speech synthesis word boundary event.
SpeechSynthesizer	Performs speech synthesis to speaker, file, or other audio output streams, and gets synthesized audio as result.
SpeechTranslationModel	Contains detailed speech translation model information.
SynthesisVoicesResult	Contains detailed information about the retrieved synthesis voices list.
VoiceInfo	Contains detailed information about the synthesis voice information.

Enums

CancellationErrorCode	Defines error code in case that CancellationReason is Error.
CancellationReason	Defines the possible reasons a recognition result might be canceled.
NoMatchReason	Defines the possible reasons a recognition result might not be recognized.
OutputFormat	Define Speech Recognizer output formats.
ProfanityOption	Define profanity option for response result.
PronunciationAssessmentGradingSystem	Defines the point system for pronunciation score calibration; default value is FivePoint.
PronunciationAssessmentGranularity	Defines the pronunciation evaluation granularity; default value is Phoneme.
PropertyId	Defines property ids.
ResultReason	Defines the possible reasons a recognition result might be generated.
ServicePropertyChannel	Defines channels used to send service properties.
SpeechSynthesisBoundaryType	Defines the boundary type of speech synthesis boundary event.
SpeechSynthesisOutputFormat	Defines the possible speech synthesis output audio format.
StreamStatus	Defines the possible status of audio data stream.
SynthesisVoiceGender	Define synthesis voice gender.
SynthesisVoiceType	Define synthesis voice type.

Share via

com.microsoft.cognitiveservices.speech

Classes

Enums

Feedback

Additional resources