SpeechConfig Class

Package:: com.microsoft.cognitiveservices.speech

Maven Artifact:: com.microsoft.cognitiveservices.speech:client-sdk:1.48.1

java.lang.Object
- com.microsoft.cognitiveservices.speech.SpeechConfig

Implements

java.lang.AutoCloseable

public class SpeechConfig
implements java.lang.AutoCloseable

Speech configuration. Note: close() must be called in order to release underlying resources held by the object. Changed in version 1.7.0

Field Summary

Modifier and Type	Field and Description
static java.lang.Class<?>	speechConfigClass Accessor to load the native library.

Constructor Summary

Modifier	Constructor	Description
protected	SpeechConfig(long handleValue)	Internal constructor for speech configuration object.
protected	SpeechConfig(long configPtr, TokenCredential tokenCredential)	Creates a SpeechConfig instance using the given configuration pointer and token credential.

Method Summary

Modifier and Type	Method and Description
void	close() Dispose of associated resources.
void	enableAudioLogging() Enable audio logging in service.
void	enableDictation() Enable dictation.
static SpeechConfig	fromAuthorizationToken(String authorizationToken, String region) Creates an instance of a speech config with specified authorization token and service region.
static SpeechConfig	fromEndpoint(URI endpoint) Creates an instance of the speech config with specified endpoint.
static SpeechConfig	fromEndpoint(URI endpoint, KeyCredential keyCredential) Creates an instance of SpeechConfig with a custom endpoint and key credential.
static SpeechConfig	fromEndpoint(URI endpoint, String subscriptionKey) Creates an instance of the speech config with specified endpoint and subscription key.
static SpeechConfig	fromEndpoint(URI endpoint, TokenCredential credential) Creates a speech configuration instance using a specified endpoint and AAD token credential.
static SpeechConfig	fromHost(URI host) Creates an instance of the speech config with specified host.
static SpeechConfig	fromHost(URI host, String subscriptionKey) Creates an instance of the speech config with specified host and subscription key.
static SpeechConfig	fromSubscription(String subscriptionKey, String region) Creates an instance of a speech config with specified subscription key and service region.
java.lang.String	getAuthorizationToken() Gets the authorization token.
java.lang.String	getEndpointId() Gets the endpoint ID of a customized speech model that is used for speech recognition, or a custom voice model for speech synthesis.
SafeHandle	getImpl() Get the native handle to speech config object.
OutputFormat	getOutputFormat() Gets speech recognition output format (simple or detailed).
java.lang.String	getProperty(PropertyId id) Gets the property by propertyId.
java.lang.String	getProperty(String name) Gets a named property as value.
java.lang.String	getSpeechRecognitionLanguage() Gets the speech recognition language
java.lang.String	getSpeechSynthesisLanguage() Gets the speech synthesis language.
java.lang.String	getSpeechSynthesisOutputFormat() Gets the speech synthesis output format.
java.lang.String	getSpeechSynthesisVoiceName() Gets the speech synthesis voice name.
TokenCredential	getTokenCredential() Retrieves the TokenCredential used for authentication.
void	requestWordLevelTimestamps() Includes word-level timestamps.
void	setAuthorizationToken(String value) Sets the authorization token.
void	setEndpointId(String value) Sets the endpoint ID of a customized speech model that is used for speech recognition, or a custom voice model for speech synthesis.
void	setOutputFormat(OutputFormat format) Sets speech recognition output format (simple or detailed).
void	setProfanity(ProfanityOption profanity) Sets profanity option.
void	setProperty(PropertyId id, String value) Sets the property by propertyId.
void	setProperty(String name, String value) Sets a named property as value.
void	setProxy(String proxyHostName, int proxyPort, String proxyUserName, String proxyPassword) Sets proxy configuration Added in version 1.1.0.
void	setServiceProperty(String name, String value, ServicePropertyChannel channel) Sets a property value that will be passed to service using the specified channel.
void	setSpeechRecognitionLanguage(String value) Sets the speech recognition language
void	setSpeechSynthesisLanguage(String value) Sets the speech synthesis language.
void	setSpeechSynthesisOutputFormat(SpeechSynthesisOutputFormat value) Sets the speech synthesis output format.
void	setSpeechSynthesisVoiceName(String value) Sets the speech synthesis voice name.

Methods inherited from java.lang.Object

java.lang.Object.clone java.lang.Object.equals java.lang.Object.finalize java.lang.Object.getClass java.lang.Object.hashCode java.lang.Object.notify java.lang.Object.notifyAll java.lang.Object.toString java.lang.Object.wait java.lang.Object.wait java.lang.Object.wait

Field Details

speechConfigClass

public static Class<?> speechConfigClass

Accessor to load the native library. Holds the class active so the class GC does not reclaim it (and the local variables!)

Constructor Details

SpeechConfig

protected SpeechConfig(long handleValue)

Internal constructor for speech configuration object.

Parameters:

handleValue - The native handle.

SpeechConfig

protected SpeechConfig(long configPtr, TokenCredential tokenCredential)

Creates a SpeechConfig instance using the given configuration pointer and token credential.

Parameters:

configPtr - The pointer to the speech configuration.

tokenCredential - The TokenCredential for authentication.

Method Details

close

public void close()

Dispose of associated resources.

enableAudioLogging

public void enableAudioLogging()

Enable audio logging in service. Audio and content logs are stored either in Microsoft-owned storage, or in your own storage account linked to your Cognitive Services subscription (Bring Your Own Storage (BYOS) enabled Speech resource). Added in version 1.5.0.

enableDictation

public void enableDictation()

Enable dictation. Only supported in speech continuous recognition. Added in version 1.5.0.

fromAuthorizationToken

public static SpeechConfig fromAuthorizationToken(String authorizationToken, String region)

Creates an instance of a speech config with specified authorization token and service region. Note: The caller needs to ensure that the authorization token is valid. Before the authorization token expires, the caller needs to refresh it by calling this setter with a new valid token. As configuration values are copied when creating a new recognizer, the new token value will not apply to recognizers that have already been created. For recognizers that have been created before, you need to set authorization token of the corresponding recognizer to refresh the token. Otherwise, the recognizers will encounter errors during recognition.

Parameters:

authorizationToken - The authorization token.

region - The region name (see the region page).

Returns:

A speech config instance.

fromEndpoint

public static SpeechConfig fromEndpoint(URI endpoint)

Creates an instance of the speech config with specified endpoint. This method is intended only for users who use a non-standard service endpoint or parameters. Note: The query parameters specified in the endpoint URI are not changed, even if they are set by any other APIs. For example, if the recognition language is defined in URI as query parameter "language=de-DE", and also set by setSpeechRecognitionLanguage("en-US"), the language setting in URI takes precedence, and the effective language is "de-DE". Only the parameters that are not specified in the endpoint URI can be set by other APIs. Note: if the endpoint requires a subscription key for authentication, please use fromEndpoint(java.net.URI, String) to pass the subscription key as parameter. To use an authorization token with fromEndpoint, use this method to create a SpeechConfig instance, and then call setAuthorizationToken() on the created SpeechConfig instance. Note: Added in version 1.5.0.

Parameters:

endpoint - The service endpoint to connect to.

Returns:

A speech config instance.

fromEndpoint

public static SpeechConfig fromEndpoint(URI endpoint, KeyCredential keyCredential)

Creates an instance of SpeechConfig with a custom endpoint and key credential. The query parameters specified in the endpoint URI are not changed, even if they are set by any other API call. For example, if the recognition language is defined in the URI query parameter as "language=de-DE", and the property SpeechRecognitionLanguage is set to "en-US", the language set in the URI takes precedence, and "de-DE" remains the expected language. Since parameters included in the endpoint URI take priority, only parameters that are not specified in the endpoint URI can be set by other APIs. To use an authorization token with FromEndpoint, use FromEndpoint(java.net.URI), and then set the AuthorizationToken property on the new SpeechConfig instance.

Parameters:

endpoint - The service endpoint to connect to.

keyCredential - The key credential used to authenticate the request.

Returns:

A speech config instance.

fromEndpoint

public static SpeechConfig fromEndpoint(URI endpoint, String subscriptionKey)

Creates an instance of the speech config with specified endpoint and subscription key. This method is intended only for users who use a non-standard service endpoint or parameters. Note: The query parameters specified in the endpoint URI are not changed, even if they are set by any other APIs. Whether a specific query parameter is supported or not, depends on the endpoint and scenario. For example, if the recognition language is defined in URI as query parameter "language=de-DE", and also set by setSpeechRecognitionLanguage("en-US"), the language setting in URI takes precedence, and the effective language is "de-DE". The example only applies when the endpoint and scenario combination supports language as a query parameter. Only the parameters that are not specified in the endpoint URI can be set by other APIs. Note: To use an authorization token with fromEndpoint, please use fromEndpoint(java.net.URI), and then call setAuthorizationToken() on the created SpeechConfig instance.

Parameters:

endpoint - The service endpoint to connect to.

subscriptionKey - The subscription key.

Returns:

A speech config instance.

fromEndpoint

public static SpeechConfig fromEndpoint(URI endpoint, TokenCredential credential)

Creates a speech configuration instance using a specified endpoint and AAD token credential. Introduced in version 1.43.0. Note:This API is available for SpeechRecognizer, ConversationTranscriber and TranslationRecognizer. Note:This method requires a valid token credential. The authorization token is automatically refreshed before it expires. Note:Query parameters specified in the endpoint URI remain unchanged, even if modified through other APIs. The availability and support of specific query parameters depend on the endpoint and scenario. For example, if the URI contains the query parameter "language=de-DE", but the property SpeechRecognitionLanguage is set to "en-US", the URI setting takes precedence, and "de-DE" will be used as the recognition language. This behavior applies only when the endpoint and scenario combination supports language as a query parameter. Parameters not defined in the URI can be modified via other APIs. Note:If the endpoint requires authentication using a subscription key, use fromEndpoint(URI, String) to pass the key as a parameter. To authenticate with an authorization token, use fromEndpoint(URI), then set the AuthorizationToken property on the newly created SpeechConfig instance.

Parameters:

endpoint - The service endpoint with a custom domain. For more details, see: https://learn.microsoft.com/azure/ai-services/speech-service/speech-services-private-link?tabs=portal#create-a-custom-domain-name

credential - The AAD token credential used for authentication and token requests.

Returns:

A speech config instance.

fromHost

public static SpeechConfig fromHost(URI host)

Creates an instance of the speech config with specified host. This method is intended only for users who use a non-default service host. Standard resource path will be assumed. For services with a non-standard resource path or no path at all, use fromEndpoint instead. Note: Query parameters are not allowed in the host URI and must be set by other APIs. Note: If the host requires a subscription key for authentication, use fromHost(java.net.URI, String) to pass the subscription key as parameter. To use an authorization token with fromHost, use this method to create a SpeechConfig instance, and then call setAuthorizationToken() on the created SpeechConfig instance. Note: Added in version 1.8.0.

Parameters:

host - The service host to connect to. Format is "protocol://host:port" where ":port" is optional.

Returns:

A speech config instance.

fromHost

public static SpeechConfig fromHost(URI host, String subscriptionKey)

Creates an instance of the speech config with specified host and subscription key. This method is intended only for users who use a non-default service host. Standard resource path will be assumed. For services with a non-standard resource path or no path at all, use fromEndpoint instead. Note: Query parameters are not allowed in the host URI and must be set by other APIs. Note: To use an authorization token with fromHost, use fromHost(java.net.URI), and then call setAuthorizationToken() on the created SpeechConfig instance. Note: Added in version 1.8.0.

Parameters:

host - The service host to connect to. Format is "protocol://host:port" where ":port" is optional.

subscriptionKey - The subscription key.

Returns:

A speech config instance.

fromSubscription

public static SpeechConfig fromSubscription(String subscriptionKey, String region)

Creates an instance of a speech config with specified subscription key and service region.

Parameters:

subscriptionKey - The subscription key.

region - The region name (see the region page).

Returns:

A speech config instance.

getAuthorizationToken

public String getAuthorizationToken()

Gets the authorization token.

Returns:

The authorization token.

getEndpointId

public String getEndpointId()

Gets the endpoint ID of a customized speech model that is used for speech recognition, or a custom voice model for speech synthesis.

Returns:

The endpoint ID.

getImpl

public SafeHandle getImpl()

Get the native handle to speech config object.

Returns:

The native handle.

getOutputFormat

public OutputFormat getOutputFormat()

Gets speech recognition output format (simple or detailed). Note: This output format is for speech recognition results, use com.microsoft.cognitiveservices.speech.SpeechConfig#setSpeechSynthesisOutputFormat to set synthesized audio output format.

Returns:

Returns the speech recognition output format.

getProperty

public String getProperty(PropertyId id)

Gets the property by propertyId. Added in version 1.3.0.

Parameters:

id - PropertyId of the property.

Returns:

The value.

getProperty

public String getProperty(String name)

Gets a named property as value.

Parameters:

name - the name of the property.

Returns:

The value.

getSpeechRecognitionLanguage

public String getSpeechRecognitionLanguage()

Gets the speech recognition language

Returns:

Returns the recognition language.

getSpeechSynthesisLanguage

public String getSpeechSynthesisLanguage()

Gets the speech synthesis language. Added in version 1.7.0

Returns:

Returns the synthesis language.

getSpeechSynthesisOutputFormat

public String getSpeechSynthesisOutputFormat()

Gets the speech synthesis output format. Added in version 1.7.0

Returns:

Returns the synthesis output format.

getSpeechSynthesisVoiceName

public String getSpeechSynthesisVoiceName()

Gets the speech synthesis voice name. Added in version 1.7.0

Returns:

Returns the synthesis voice name.

getTokenCredential

public TokenCredential getTokenCredential()

Retrieves the TokenCredential used for authentication.

Returns:

The TokenCredential instance.

requestWordLevelTimestamps

public void requestWordLevelTimestamps()

Includes word-level timestamps. Added in version 1.5.0.

setAuthorizationToken

public void setAuthorizationToken(String value)

Sets the authorization token. Note: The caller needs to ensure that the authorization token is valid. Before the authorization token expires, the caller needs to refresh it by calling this setter with a new valid token. As configuration values are copied when creating a new recognizer, the new token value will not apply to recognizers that have already been created. For recognizers that have been created before, you need to set authorization token of the corresponding recognizer to refresh the token. Otherwise, the recognizers will encounter errors during recognition.

Parameters:

value - the authorization token.

setEndpointId

public void setEndpointId(String value)

Sets the endpoint ID of a customized speech model that is used for speech recognition, or a custom voice model for speech synthesis.

Parameters:

value - the endpoint ID.

setOutputFormat

public void setOutputFormat(OutputFormat format)

Sets speech recognition output format (simple or detailed). Note: This output format is for speech recognition results, use com.microsoft.cognitiveservices.speech.SpeechConfig#setSpeechSynthesisOutputFormat to set synthesized audio output format.

Parameters:

format - The speech recognition output format.

setProfanity

public void setProfanity(ProfanityOption profanity)

Sets profanity option. Added in version 1.5.0.

Parameters:

profanity - the property option to set.

setProperty

public void setProperty(PropertyId id, String value)

Sets the property by propertyId. Added in version 1.3.0.

Parameters:

id - PropertyId of the property.

value - The value.

setProperty

public void setProperty(String name, String value)

Sets a named property as value.

Parameters:

name - the name of the property.

value - the value.

setProxy

public void setProxy(String proxyHostName, int proxyPort, String proxyUserName, String proxyPassword)

Sets proxy configuration Added in version 1.1.0. Note: Proxy functionality is not available on macOS. This function will have no effect on this platform.

Parameters:

proxyHostName - the host name of the proxy server, without the protocol scheme (http://)

proxyPort - the port number of the proxy server.

proxyUserName - the user name of the proxy server. Use empty string if no user name is needed.

proxyPassword - the password of the proxy server. Use empty string if no user password is needed.

setServiceProperty

public void setServiceProperty(String name, String value, ServicePropertyChannel channel)

Sets a property value that will be passed to service using the specified channel. Added in version 1.5.0.

Parameters:

name - the property name.

value - the property value.

channel - the channel used to pass the specified property to service.

setSpeechRecognitionLanguage

public void setSpeechRecognitionLanguage(String value)

Sets the speech recognition language

Parameters:

value - the language identifier in BCP-47 format.

setSpeechSynthesisLanguage

public void setSpeechSynthesisLanguage(String value)

Sets the speech synthesis language. Added in version 1.7.0

Parameters:

value - the language identifier in BCP-47 format (e.g. en-US).

setSpeechSynthesisOutputFormat

public void setSpeechSynthesisOutputFormat(SpeechSynthesisOutputFormat value)

Sets the speech synthesis output format. Added in version 1.7.0

Parameters:

value - The synthesis output format ID (e.g. Riff16Khz16BitMonoPcm).

setSpeechSynthesisVoiceName

public void setSpeechSynthesisVoiceName(String value)

Sets the speech synthesis voice name. Added in version 1.7.0

Parameters:

value - The speech synthesis voice name.

Applies to

Feedback

Was this page helpful?

Share via

SpeechConfig Class

Field Summary

Constructor Summary

Method Summary

Methods inherited from java.lang.Object

Field Details

speechConfigClass

Constructor Details

SpeechConfig

SpeechConfig

Method Details

close

enableAudioLogging

enableDictation

fromAuthorizationToken

fromEndpoint

fromEndpoint

fromEndpoint

fromEndpoint

fromHost

fromHost

fromSubscription

getAuthorizationToken

getEndpointId

getImpl

getOutputFormat

getProperty

getProperty

getSpeechRecognitionLanguage

getSpeechSynthesisLanguage

getSpeechSynthesisOutputFormat

getSpeechSynthesisVoiceName

getTokenCredential

requestWordLevelTimestamps

setAuthorizationToken

setEndpointId

setOutputFormat

setProfanity

setProperty

setProperty

setProxy

setServiceProperty

setSpeechRecognitionLanguage

setSpeechSynthesisLanguage

setSpeechSynthesisOutputFormat

setSpeechSynthesisVoiceName

Applies to

Feedback