How to call a specific public base model ID in Streaming Speech-To-Text SDK ?

Question

Hello Speech-To-Text SDK team,

With Batch API i can call specific public base model ID (if not default) per https://learn.microsoft.com/en-us/azure/ai-services/speech-service/batch-transcription-create?pivots=rest-api#use-a-custom-model

Do you know with real-time Streaming API SDK, how can i call the same public specific base mode ID directly ?

Thank you.

Answer

To call a specific public base model ID in the Streaming Speech-To-Text SDK, you can specify the model ID in the SpeechConfig object when creating a SpeechRecognizer instance. Here's an example in C#:

// Replace "YourSubscriptionKey" and "YourServiceRegion" with your own values
var config = SpeechConfig.FromSubscription("YourSubscriptionKey", "YourServiceRegion");
config.SpeechRecognitionLanguage = "en-US";
config.SetProperty("cid", Guid.NewGuid().ToString()); // Optional: set a client ID for logging purposes
config.SetProperty("modelId", "YourBaseModelId"); // Replace "YourBaseModelId" with the ID of the base model you want to use

using var recognizer = new SpeechRecognizer(config);

And here's an example in Java:

// Replace "YourSubscriptionKey" and "YourServiceRegion" with your own values
SpeechConfig config = SpeechConfig.fromSubscription("YourSubscriptionKey", "YourServiceRegion");
config.setSpeechRecognitionLanguage("en-US");
config.setProperty("cid", UUID.randomUUID().toString()); // Optional: set a client ID for logging purposes
config.setProperty("modelId", "YourBaseModelId"); // Replace "YourBaseModelId" with the ID of the base model you want to use

try (SpeechRecognizer recognizer = new SpeechRecognizer(config)) {
    // Use the recognizer
}

Note that the modelId property is only applicable to the following locales: en-US, en-GB, de-DE, es-ES, fr-FR, it-IT, ja-JP, ko-KR, nl-NL, pt-BR, zh-CN, and zh-HK.

References:

Share via

How to call a specific public base model ID in Streaming Speech-To-Text SDK ?

1 answer

Your answer