How can we do language detection for an audio URL before performing speech to text with the Cognitive Services API?

Jeb Million 21 Reputation points
2021-07-19T08:40:21.393+00:00

Hello all,

I can use the speech-to-text batch transcription API by providing a single language as part of the request body. I can also detect the audio language beforehand for audio files in my local directory and pass that to the request body.

But I want to detect the language directly from the audio URL and pass that to the request body:

{
  "contentUrls": [
    "<URL to an audio file 1 to transcribe>"
  ],
  "properties": {
    "wordLevelTimestampsEnabled": true
  },
  "locale": "<detected language>",
  "model": {
    "self": "https://westus.api.cognitive.microsoft.com/speechtotext/v3.0/models/{id}"
  },
  "displayName": "Transcription of file using default model for en-US"
}
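
For illustration, this is roughly how I would post the body once the locale is filled in. A minimal C# sketch against the v3.0 transcriptions endpoint; the region, key, locale value, and audio URL are placeholders:

using System;
using System.Net.Http;
using System.Text;
using System.Text.Json;
using System.Threading.Tasks;

class SubmitTranscription
{
    static async Task Main()
    {
        // Placeholders: substitute your region, key, and the detected locale.
        string region = "westus";
        string subscriptionKey = "<your-speech-key>";
        string detectedLocale = "en-US"; // output of the language detection step

        // Same body as above, with the detected locale filled in.
        // "model" is omitted here, so the service default model for the locale is used.
        string body = JsonSerializer.Serialize(new
        {
            contentUrls = new[] { "<URL to an audio file 1 to transcribe>" },
            properties = new { wordLevelTimestampsEnabled = true },
            locale = detectedLocale,
            displayName = $"Transcription of file using default model for {detectedLocale}"
        });

        using var client = new HttpClient();
        client.DefaultRequestHeaders.Add("Ocp-Apim-Subscription-Key", subscriptionKey);

        // Create the transcription job on the v3.0 batch endpoint.
        HttpResponseMessage response = await client.PostAsync(
            $"https://{region}.api.cognitive.microsoft.com/speechtotext/v3.0/transcriptions",
            new StringContent(body, Encoding.UTF8, "application/json"));

        Console.WriteLine(response.StatusCode);
        Console.WriteLine(await response.Content.ReadAsStringAsync());
    }
}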

Is there any way to do the same?

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.

1 answer

  1. Ramr-msft 17,616 Reputation points
    2021-07-19T13:25:53.18+00:00

    @Jeb Million Thanks for the question. Language identification can be used to determine the language being spoken in audio that has been passed to the Speech SDK.

    Please follow this document for a sample using C#:
    https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/how-to-automatic-language-detection?pivots=programming-language-csharp
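
    Since the Speech SDK reads local files or streams rather than URLs, one way to cover your scenario is to download the audio behind the URL first, run language identification on it, and then pass the detected locale as the "locale" of the batch request. A minimal sketch in C#, assuming a downloadable WAV file and en-US/de-DE as the candidate languages (key, region, and URL are placeholders):

    using System;
    using System.IO;
    using System.Net.Http;
    using System.Threading.Tasks;
    using Microsoft.CognitiveServices.Speech;
    using Microsoft.CognitiveServices.Speech.Audio;

    class DetectLanguage
    {
        static async Task Main()
        {
            // Placeholders: substitute your key, region, and audio URL.
            var speechConfig = SpeechConfig.FromSubscription("<your-speech-key>", "westus");
            string audioUrl = "<URL to an audio file 1 to transcribe>";

            // The Speech SDK reads local files or streams, not URLs, so download the audio first.
            using var http = new HttpClient();
            byte[] audio = await http.GetByteArrayAsync(audioUrl);
            string localPath = Path.Combine(Path.GetTempPath(), "detect.wav");
            await File.WriteAllBytesAsync(localPath, audio);

            // Candidate locales the detector chooses between.
            var autoDetect = AutoDetectSourceLanguageConfig.FromLanguages(new[] { "en-US", "de-DE" });

            using var audioConfig = AudioConfig.FromWavFileInput(localPath);
            using var recognizer = new SpeechRecognizer(speechConfig, autoDetect, audioConfig);

            // Recognize the start of the audio; the result carries the detected language.
            SpeechRecognitionResult result = await recognizer.RecognizeOnceAsync();
            var detected = AutoDetectSourceLanguageResult.FromResult(result);
            Console.WriteLine($"Detected locale: {detected.Language}");
        }
    }

    The detected locale (for example "en-US") can then be substituted into the "locale" field of the request body shown in the question.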

    1 person found this answer helpful.