We are using Azure Batch Transcription to transcribe audio to text. With Batch Transcription it’s only possible to specify which language model to use for transcription. With the SDK it’s possible to use “automatic language detection for speech to text”.
What we would like is to use the same automatic language detection or perform language detection up-front before we call Azure Batch Transcription.
I also found a blog post about ”Automatically detect audio language with the Speech Language Detection Container”. But I can find any information on how to use those containers and the API to perform Speech Language Detection””.
Can you provider me with more information about:
- Language detection in conjunction with Batch Transcription.
- Speech Language Detection Container