Where can I find the list of supported languages for Azure real-time speech-to-text transcription?

Question

Johan S Daniel 0

https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=stt

Hello, I'm reaching out regarding the list of supported languages for the real-time speech-to-text transcription model.

I understand it is a Universal model under the hood and that fast transcription is a separate model.

I cross-checked several Indic languages to see how fast transcription and real-time speech transcription differ, but unfortunately the API documentation does not say which languages are supported for real-time speech transcription.

Some languages do not output Unicode characters but do recognize the end of sentences.

I would be really grateful if a support agent could point me to a resource that has the list of supported languages for real-time stt transcription.

Thank you.

navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2025-01-06T04:53:54.2933333+00:00

@Johan S Daniel Welcome to the Microsoft Q&A forum, and thank you for posting your query here!

Just following up to check whether the answer below helped. If it answers your query, please click "Accept the answer", which might be beneficial to other community members reading this thread. If you have any further queries, do let me know. I would be happy to help.
Johan S Daniel 0 Reputation points

2025-01-06T15:17:21.3766667+00:00

Oh well surprise.

Turns out the model does support all the languages in the list.

It's just that the Visual Studio Code Terminal did not support the rendering of the Indic Unicode characters.

But thank you everyone for the assistance.
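
For anyone who hits the same symptom, here is a minimal Python sketch (not part of the original exchange) for checking a recognized string without depending on how the terminal renders it. The helper name `describe_text` and the sample string are assumptions for illustration; in practice the string would be the text returned by the recognizer.

```python
import unicodedata


def describe_text(text: str) -> list[str]:
    """Return one 'U+XXXX NAME' entry per non-space character in text."""
    return [
        f"U+{ord(ch):04X} {unicodedata.name(ch, 'UNKNOWN')}"
        for ch in text
        if not ch.isspace()
    ]


# Stand-in for a recognition result (Hindi "namaste"), written with escapes
# so the source file itself does not depend on the editor's font.
sample = "\u0928\u092e\u0938\u094d\u0924\u0947"

# ascii() shows the escaped code points even if the terminal cannot draw them.
print(ascii(sample))

# One line per code point, with its official Unicode name.
for entry in describe_text(sample):
    print(entry)
```

If the names printed belong to the expected script, the transcript is fine and only the display is at fault. Writing the text to a file opened with `encoding="utf-8"` and viewing it in an editor with a suitable font is another way to confirm this.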
navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2025-01-07T06:27:58.0466667+00:00

@Johan S Daniel Thanks for your reply. Please click "Accept the answer", which might be beneficial to other community members reading this thread.

1 answer

Answer 1

As per https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=stt "The table in this section summarizes the locales supported for speech to text (real-time and batch transcription)."

Regarding your comments, real-time transcription leverages the Universal model, which is designed to cover a broad range of languages. It may support sentence detection even when full Unicode character recognition isn't available, which is why some Indic languages may appear to handle sentence endings but not produce expected text outputs. Sentence-ending recognition (e.g., punctuation or pauses) might still work because it relies on acoustic and prosodic cues rather than text encoding.

Fast transcription indeed uses a separate, optimized model for specific languages. The language support for this model tends to be more limited but is faster and more efficient for bulk transcription.

If the above response helps answer your question, remember to "Accept Answer" so that others in the community facing similar issues can easily find the solution. Your contribution is highly appreciated.

hth

Marcin

Johan S Daniel 0 Reputation points

2025-01-06T06:47:06.2133333+00:00

I see.

So if I wanted to check which languages are truly supported (i.e. output the relevant Unicode characters) by the Universal model, I'd have to do it by manual trial and error?
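
A rough sketch of how such a check could be automated rather than done by eye, assuming the transcripts have already been collected per locale. The mapping `EXPECTED_SCRIPT`, the helper `script_ratio`, and the sample transcripts are all hypothetical and only illustrate the idea; nothing here calls the Speech service.

```python
import unicodedata

# Hypothetical mapping from a locale to the Unicode script-name prefix
# expected in its transcripts. Extend it for the locales being tested.
EXPECTED_SCRIPT = {
    "hi-IN": "DEVANAGARI",
    "ta-IN": "TAMIL",
}


def script_ratio(text: str, script: str) -> float:
    """Fraction of letters and combining marks in text whose Unicode name
    starts with the given script prefix. Returns 0.0 for empty input."""
    chars = [ch for ch in text if unicodedata.category(ch)[0] in "LM"]
    if not chars:
        return 0.0
    hits = sum(1 for ch in chars if unicodedata.name(ch, "").startswith(script))
    return hits / len(chars)


# Stand-ins for recognition results, keyed by locale.
transcripts = {
    "hi-IN": "\u0928\u092e\u0938\u094d\u0924\u0947",
    "ta-IN": "\u0bb5\u0ba3\u0b95\u0bcd\u0b95\u0bae\u0bcd",
}

for locale, text in transcripts.items():
    ratio = script_ratio(text, EXPECTED_SCRIPT[locale])
    print(f"{locale}: {ratio:.0%} of characters are in {EXPECTED_SCRIPT[locale]}")
```

A ratio near zero for a locale would suggest the output is not in the expected script, while a high ratio means the text is correct even if the console shows boxes or question marks.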
