Does Azure Fast Transcription support auto language detection for single-language audio files (like Hindi or English) without needing the Continuous LID add-on, and is there any extra cost for using this feature? Also, please share the supported languages

Vaibhav Dewangan 0 Reputation points
2025-04-02T07:38:31.6533333+00:00

We are planning to use the Fast Transcription API in Azure Speech Service to transcribe audio files where each file contains only one spoken language, such as Hindi or English (India/US).

We have the following questions:

Does Fast Transcription support automatic language identification (at-start LID) for single-language audio files without requiring the Continuous Language Identification add-on?

Is there any additional cost involved in using this language identification feature for single-language detection within Fast Transcription?

Can you provide an official list of all supported languages/locales for the language identification feature specifically in Fast Transcription?

Thank you!

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

1 answer

  1. SriLakshmi C 4,230 Reputation points Microsoft External Staff
    2025-04-02T10:30:17.0833333+00:00

    Hello @Vaibhav Dewangan,

    Does Fast Transcription support automatic language identification (at-start LID) for single-language audio files without requiring the Continuous Language Identification add-on?

    Yes. Azure Fast Transcription supports automatic language identification (at-start LID) for single-language audio files without requiring the Continuous Language Identification add-on. The service detects the spoken language at the beginning of the audio, so you can transcribe without knowing the language in advance. This is particularly useful for audio files that contain only one language, such as Hindi or English. To use this functionality, enable language identification by configuring the languageIdentification parameter in your transcription request.
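
As a rough sketch of what enabling language identification in a request might look like: the snippet below assumes the request shape described in the Fast Transcription REST documentation, where candidate locales are passed as a `locales` array inside a JSON `definition` form field (the `languageIdentification` property name is, as far as I know, the one used by the batch transcription API). The endpoint path, the `api-version` value, the region, and the helper names `build_endpoint` and `build_definition` are all assumptions for illustration, so please check them against the current docs before relying on them.

```python
import json

# Assumed values: the region, API version, and endpoint path are
# illustrative and should be checked against the current documentation.
REGION = "eastus"
API_VERSION = "2024-11-15"


def build_endpoint(region: str = REGION, api_version: str = API_VERSION) -> str:
    """Return the assumed Fast Transcription endpoint URL for a region."""
    return (
        f"https://{region}.api.cognitive.microsoft.com"
        f"/speechtotext/transcriptions:transcribe?api-version={api_version}"
    )


def build_definition(candidate_locales: list[str]) -> str:
    """Serialize the 'definition' form field with the candidate locales.

    A single locale requests plain transcription in that locale; several
    locales ask the service to identify which one is spoken (assumed
    behaviour, per the request shape described above).
    """
    # Drop duplicates while keeping the caller's order.
    unique = list(dict.fromkeys(candidate_locales))
    return json.dumps({"locales": unique})


if __name__ == "__main__":
    print(build_endpoint())
    print(build_definition(["en-US", "en-IN", "hi-IN"]))
```

The resulting string would typically be sent as the `definition` part of a multipart/form-data POST, alongside the audio file part and the Speech resource key header. Sending the request is left out here so the sketch stays runnable without credentials.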

    Is there any additional cost involved in using this language identification feature for single-language detection within Fast Transcription?

    Based on the information currently available, using the at-start language identification feature with Azure Fast Transcription does not incur additional charges beyond the standard pricing for the service. Please review the latest details on the Azure AI Speech pricing page to confirm that there have been no recent changes.

    Can you provide an official list of all supported languages/locales for the language identification feature specifically in Fast Transcription?

    Here are the supported languages:

    Locale | Language | Fast transcription support | Custom speech support
    de-DE | German (Germany) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list
    en-GB | English (United Kingdom) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list
    en-IN | English (India) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list
    en-US | English (United States) | Yes | Audio + human-labeled transcript, Audio, Plain text, Structured text, Output format, Pronunciation, Phrase list
    es-ES | Spanish (Spain) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list
    es-MX | Spanish (Mexico) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list
    fr-FR | French (France) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list
    hi-IN | Hindi (India) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list
    it-IT | Italian (Italy) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list
    ja-JP | Japanese (Japan) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list
    ko-KR | Korean (Korea) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Phrase list
    pt-BR | Portuguese (Brazil) | Yes | Audio + human-labeled transcript, Plain text, Structured text, Output format, Pronunciation, Phrase list
    zh-CN | Chinese (Mandarin, Simplified) | Yes |

    For the full list, please refer to the Supported languages page.

    I hope this helps. Let me know if you have any further questions.


    If this answers your query, please click "Accept Answer" and select "Yes" for "Was this answer helpful?".

    Thank you!

