Text to Speech: what should the endpoint return when the requested language is not spoken by the selected neural multilingual voice?

Nikita Khandare 60 Reputation points
2025-06-10T03:50:19.9633333+00:00

I am using the Text to Speech service and have selected a neural multilingual voice for my use case.
If I request a language that the voice does not speak, what output should the endpoint return?

For example, I am using the voice en-US-FableTurboMultilingualNeural,
which does not list Marathi as a supported language.

The response is unpredictable: the voice speaks in some other language, not Marathi.

I would expect an error code rather than audio output in an unknown language.

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

Accepted answer
  1. Pavankumar Purilla 8,570 Reputation points Microsoft External Staff Moderator
    2025-06-11T07:03:35.14+00:00

    Hi Nikita Khandare,

    I have reproduced this behavior using the sample SSML code provided. When Marathi text is used as input with a multilingual neural voice such as en-US-BrandonMultilingualNeural, the Text to Speech service successfully generates audio output. However, since Marathi is not officially supported by this voice, the pronunciation may be inaccurate or unclear.
    In such cases, the service does not return an error. Instead, it attempts to phonetically interpret the input using the closest matching phonemes from supported languages. This can result in speech output that sounds incorrect or resembles a different language.
    This behavior is expected because the service validates only the structure and syntax of the SSML input, not the compatibility of the language content with the selected voice. As a result, even if the spoken output is unintelligible or misleading, the system treats it as a valid request and proceeds to produce audio.
    To ensure proper pronunciation and meaningful output, it is recommended to use a voice that officially supports the intended language, as documented in the Azure Text to Speech language support list.
    I hope this information helps. Thank you!
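
Because the service does not reject an unsupported language, the check has to happen on the client before synthesis is requested. The following is a minimal sketch of such a guard, not an official API. It assumes you have already fetched the voice's entry from the voices list and that the entry carries a `Locale` string and, for multilingual voices, an optional `SecondaryLocaleList` of locale strings; confirm the exact shape against the response you actually receive. The names `UnsupportedLocaleError`, `supported_locales`, `voice_supports_locale`, `ensure_locale_supported`, and the `EXAMPLE_VOICE` entry (including its locale values) are hypothetical and used only for illustration.

```python
class UnsupportedLocaleError(ValueError):
    """Raised when a voice entry does not declare the requested locale."""


def supported_locales(voice_entry):
    """Collect the locales a voice entry declares, lower-cased for comparison.

    Assumes 'Locale' is a string and 'SecondaryLocaleList' (if present)
    is a list of locale strings such as "fr-FR".
    """
    locales = [voice_entry["Locale"]]
    locales.extend(voice_entry.get("SecondaryLocaleList") or [])
    return {loc.lower() for loc in locales}


def voice_supports_locale(voice_entry, locale):
    """Return True if the voice entry declares the given locale."""
    return locale.lower() in supported_locales(voice_entry)


def ensure_locale_supported(voice_entry, locale):
    """Fail fast instead of sending text the voice is not documented to speak."""
    if not voice_supports_locale(voice_entry, locale):
        name = voice_entry.get("ShortName", "<unknown voice>")
        raise UnsupportedLocaleError(
            f"Voice {name} does not list locale {locale}; "
            "choose a voice that supports it before synthesizing."
        )


# Hypothetical entry for illustration only; these are NOT the real
# locales of any particular Azure voice.
EXAMPLE_VOICE = {
    "ShortName": "xx-XX-ExampleMultilingualNeural",
    "Locale": "en-US",
    "SecondaryLocaleList": ["fr-FR", "de-DE"],
}
```

Calling `ensure_locale_supported(EXAMPLE_VOICE, "mr-IN")` before the synthesis request raises `UnsupportedLocaleError`, which gives the application the error the question was expecting from the endpoint.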

