Issues with Recognizing Mixed Thai and English Audio in Azure Speech Service

santoshkc 8,940 Reputation points Microsoft Vendor
2024-07-31T11:02:48.8766667+00:00

How can I improve the accuracy of recognizing mixed Thai and English audio using Azure Speech Service?

PS - Based on common issues that we have seen from customers and other sources, we are posting these questions to help the Azure community.

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

1 answer

  1. santoshkc 8,940 Reputation points Microsoft Vendor
    2024-07-31T11:06:15.34+00:00

    Greetings!

    When you use a custom model based on the Thai language in Azure Speech Service to recognize sentences that mix Thai and English, the English portions may be recognized inaccurately. This happens because the Thai base model is built for a single language and is not designed to support multiple languages. A Phrase List might improve recognition slightly, but it is not intended for this purpose and will generally favour Thai phrases.

    To improve recognition accuracy for mixed-language audio, use the continuous language identification feature of Azure Speech. This feature lets the service identify the spoken language dynamically during continuous recognition. Note that language switching works effectively only when each language is spoken in full sentences rather than as isolated words.

    For more details on language identification and how to implement it, refer to the language identification documentation for Azure AI Speech.

    Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help.

    Please do not forget to "up-vote" wherever the information provided helps you, as this can be beneficial to other community members.
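
The continuous language identification approach described in the answer can be sketched roughly as follows with the Speech SDK for Python (`azure-cognitiveservices-speech`). This is a minimal sketch rather than a definitive implementation: the function names `format_segment` and `transcribe_mixed_audio` are hypothetical, `key`, `region`, and `wav_path` are placeholders you would supply yourself, and it assumes a reasonably recent SDK version in which continuous mode is enabled through the `SpeechServiceConnection_LanguageIdMode` property. It uses the base Thai and English models, not a custom model.

```python
import time


def format_segment(language: str, text: str) -> str:
    """Label one recognized segment with the locale detected for it."""
    return f"[{language}] {text}"


def transcribe_mixed_audio(key, region, wav_path, locales=("th-TH", "en-US")):
    """Hypothetical helper: transcribe a WAV file with continuous language ID."""
    # Imported lazily so format_segment can be used without the SDK installed.
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    # Ask the service to keep re-identifying the language during recognition.
    speech_config.set_property(
        property_id=speechsdk.PropertyId.SpeechServiceConnection_LanguageIdMode,
        value="Continuous",
    )
    # Candidate languages the service may switch between.
    auto_detect = speechsdk.languageconfig.AutoDetectSourceLanguageConfig(
        languages=list(locales)
    )
    audio_config = speechsdk.audio.AudioConfig(filename=wav_path)
    recognizer = speechsdk.SpeechRecognizer(
        speech_config=speech_config,
        auto_detect_source_language_config=auto_detect,
        audio_config=audio_config,
    )

    segments = []
    done = False

    def on_recognized(evt):
        # Each final result carries the language detected for that segment.
        if evt.result.reason == speechsdk.ResultReason.RecognizedSpeech:
            detected = speechsdk.AutoDetectSourceLanguageResult(evt.result).language
            segments.append(format_segment(detected, evt.result.text))

    def on_stop(evt):
        nonlocal done
        done = True

    recognizer.recognized.connect(on_recognized)
    recognizer.session_stopped.connect(on_stop)
    recognizer.canceled.connect(on_stop)

    recognizer.start_continuous_recognition()
    while not done:
        time.sleep(0.5)
    recognizer.stop_continuous_recognition()
    return segments
```

Calling `transcribe_mixed_audio("<your-key>", "<your-region>", "mixed.wav")` would return one labelled string per recognized segment, which makes it easy to check whether the detected language actually switches where the speaker does. As the answer notes, switching is likely to work better when each language is spoken in full sentences.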

