How to transcribe foreign names and words within English sentences

Hashim Khan 0 Reputation points
2024-05-23T17:05:03.7633333+00:00

I use Azure Speech to transcribe audio files in English through a Java application.

There are however some reoccuring foreign words and names (Arabic) used in the middle of the English sentences and these are not properly transcribed.

What is the best way to handle these?

Should I be looking into Custom speech, Phrase lists, Custom keywords or anything else? Any help and advice would be much appreciated.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,474 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. VasaviLankipalle-MSFT 15,241 Reputation points
    2024-05-23T23:10:01.36+00:00

    Hello @Hashim Khan , Thanks for using Microsoft Q&A Platform.

    I would suggest you creating a custom speech model which can be trained on your specific vocabulary, including the foreign words and names.

    A custom model can be used to augment the base model to improve recognition of domain-specific vocabulary specific to the application by providing text data to train the model.

    Here is the documentation you can refer to: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/custom-speech-overview

    I hope this helps.

    0 comments No comments