Speech recognition engine has difficulties recognizing separate letters (nl_NL)

Question

I have noticed that when separate letters are pronounced in a test set, the engine has a lot of difficulty recognizing them (Dutch language).
Of course this is not a big surprise, since there is no meaningful context and the phoneme clusters are relatively short.

However, I was wondering: does anyone have suggestions on how to improve the WER for this?

Perhaps just adding a training set of separate letters, including annotations?
Or would it be better to encourage customers to pronounce letters as full names, e.g. when pronouncing zip codes or license plates?

Answer

Hi, the first step to improve recognition is to identify the type of error (insertion, deletion, or substitution). You can improve recognition results by adding training data such as related text sentences, audio with human-labeled transcripts, or new words with pronunciations. However, understanding which type of error occurs helps you apply the best approach. Please review "Evaluate and improve Custom Speech accuracy" for more details.
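
To make the error-type step concrete, here is a minimal sketch in plain Python that aligns a reference transcript with a recognized one and counts substitutions, deletions, and insertions. The function name `error_breakdown` and the sample strings are made up for illustration and are not part of any Speech SDK. It assumes both transcripts are already normalized and whitespace-tokenized, so each spoken letter is one token.

```python
def error_breakdown(reference: str, hypothesis: str) -> dict:
    """Count word-level substitutions, deletions and insertions (illustrative helper)."""
    ref = reference.split()
    hyp = hypothesis.split()

    # dp[i][j] = (total_cost, substitutions, deletions, insertions)
    # for aligning ref[:i] with hyp[:j].
    dp = [[None] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    dp[0][0] = (0, 0, 0, 0)
    for i in range(1, len(ref) + 1):
        dp[i][0] = (i, 0, i, 0)  # every reference token was deleted
    for j in range(1, len(hyp) + 1):
        dp[0][j] = (j, 0, 0, j)  # every hypothesis token was inserted

    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost, subs, dels, ins = dp[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diagonal = (cost, subs, dels, ins)          # match, no error
            else:
                diagonal = (cost + 1, subs + 1, dels, ins)  # substitution
            cost, subs, dels, ins = dp[i - 1][j]
            deletion = (cost + 1, subs, dels + 1, ins)
            cost, subs, dels, ins = dp[i][j - 1]
            insertion = (cost + 1, subs, dels, ins + 1)
            # Keep the cheapest alignment; ties go to the first candidate.
            dp[i][j] = min(diagonal, deletion, insertion, key=lambda t: t[0])

    cost, subs, dels, ins = dp[len(ref)][len(hyp)]
    wer = cost / len(ref) if ref else 0.0
    return {"substitutions": subs, "deletions": dels, "insertions": ins, "wer": wer}


# Made-up example: "b" recognized as "p", and the final "d" dropped.
print(error_breakdown("a b c d", "a p c"))
```

If substitutions between similar-sounding letters dominate, more labeled audio of spelled-out letters is probably the most useful addition; if short letters are mostly deleted, that may point to a different fix.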

