question

VergeestLCLucas-5282 avatar image
0 Votes"
VergeestLCLucas-5282 asked GiftA-MSFT answered

Speech recognition engine has difficulties recognizing separate letters (nl_NL)

I have noticed that when pronouncing separate letters in a test set, the engine has a lot of difficulty to recognize them (Dutch language).
Of course this is not a big surprise, since there is no meaningful context, and the phoneme clusterse are relatively short.

However, I was wondering, does anyone have suggestions as how to improve the WER for this?

Perhaps just adding a training set of separate letters, including annotations?
Or would it be better to encourage customers to pronounce letters as full names, eg. when pronouncing zip codes or license plates?

azure-speech
5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.

1 Answer

GiftA-MSFT avatar image
0 Votes"
GiftA-MSFT answered

Hi, the first step to improve recognition is to identify the type of error (Insertion, Deletion , Substitution). You can improve recognition results by adding training data such as related text sentences, audio with human-labeled transcripts, new words with pronunciation. However, understanding the type of error that exists helps you to apply the best approach. Please review Evaluate and improve Custom Speech Accuracy for more details.



--- Kindly Accept Answer if the information helps. Thanks.

5 |1600 characters needed characters left characters exceeded

Up to 10 attachments (including images) can be used with a maximum of 3.0 MiB each and 30.0 MiB total.