Azure ASR returns different recognition results at different times with the same audio file

Question

Hi,

I am learning to use Azure Speech Service. I used the Speech CLI to send the exact same audio file to Azure twice, but I received different results. I have been testing the CLI parameters, so I sent the audio on 4/13 and again today, using the same command below:

spx recognize --file MyAudioFileName.wav --output batch file MyAudioFileName.json

But the results returned today were different from the results I received on 4/13.

Is this because the Azure ASR models or engine have been updated, or because something else changed? On my side, I used the same account, the same command, and the same audio file.
I want to get consistent results.
Please confirm the reason for the difference in the results.

Thanks,
Wayne

Accepted Answer

@Wayne Lee There have been no recent updates to the CLI or the endpoint versions that would cause such inconsistencies. Output can differ when the microphone is used, because of factors like audio quality and background noise, but with an audio file the result should be consistent. Is there any major difference in the output in your case? There could be minor updates to the endpoints to fix bugs that could affect the quality of the model, but these should not cause a major difference in the recognition output. Could you provide your audio and the responses from both runs, along with your resource details such as the region of your speech resource?
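
To judge whether the difference between the two runs is major or minor, a word-level diff of the two transcripts can help. The following is only a minimal sketch, assuming the recognized text from each run has already been copied out of the output files into plain strings. The function name compare_transcripts and the sample sentences are hypothetical and are not part of the Speech CLI or the Speech SDK.

```python
import difflib


def compare_transcripts(old_text, new_text):
    """Return (similarity ratio, list of word-level differences).

    Each difference is a tuple (operation, old_words, new_words), where
    operation is 'replace', 'delete', or 'insert'.
    """
    # Compare case-insensitively, word by word.
    old_words = old_text.lower().split()
    new_words = new_text.lower().split()

    matcher = difflib.SequenceMatcher(a=old_words, b=new_words, autojunk=False)

    diffs = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            diffs.append(
                (tag, " ".join(old_words[i1:i2]), " ".join(new_words[j1:j2]))
            )
    return matcher.ratio(), diffs


if __name__ == "__main__":
    # Hypothetical transcripts standing in for the 4/13 run and today's run.
    run_one = "please turn on the light in the kitchen"
    run_two = "please turn on the lights in the kitchen"

    ratio, diffs = compare_transcripts(run_one, run_two)
    print(f"similarity: {ratio:.2f}")
    for operation, old, new in diffs:
        print(f"{operation}: '{old}' -> '{new}'")
```

A ratio close to 1.0 with only a few single-word changes would point to the minor kind of difference described above. A low ratio would be worth including when sharing the audio and both responses.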

