Azure Speech-to-Text: Accuracy difference between REST API and SDK for short audio
Kun Wu
146
Reputation points Microsoft Employee
Hello,
I have a lot of short WAV audio files, each about 5 seconds long. When I transcribe them with the Azure Speech-to-Text REST API and with the Java SDK, the REST API's recognition accuracy seems to be consistently slightly worse than the Java SDK's, although the gap is less than 1% CER (Character Error Rate).
- REST API: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/rest-speech-to-text#speech-to-text-rest-api-for-short-audio
- Java SDK: https://learn.microsoft.com/en-us/java/api/com.microsoft.cognitiveservices.speech.speechrecognizer.startcontinuousrecognitionasync?view=azure-java-stable#com_microsoft_cognitiveservices_speech_SpeechRecognizer_startContinuousRecognitionAsync__
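For reference, CER here is assumed to mean the character-level edit distance between the reference transcript and the recognized text, divided by the reference length. A minimal sketch of that calculation is below; the class `Cer` and its methods are hypothetical helpers (not part of the Speech SDK), and no text normalization (casing, punctuation, whitespace) is applied, which by itself can shift CER slightly between two outputs.

```java
// Hypothetical helper, not part of the Azure Speech SDK.
final class Cer {
    private Cer() {}

    /** Levenshtein distance between two strings, counted in UTF-16 chars. */
    static int editDistance(String ref, String hyp) {
        int[] prev = new int[hyp.length() + 1];
        int[] curr = new int[hyp.length() + 1];
        // Distance from an empty reference prefix is j insertions.
        for (int j = 0; j <= hyp.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= ref.length(); i++) {
            curr[0] = i; // i deletions to reach an empty hypothesis prefix
            for (int j = 1; j <= hyp.length(); j++) {
                int cost = ref.charAt(i - 1) == hyp.charAt(j - 1) ? 0 : 1;
                int insertion = curr[j - 1] + 1;
                int deletion = prev[j] + 1;
                int substitution = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insertion, deletion), substitution);
            }
            // Reuse the two rows instead of allocating a full matrix.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[hyp.length()];
    }

    /** Character Error Rate: edit distance divided by reference length. */
    static double cer(String ref, String hyp) {
        if (ref.isEmpty()) {
            throw new IllegalArgumentException("reference must not be empty");
        }
        return (double) editDistance(ref, hyp) / ref.length();
    }
}
```

With this definition, `Cer.cer(reference, restOutput)` and `Cer.cer(reference, sdkOutput)` can be compared per file or averaged over the whole set.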
Why is there such a gap between the REST API and the SDK?
Thank you.