Azure Speech-To-Text: Accuracy difference b/w Rest API vs. SDK for short audio

Kun Wu 146 Reputation points Microsoft Employee
2021-08-24T02:49:04.897+00:00

Hello,

i have lot of short audio wave files of 5 seconds or so in hand. When i transcribe them with Azure Speech-To-Text REST API and Java SDK respectively, i found REST API recognition accuracy seems always a little bit worse than that of Java SDK, though the gap is less than 1% CER (Character Error Rate).

Why there is such a gap b/w REST and SDK ?

Thank you.

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,774 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.