what is QA process for Azure TTS?

Eugene Pugin 0 Reputation points
2023-02-16T13:21:47.52+00:00

Hello!
We are considering Azure TTS as a solution to power our application for voice guidance - so, the quality of test is on us, but the quality of voice samples we'd like to get from Azure TTS. Does Azure TTS have a mechanism in place to verify whether the produced synthesis matches the provided input text?

BR, Eugene

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
2,061 questions
{count} votes

1 answer

Sort by: Most helpful
  1. VasaviLankipalle-MSFT 18,676 Reputation points Moderator
    2023-02-21T18:47:16.6433333+00:00

    Hi @Eugene Pugin , Thanks for using Microsoft Q&A Platform.

    I don't believe Azure Text-to-Speech (TTS) includes a mechanism for evaluating the produced synthesis to the input text. But Azure TTS has a feature called "pronunciation assessment" that can be used to verify whether the produced synthesis matches the provided input text. The Pronunciation Assessment measures three aspects of pronunciation: accuracy, fluency, and completeness. It can detect errors such as extra, missing, or repeated words when compared to the reference text. This information helps obtain more accurate scoring to be used as diagnosis information.

    To use this feature, you must provide Azure TTS with both the input text and the reference audio file. Then it returns a score based on the quality of the synthesized speech's pronunciation. This can assist you in making sure that your application's voice guidance is accurate and of the best quality.

    Please refer to this pronunciation assessment use cases page to see if that fits your requirement. There are some limitations please read this page.

    You can try out pronunciation assessment in speech studio.

    I hope this helps.

    Regards,
    Vasavi

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.