Hello @Lokesh V
Thanks for reaching out to us, I think you are mentioning Pronunciation assessment of Azure AI Speech.
Pronunciation assessment uses the Speech to text capability to provide subjective and objective feedback for language learners. Practicing pronunciation and getting timely feedback are essential for improving language skills. Assessments driven by experienced teachers can take a lot of time and effort and makes a high-quality assessment expensive for learners. Pronunciation assessment can help make the language assessment more engaging and accessible to learners of all backgrounds.
Pronunciation assessment provides various assessment results in different granularities, from individual phonemes to the entire text input.
- At the full-text level, pronunciation assessment offers additional Fluency and Completeness scores: Fluency indicates how closely the speech matches a native speaker's use of silent breaks between words, and Completeness indicates how many words are pronounced in the speech to the reference text input. An overall score aggregated from Accuracy, Fluency and Completeness is then given to indicate the overall pronunciation quality of the given speech.
- At the word-level, pronunciation assessment can automatically detect miscues and provide accuracy score simultaneously, which provides more detailed information on omission, repetition, insertions, and mispronunciation in the given speech.
- Syllable-level accuracy scores are currently available via the JSON file or Speech SDK.
- At the phoneme level, pronunciation assessment provides accuracy scores of each phoneme, helping learners to better understand the pronunciation details of their speech.
Please refer to the Python SDK document -
Only one if you just want to do pronunciation assessment, please check out easy-to-deploy Pronunciation Assessment demo and watch the video tutorial of pronunciation assessment to see more details.
For cost, it is charged as standard Speech to Text, example:
For evaluation of 8 seconds of speech, you will be charged around $0.0023
More information, please refer to pricing page for Speech to Text - https://azure.microsoft.com/en-us/pricing/details/cognitive-services/speech-services/
I hope this helps!
Please let me know if you need more information and the community is happy to help.
Regards,
Yutong
-Please kindly accept the answer and vote 'Yes' if you feel helpful to support the community, thanks a lot.