Pronunciation check for japanese language

Lokesh V 0 Reputation points
2023-09-28T17:01:28.6733333+00:00

I want to check pronunciation for japanese language.
User will speak japanese language and I want to identify which word is wrongly pronounced.

Users will speak Japanese words or phrases.

The system should compare the spoken words to correct pronunciations.

If there's a mismatch or mispronunciation, the system should detect and highlight it.

Using Python

Thanks in Advance!

Windows
Windows
A family of Microsoft operating systems that run across personal computers, tablets, laptops, phones, internet of things devices, self-contained mixed reality headsets, large collaboration screens, and other devices.
5,404 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,081 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,856 questions
{count} votes

1 answer

Sort by: Most helpful
  1. YutongTie-MSFT 51,756 Reputation points
    2023-09-29T09:12:09.38+00:00

    Hello @Lokesh V

    Thanks for reaching out to us, I think you are mentioning Pronunciation assessment of Azure AI Speech.

    Pronunciation assessment uses the Speech to text capability to provide subjective and objective feedback for language learners. Practicing pronunciation and getting timely feedback are essential for improving language skills. Assessments driven by experienced teachers can take a lot of time and effort and makes a high-quality assessment expensive for learners. Pronunciation assessment can help make the language assessment more engaging and accessible to learners of all backgrounds.

    Pronunciation assessment provides various assessment results in different granularities, from individual phonemes to the entire text input.

    • At the full-text level, pronunciation assessment offers additional Fluency and Completeness scores: Fluency indicates how closely the speech matches a native speaker's use of silent breaks between words, and Completeness indicates how many words are pronounced in the speech to the reference text input. An overall score aggregated from Accuracy, Fluency and Completeness is then given to indicate the overall pronunciation quality of the given speech.
    • At the word-level, pronunciation assessment can automatically detect miscues and provide accuracy score simultaneously, which provides more detailed information on omission, repetition, insertions, and mispronunciation in the given speech.
    • Syllable-level accuracy scores are currently available via the JSON file or Speech SDK.
    • At the phoneme level, pronunciation assessment provides accuracy scores of each phoneme, helping learners to better understand the pronunciation details of their speech.

    Please refer to the Python SDK document -

    https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-pronunciation-assessment?pivots=programming-language-python

    Only one if you just want to do pronunciation assessment, please check out easy-to-deploy Pronunciation Assessment demo and watch the video tutorial of pronunciation assessment to see more details.

    For cost, it is charged as standard Speech to Text, example:
    For evaluation of 8 seconds of speech, you will be charged around $0.0023

    More information, please refer to pricing page for Speech to Text - https://azure.microsoft.com/en-us/pricing/details/cognitive-services/speech-services/

    I hope this helps!

    Please let me know if you need more information and the community is happy to help.

    Regards,

    Yutong

    -Please kindly accept the answer and vote 'Yes' if you feel helpful to support the community, thanks a lot.

    1 person found this answer helpful.
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.