Hi @Chuong Phung,
Thank you for reaching out regarding the Azure Speech Recognition service. I was able to reproduce your scenario, and I found that the recognition results are quite strong when using the service alone. With the pronunciation assessment feature, I was also able to achieve meaningful insights.
However, I noticed that some pronunciation challenges in the audio may have affected the assessment results. Clear pronunciation is essential for optimal performance, and focusing on specific areas of improvement could enhance the effectiveness of the assessment.
Here are a couple of suggestions that might help improve the results:
- Reference Text Accuracy: Ensure that the reference text used for the pronunciation assessment closely matches the spoken content in the audio. Any discrepancies can lead to lower assessment scores.
- Audio Quality: Clearer audio with minimal background noise often leads to better performance in both recognition and assessment. If it's possible to provide a cleaner audio sample, it might enhance the outcome.
Refer to the python sdk here: cognitive-services-speech-sdk.
Screen-shot for reference:
If you continue to face any issues, please let us know, and we will escalate this issue to the relevant team for further assistance.
Thank you.
If this answers your query, do click Accept Answer
and Yes
for was this answer helpful.