Azure pronunciation assessment: async assessment

Iheb Jandoubi 5 Reputation points
2024-05-07T17:04:34.3266667+00:00

I'm using the Azure Speech recognizer SDK to do pronunciation assessment of an audio file. The problem is that when the speech is in French, the results are always low and not expressive.

    // Azure Speech SDK for JavaScript
    const sdk = require("microsoft-cognitiveservices-speech-sdk");

    const language = await detectSingleSpeechLanguage(text);

    // Connect to the pronunciation assessment resource
    const speechConfig = sdk.SpeechConfig.fromSubscription(
      "keys", // (key removed)
      "eastus"
    );
    // speechConfig.setProperty(); // incomplete call (no arguments)

    // Download the user's audio from blob storage
    const audioBlobClient = await storageConnectionAudio(audioBlobName);
    await waitForBlob(audioBlobClient);

    const audio = await audioBlobClient.download();
    const downloadedFile = await streamToBuffer(audio.readableStreamBody);

    let audioConfig = sdk.AudioConfig.fromWavFileInput(downloadedFile);
    console.log(audioConfig);
    let speechRecognizer = new sdk.SpeechRecognizer(speechConfig, audioConfig);

    const pronunciationAssessmentConfig = sdk.PronunciationAssessmentConfig.fromJSON(
      JSON.stringify({
        GradingSystem: "HundredMark",
        Granularity: "Phoneme",
        EnableMiscue: "True",
        EnableProsodyAssessment: "True",
      })
    );
    pronunciationAssessmentConfig.referenceText = text;

    if (language === "English") {
      speechConfig.speechRecognitionLanguage = "en-US";
    }
    if (language === "French") {
      speechConfig.speechRecognitionLanguage = "fr-FR";
    }

    pronunciationAssessmentConfig.applyTo(speechRecognizer);

    // Start the pronunciation assessment
    speechRecognizer.recognizeOnceAsync(async (result) => {
      console.log(result);
      switch (result.reason) {
        case sdk.ResultReason.RecognizedSpeech:
          const pronunciation_result =
            sdk.PronunciationAssessmentResult.fromResult(result);

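For reference, after sdk.PronunciationAssessmentResult.fromResult(result) the RecognizedSpeech case usually just reads the individual scores off that object. A minimal sketch of how the truncated handler could continue (property names as in the JS Speech SDK; prosodyScore is only populated on newer SDK versions with prosody assessment enabled):

          console.log("Accuracy:", pronunciation_result.accuracyScore);
          console.log("Fluency:", pronunciation_result.fluencyScore);
          console.log("Completeness:", pronunciation_result.completenessScore);
          console.log("Pronunciation:", pronunciation_result.pronunciationScore);
          console.log("Prosody:", pronunciation_result.prosodyScore); // newer SDKs only
          break;
        default:
          console.log("Speech was not recognized:", result.reason);
      }
      speechRecognizer.close();
    });
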
1 answer

  1. romungi-MSFT 42,761 Reputation points Microsoft Employee
    2024-05-08T07:45:41.1433333+00:00

    @Iheb Jandoubi Try adding the language when creating the SpeechRecognizer() instance rather than on speechConfig(), as defined in the sample from GitHub.

    See the sample implementation on GitHub.

    speechRecognizer = new SpeechRecognizer(speechConfig, "fr-FR", audioConfig)

    You can essentially keep speechConfig() for setting the endpoint and key, set the language on the recognizer, and apply the pronunciation config to the recognizer.
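
    If your version of the JavaScript SDK only exposes the two-argument SpeechRecognizer(speechConfig, audioConfig) constructor, a rough sketch of the same idea is below: the locale is fixed before the recognizer is built (the question assigns it to speechConfig only after the recognizer has already been constructed), and only then is the pronunciation config applied. The key placeholder and the text, language and downloadedFile variables are taken from the question.

    const sdk = require("microsoft-cognitiveservices-speech-sdk");

    const speechConfig = sdk.SpeechConfig.fromSubscription("<your-key>", "eastus");
    // Choose the locale before the recognizer is constructed, not after
    speechConfig.speechRecognitionLanguage = language === "French" ? "fr-FR" : "en-US";

    const audioConfig = sdk.AudioConfig.fromWavFileInput(downloadedFile);
    const speechRecognizer = new sdk.SpeechRecognizer(speechConfig, audioConfig);

    const pronunciationAssessmentConfig = sdk.PronunciationAssessmentConfig.fromJSON(
      JSON.stringify({
        GradingSystem: "HundredMark",
        Granularity: "Phoneme",
        EnableMiscue: true,
        EnableProsodyAssessment: true,
      })
    );
    pronunciationAssessmentConfig.referenceText = text;
    pronunciationAssessmentConfig.applyTo(speechRecognizer);

    speechRecognizer.recognizeOnceAsync((result) => {
      if (result.reason === sdk.ResultReason.RecognizedSpeech) {
        const scores = sdk.PronunciationAssessmentResult.fromResult(result);
        console.log(scores.pronunciationScore, scores.fluencyScore, scores.accuracyScore);
      }
      speechRecognizer.close();
    });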

    P.S.: Please don't share your keys in code snippets. I have removed them here to ensure the keys are not misused. Please regenerate your keys and try the above recommendation. Thanks!!
