How to send audio via the Pronunciation Assessment API

Dominar el ingles 20

I'm trying to send a request to the Pronunciation Assessment API in my PHP application, and I'm receiving a 200 response from it, but the body of the response just looks something like this, which is not all that helpful:

{
    "RecognitionStatus": "EndOfDictation",
    "Offset": 5900000,
    "Duration": 0
}

I'm setting up the request like this:

[Image: screenshot of the request setup]

To get the audio, I am doing this in javascript:

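const audioBlob = new Blob(this.audioChunks, {type: 'audio/wav'});
// Multipart form that carries the recording and the reference text to the PHP backend
const formData = new FormData();
formData.append('audio', audioBlob, 'recording.wav');
formData.append('text', this.text);
axios.post('/api/pronunciation-assessment', formData) ...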

And then in PHP I am getting the contents of the uploaded audio and Base64-encoding it with base64_encode.

This is what I have been able to piece together from different sources on the web, since Azure has no official API documentation for this service as far as I can tell.

Any idea what I'm doing wrong here?

Accepted answer

romungi-MSFT 48,916 Microsoft Employee Moderator

@Dominar el ingles I think you are passing the pronunciation assessment parameters in the body of the request rather than in the headers. I used the snippet below, taken from this script, to create the header value for the assessment parameters, and then used it in my POST call through Postman.

import base64
import json

# Build the assessment parameters as a JSON object, then Base64-encode them
# so that they can be sent as a request header value.
referenceText = "Good morning."
pronAssessmentParamsJson = json.dumps({
    "ReferenceText": referenceText,
    "GradingSystem": "HundredMark",
    "Dimension": "Comprehensive",
})
pronAssessmentParamsBase64 = base64.b64encode(bytes(pronAssessmentParamsJson, 'utf-8'))
pronAssessmentParams = str(pronAssessmentParamsBase64, "utf-8")
print(pronAssessmentParams)  # paste this value into the assessment header
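
A quick way to sanity-check the header value is to decode it again and confirm that it parses as JSON. The following is only a sketch of that round trip, not part of the original script; the helper names `encode_params` and `decode_params` are hypothetical.

```python
import base64
import json


def encode_params(reference_text: str) -> str:
    """Build the Base64 header value from the assessment parameters."""
    params = {
        "ReferenceText": reference_text,
        "GradingSystem": "HundredMark",
        "Dimension": "Comprehensive",
    }
    raw = json.dumps(params).encode("utf-8")
    return base64.b64encode(raw).decode("ascii")


def decode_params(header_value: str) -> dict:
    """Reverse the encoding; raises ValueError if the payload is not valid JSON."""
    return json.loads(base64.b64decode(header_value).decode("utf-8"))


if __name__ == "__main__":
    value = encode_params("Good morning.")
    # If the JSON were malformed (for example a missing brace), this would raise.
    print(decode_params(value)["ReferenceText"])
```

Building the JSON with `json.dumps` rather than string formatting also keeps quotes inside the reference text from breaking the payload.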

Using Postman, I chose the binary option for the body and selected the audio file to be assessed.

[Image: Postman request body set to binary with the audio file selected]

Copy the assessment parameter header value printed by the above snippet and use it in the request, along with the other parameters, as seen below.

[Image: Postman request with the assessment header set, and the response with scores]
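
For anyone not using Postman, the same request could be put together in code roughly as follows. This is a sketch under assumptions that are not confirmed in this thread: the short-audio REST endpoint URL, the `Pronunciation-Assessment` and `Ocp-Apim-Subscription-Key` header names, the `en-US` language, and 16 kHz PCM WAV audio. The function names `build_headers` and `build_request` are hypothetical. The point it illustrates is that the audio goes in the body as raw bytes while the assessment parameters go in a header.

```python
import base64
import json
import urllib.request

# Assumed endpoint for the short-audio REST API; replace the region and
# language with the ones that match your Speech resource.
ENDPOINT = (
    "https://{region}.stt.speech.microsoft.com/speech/recognition/"
    "conversation/cognitiveservices/v1?language=en-US&format=detailed"
)


def build_headers(reference_text: str, subscription_key: str) -> dict:
    """Return the request headers, including the Base64 assessment parameters."""
    params = {
        "ReferenceText": reference_text,
        "GradingSystem": "HundredMark",
        "Dimension": "Comprehensive",
    }
    encoded = base64.b64encode(json.dumps(params).encode("utf-8")).decode("ascii")
    return {
        "Ocp-Apim-Subscription-Key": subscription_key,  # assumed header name
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
        "Pronunciation-Assessment": encoded,  # assumed header name
    }


def build_request(audio_bytes: bytes, reference_text: str,
                  region: str, subscription_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request."""
    return urllib.request.Request(
        ENDPOINT.format(region=region),
        data=audio_bytes,  # raw WAV bytes: not Base64, not multipart form data
        headers=build_headers(reference_text, subscription_key),
        method="POST",
    )
```

Sending it would then be a call such as `urllib.request.urlopen(build_request(wav_bytes, "Good morning.", "westus", key))`; that step is left out so the sketch runs without a subscription key.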

As observed, the response contains the result and the assessment of reference text with the scores. I hope this helps!!
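
To tell a scored response apart from the unhelpful one quoted in the question, a small check along these lines may help. It is a sketch only: the `NBest` and `AccuracyScore` field names are assumptions based on the detailed output format, since the response screenshot is not reproduced here, and `summarize` is a hypothetical helper.

```python
import json


def summarize(response_text: str) -> dict:
    """Report the recognition status and whether any scores came back."""
    body = json.loads(response_text)
    # Assumed layout: scores sit on the first entry of an "NBest" list.
    best = (body.get("NBest") or [{}])[0]
    return {
        "status": body.get("RecognitionStatus"),
        "has_scores": "AccuracyScore" in best,
        "accuracy": best.get("AccuracyScore"),
    }


if __name__ == "__main__":
    # The response from the question: a 200 with no assessment in it.
    question_response = (
        '{"RecognitionStatus": "EndOfDictation", "Offset": 5900000, "Duration": 0}'
    )
    print(summarize(question_response))
```

A 200 status code alone does not mean the assessment ran, so checking the body this way is a reasonable guard before reading any scores.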

If this answers your query, please click Accept Answer and Yes for "Was this answer helpful". If you have any further queries, do let us know.

Dominar el ingles 20 Reputation points

2023-05-15T14:40:04.96+00:00

Right, yes. Thanks! I found this out from a random Stack Overflow post somewhere. Amazing that I can't find it documented anywhere by Microsoft.

I adjusted my PHP code to put those parameters in the header and got that working.

Now the problem I'm having is that the phoneme accuracy grading doesn't seem to be working: https://learn.microsoft.com/en-us/answers/questions/1284533/azure-pronunciation-assessment-returning-the-same

And it's not just in my application that it doesn't work. It doesn't seem to work in the Speech Studio hosted by Microsoft either. So I'm waiting for an answer to that issue to see if this service will even work for me or not.

Lê Bá Hùng 0 Reputation points

2024-01-11T22:31:15.8666667+00:00

Hi, were you able to send audio longer than 1 minute through the API in PHP?
