Share via


Azure Speech Pronunciation Assessment (Preview)

Azure Speech Pronunciation assessment evaluates speech pronunciation and gives speakers feedback on the accuracy and fluency of their speech. 

This connector is available in the following products and regions:

Service Class Regions
Logic Apps Standard All Logic Apps regions except the following:
     -   Azure China regions
Power Automate Premium All Power Automate regions except the following:
     -   China Cloud operated by 21Vianet
Power Apps Premium All Power Apps regions except the following:
     -   China Cloud operated by 21Vianet
Contact
Name Speech Service Power Platform Team
URL https://docs.microsoft.com/azure/cognitive-services/speech-service/support
Email speechpowerplatform@microsoft.com
Connector Metadata
Publisher Microsoft
Website https://docs.microsoft.com/azure/cognitive-services/speech-service/
Privacy policy https://privacy.microsoft.com
Categories AI;Website

Pronunciation assessment evaluates speech pronunciation and gives speakers feedback on the accuracy and fluency of their speech. With pronunciation assessment, language learners can practice, get instant feedback and improve their pronunciation so they can speak and present with confidence.

Pre-requisites

You will need the following to proceed:

Creating a connection

The connector supports the following authentication types:

Api Key ApiKey All regions Shareable
Azure AD Integrated Use Azure Active Directory to access your speech service. All regions except Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High) Not shareable
Azure AD Integrated (Azure Government) Use Azure Active Directory to access your speech service. Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High) only Not shareable
Default [DEPRECATED] This option is only for older connections without an explicit authentication type, and is only provided for backward compatibility. All regions Not shareable

Api Key

Auth ID: keyBasedAuth

Applicable: All regions

ApiKey

This is shareable connection. If the power app is shared with another user, connection is shared as well. For more information, please see the Connectors overview for canvas apps - Power Apps | Microsoft Docs

Name Type Description Required
Account Key securestring Speech service key True
Region string Speech service region (Example: eastus) True

Azure AD Integrated

Auth ID: tokenBasedAuth

Applicable: All regions except Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High)

Use Azure Active Directory to access your speech service.

This is not shareable connection. If the power app is shared with another user, another user will be prompted to create new connection explicitly.

Name Type Description Required
Custom Subdomain string Custom subdomain endpoint url (Example: contoso) True

Azure AD Integrated (Azure Government)

Auth ID: tokenBasedAuth

Applicable: Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High) only

Use Azure Active Directory to access your speech service.

This is not shareable connection. If the power app is shared with another user, another user will be prompted to create new connection explicitly.

Name Type Description Required
Custom Subdomain string Custom subdomain endpoint url (Example: contoso) True

Default [DEPRECATED]

Applicable: All regions

This option is only for older connections without an explicit authentication type, and is only provided for backward compatibility.

This is not shareable connection. If the power app is shared with another user, another user will be prompted to create new connection explicitly.

Name Type Description Required
Account Key securestring Azure Speech Pronunciation Assessment Account Key True
Region string Speech service region (Example: eastus) True

Throttling Limits

Name Calls Renewal Period
API calls per connection 100 60 seconds

Actions

Create Pronunciation assessment(V1)

Creates a new pronunciation assessment.

Create Pronunciation assessment(V1)

Creates a new pronunciation assessment.

Parameters

Name Key Required Type Description
AudioContent
AudioContent True binary

The file to upload.

ReferenceText
ReferenceText True string

The text that the pronunciation will be evaluated against.

GradingSystem
GradingSystem string

The point system for score calibration. The FivePoint system gives a 0-5 floating point score, and HundredMark gives a 0-100 floating point score. Default: FivePoint.

Granularity
Granularity string

The evaluation granularity. Accepted values are:'Phoneme', which shows the score on the full-text, word, and phoneme levels.'Word', which shows the score on the full-text and word levels.'FullText', which shows the score on the full-text level only.The default setting is 'Phoneme'.

Dimension
Dimension string

Defines the output criteria. Accepted values are: 'Basic', which shows the accuracy score only.'Comprehensive', which shows scores on more dimensions (for example, fluency score and completeness score on the full-text level, and error type on the word level).The default setting is 'Basic'.

EnableMiscue
EnableMiscue boolean

Enables miscue calculation. With this parameter enabled, the pronounced words will be compared to the reference text. They'll be marked with omission or insertion based on the comparison. Accepted values are False and True. The default setting is False.

ScenarioId
ScenarioId string

A GUID that indicates a customized point system.

Language
language True string

The language of audio.