Azure Speech Pronunciation Assessment (Preview)
Azure Speech Pronunciation assessment evaluates speech pronunciation and gives speakers feedback on the accuracy and fluency of their speech.
This connector is available in the following products and regions:
Service | Class | Regions |
---|---|---|
Logic Apps | Standard | All Logic Apps regions except Azure China regions |
Power Automate | Premium | All Power Automate regions except China Cloud operated by 21Vianet |
Power Apps | Premium | All Power Apps regions except China Cloud operated by 21Vianet |
Contact | |
---|---|
Name | Speech Service Power Platform Team |
URL | https://docs.microsoft.com/azure/cognitive-services/speech-service/support |
Email | speechpowerplatform@microsoft.com |
Connector Metadata | |
---|---|
Publisher | Microsoft |
Website | https://docs.microsoft.com/azure/cognitive-services/speech-service/ |
Privacy policy | https://privacy.microsoft.com |
Categories | AI;Website |
With pronunciation assessment, language learners can practice, get instant feedback, and improve their pronunciation so they can speak and present with confidence.
Prerequisites
You will need the following to proceed:
- An Azure subscription. You can create one for free.
- A Speech resource created in the Azure portal.
- Get the Speech resource key and region. After your Speech resource is deployed, select Go to resource to view and manage keys. For more information about Cognitive Services resources, see Get the keys for your resource.
- Learn more about Pronunciation Assessment.
Creating a connection
The connector supports the following authentication types:
Name | Description | Applicable regions | Shareability |
---|---|---|---|
Api Key | ApiKey | All regions | Shareable |
Azure AD Integrated | Use Azure Active Directory to access your speech service. | All regions except Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High) | Not shareable |
Azure AD Integrated (Azure Government) | Use Azure Active Directory to access your speech service. | Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High) only | Not shareable |
Default [DEPRECATED] | This option is only for older connections without an explicit authentication type, and is only provided for backward compatibility. | All regions | Not shareable |
Api Key
Auth ID: keyBasedAuth
Applicable: All regions
ApiKey
This is a shareable connection. If the power app is shared with another user, the connection is shared as well. For more information, see the Connectors overview for canvas apps - Power Apps | Microsoft Docs.
Name | Type | Description | Required |
---|---|---|---|
Account Key | securestring | Speech service key | True |
Region | string | Speech service region (Example: eastus) | True |
Azure AD Integrated
Auth ID: tokenBasedAuth
Applicable: All regions except Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High)
Use Azure Active Directory to access your speech service.
This is not a shareable connection. If the power app is shared with another user, that user will be prompted to create a new connection explicitly.
Name | Type | Description | Required |
---|---|---|---|
Custom Subdomain | string | Custom subdomain endpoint URL (Example: contoso) | True |
Azure AD Integrated (Azure Government)
Auth ID: tokenBasedAuth
Applicable: Azure Government and Department of Defense (DoD) in Azure Government and US Government (GCC-High) only
Use Azure Active Directory to access your speech service.
This is not a shareable connection. If the power app is shared with another user, that user will be prompted to create a new connection explicitly.
Name | Type | Description | Required |
---|---|---|---|
Custom Subdomain | string | Custom subdomain endpoint URL (Example: contoso) | True |
Default [DEPRECATED]
Applicable: All regions
This option is only for older connections without an explicit authentication type, and is only provided for backward compatibility.
This is not a shareable connection. If the power app is shared with another user, that user will be prompted to create a new connection explicitly.
Name | Type | Description | Required |
---|---|---|---|
Account Key | securestring | Azure Speech Pronunciation Assessment Account Key | True |
Region | string | Speech service region (Example: eastus) | True |
Throttling Limits
Name | Calls | Renewal Period |
---|---|---|
API calls per connection | 100 | 60 seconds |
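A caller that drives this connector in a loop may want to stay under the limit of 100 calls per 60 seconds on its own side rather than rely on throttled responses. The following is a minimal client-side sketch, not part of the connector: the class name `SlidingWindowLimiter` and its methods are hypothetical, and it assumes the limit behaves as a rolling 60-second window, which the table above does not specify.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Hypothetical helper: allow at most max_calls in any rolling window of `period` seconds."""

    def __init__(self, max_calls=100, period=60.0, clock=time.monotonic):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock
        self._stamps = deque()  # timestamps of calls still inside the window

    def wait_time(self):
        """Return how many seconds to wait before another call fits (0.0 if it fits now)."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._stamps and now - self._stamps[0] >= self.period:
            self._stamps.popleft()
        if len(self._stamps) < self.max_calls:
            return 0.0
        return self.period - (now - self._stamps[0])

    def acquire(self, sleep=time.sleep):
        """Block until a call is allowed, then record it."""
        delay = self.wait_time()
        while delay > 0:
            sleep(delay)
            delay = self.wait_time()
        self._stamps.append(self.clock())
```

Calling `limiter.acquire()` immediately before each connector request keeps the caller within the assumed window. The `clock` and `sleep` parameters exist only so the helper can be exercised without real delays.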
Actions
Create Pronunciation assessment(V1) |
Creates a new pronunciation assessment. |
Create Pronunciation assessment(V1)
Creates a new pronunciation assessment.
Parameters
Name | Key | Required | Type | Description |
---|---|---|---|---|
AudioContent | AudioContent | True | binary | The file to upload. |
ReferenceText | ReferenceText | True | string | The text that the pronunciation will be evaluated against. |
GradingSystem | GradingSystem | | string | The point system for score calibration. The FivePoint system gives a 0-5 floating point score, and HundredMark gives a 0-100 floating point score. Default: FivePoint. |
Granularity | Granularity | | string | The evaluation granularity. Accepted values are: 'Phoneme', which shows the score on the full-text, word, and phoneme levels. 'Word', which shows the score on the full-text and word levels. 'FullText', which shows the score on the full-text level only. The default setting is 'Phoneme'. |
Dimension | Dimension | | string | Defines the output criteria. Accepted values are: 'Basic', which shows the accuracy score only. 'Comprehensive', which shows scores on more dimensions (for example, fluency score and completeness score on the full-text level, and error type on the word level). The default setting is 'Basic'. |
EnableMiscue | EnableMiscue | | boolean | Enables miscue calculation. With this parameter enabled, the pronounced words will be compared to the reference text. They'll be marked with omission or insertion based on the comparison. Accepted values are False and True. The default setting is False. |
ScenarioId | ScenarioId | | string | A GUID that indicates a customized point system. |
Language | language | True | string | The language of audio. |
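The accepted values and defaults listed for these parameters can be checked before the action is invoked. The sketch below is illustrative only: the function `build_assessment_params` is a hypothetical helper, not part of the connector, and the returned dictionary simply mirrors the Key column above rather than any documented request format. The locale string `en-US` in the usage note is an assumed example value.

```python
# Accepted values taken from the parameter descriptions above.
GRADING_SYSTEMS = ("FivePoint", "HundredMark")
GRANULARITIES = ("Phoneme", "Word", "FullText")
DIMENSIONS = ("Basic", "Comprehensive")


def _check(name, value, allowed):
    """Raise ValueError unless value is one of the allowed strings."""
    if value not in allowed:
        raise ValueError(f"{name} must be one of {allowed}, got {value!r}")
    return value


def build_assessment_params(audio_content, reference_text, language,
                            grading_system="FivePoint", granularity="Phoneme",
                            dimension="Basic", enable_miscue=False,
                            scenario_id=None):
    """Hypothetical helper: validate inputs and return a dict keyed like the table."""
    # The three required parameters must be present and non-empty.
    if not isinstance(audio_content, (bytes, bytearray)) or not audio_content:
        raise ValueError("AudioContent must be non-empty binary data")
    if not reference_text:
        raise ValueError("ReferenceText is required")
    if not language:
        raise ValueError("language is required")

    params = {
        "AudioContent": bytes(audio_content),
        "ReferenceText": reference_text,
        "GradingSystem": _check("GradingSystem", grading_system, GRADING_SYSTEMS),
        "Granularity": _check("Granularity", granularity, GRANULARITIES),
        "Dimension": _check("Dimension", dimension, DIMENSIONS),
        "EnableMiscue": bool(enable_miscue),
        "language": language,
    }
    # ScenarioId is optional, so include it only when supplied.
    if scenario_id is not None:
        params["ScenarioId"] = scenario_id
    return params
```

For example, `build_assessment_params(audio_bytes, "Hello world", "en-US", dimension="Comprehensive")` returns a dictionary with the documented defaults filled in for the remaining optional fields, while a call with `granularity="Sentence"` raises `ValueError`.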