Share via

Speech To Text Batch API: Which languages support diarization?

Francis O'Brien 6 Reputation points
2021-08-04T08:08:17.503+00:00

Current when making requests to https://eastus.api.cognitive.microsoft.com/speechtotext/v3.0/transcriptions I get an error saying diarization is not supported for this language. Example request below:
{
"contentUrls": [
"https://contoso.com/mystoragelocation",
"https://contoso.com/myotherstoragelocation"
],
"properties": {
"diarizationEnabled": true,
"wordLevelTimestampsEnabled": true,
"punctuationMode": "DictatedAndAutomatic",
"profanityFilterMode": "Masked"
},
"locale": "he-IL",
"displayName": "Transcription using default model for he-IL"
}

Above example is for a Hebrew mono audio file. The documentation says Hebrew is supported here: https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/language-support

However requests set up as above return:

{"code":"InvalidPayload","message":"This locale does not support diarization."}}

Is there documentation to say which locales support diarization or is this a bug?

Thanks

Azure Speech in Foundry Tools

2 answers

Sort by: Most helpful
  1. Ramr-msft 17,836 Reputation points
    2021-08-30T05:17:11.843+00:00

    @Francis O'Brien Thanks, We have rolled out a fix for this issue to all public regions. Can you please try again.

    Was this answer helpful?

    0 comments No comments

  2. Ramr-msft 17,836 Reputation points
    2021-08-05T15:08:23.96+00:00

    @Francis O'Brien Thanks for the details. This is indeed a bug, Hebrew (or indeed all locales) should be supported for diarization in most scenarios. We are working on a fix, will update on the same once it's fixed.

    Was this answer helpful?


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.