Azure cognitive speech Produces missing/incorrect words

Amper, Charwin (Contractor) 85 Reputation points
2023-12-07T03:36:18.0866667+00:00

Hi, I am wondering why the extracted text from audio have missing/incorrect words.

In this case the language used is Greek(el-gr)(other language could have the same issue)

Below are the image of Transcriptions JSON result and the human edited/correct captions.

Correct words are those that are in black background.

User's image

and here is the payload request for creating a transcription.

 var azureTranscription = new AzureTranscription
 {
     ContentUrls = new List<Uri> { contentUri },
     Locale = locale,
     DisplayName = displayName,
     Properties = new AzureTranscriptionProperties
     {
         Channels = new List<int> { 0 },
         WordLevelTimestampsEnabled = true,
         PunctuationMode = punctuationMode,
         ProfanityFilterMode = profanityFilterMode,
         TimeToLive = TimeSpan.FromHours(24),
         DestinationContainerUrl = destinationContainerUrl,
     },
     
 };

Are there special settings I need to consider for creating Transcription for languages like Greek?

Thanks

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,950 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.