Batch transcription using speech to text service is taking more than 12 hours to transcribe some audio files.

Question

Batch transcription using speech to text service is taking more than 12 hours to transcribe some audio files.

Smit Burde 1

Feb 17, 2022, 12:20 PM

Hi,

We use azure speech to text to transcribe the mp3 files. The audio files are approximately 1MB in size and around 10 mins long.
The batch transcription is taking more than 12 hrs to transcribe these files.

Ramr-msft 17,821 Reputation points

Feb 17, 2022, 4:19 PM

@Smit Burde Thanks, Can you please share the audio file to check.
Smit Burde 1 Reputation point

Feb 17, 2022, 4:32 PM

The file which took more than 12 hrs to transcribe is uploaded here -
https://1drv.ms/u/s!AlXMoZytp8E7gbQ_Tl4fyXsLfb6FbA?e=LB6ixX

1 answer

Your answer

Ramr-msft 17,821 Reputation points

Feb 17, 2022, 4:19 PM

@Smit Burde Thanks, Can you please share the audio file to check.
Smit Burde 1 Reputation point

Feb 17, 2022, 4:32 PM

The file which took more than 12 hrs to transcribe is uploaded here -
https://1drv.ms/u/s!AlXMoZytp8E7gbQ_Tl4fyXsLfb6FbA?e=LB6ixX

Answer 1

@Smit Burde Thanks for the question. This https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/quickstarts/from-blob?pivots=programming-language-csharp explains how to transcribe audio files that are in storage (offline aka batch transcription). Samples are available in our github sample repository (C# and python https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/batch). You don’t need to be constantly connected to the service, you submit jobs and collect the results at a later point in time, the audio files can have a length of several hours. The functionality is REST based.

This repo I added some sample code to demo the Speech to Text SDK:
https://github.com/caiomsouza/Microsoft-Cognitive-Services/tree/master/speech-to-text

Share via

Batch transcription using speech to text service is taking more than 12 hours to transcribe some audio files.

1 answer

Your answer