Batch transcription using the Speech to Text service is taking more than 12 hours to transcribe some audio files.

Smit Burde 1 Reputation point
2022-02-17T12:20:36.043+00:00

Hi,

We use Azure Speech to Text to transcribe MP3 files. The audio files are approximately 1 MB in size and around 10 minutes long.
Batch transcription is taking more than 12 hours to transcribe these files.

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

1 answer

  1. Ramr-msft 17,736 Reputation points
    2022-02-17T16:18:31.52+00:00

    @Smit Burde Thanks for the question. This quickstart explains how to transcribe audio files that are held in storage (offline, also known as batch transcription): https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/quickstarts/from-blob?pivots=programming-language-csharp. Samples in C# and Python are available in our GitHub sample repository: https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/samples/batch. You don’t need to stay connected to the service: you submit jobs and collect the results at a later point in time, and the audio files can be several hours long. The functionality is REST-based.

    I also added some sample code to this repo to demo the Speech to Text SDK:
    https://github.com/caiomsouza/Microsoft-Cognitive-Services/tree/master/speech-to-text
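
The submit-then-collect flow described above could look roughly like the Python sketch below. It is not taken from the linked samples. The `v3.1` path segment, the `Ocp-Apim-Subscription-Key` header, the body fields, and the `Succeeded`/`Failed` status names are assumptions to check against the current REST reference. `build_create_request` and `wait_for_job` are hypothetical helper names, and `get_status` stands in for whatever call you use to read the job status.

```python
import json
import time
import urllib.request

# Assumption: path and API version; verify against the current REST reference.
API_PATH = "/speechtotext/v3.1/transcriptions"
# Assumption: status strings that mean the job has finished.
TERMINAL_STATES = {"Succeeded", "Failed"}


def build_create_request(region, key, content_urls, locale="en-US", name="batch job"):
    """Build (but do not send) the POST request that creates a transcription job."""
    body = {
        "contentUrls": list(content_urls),  # URLs of audio files in storage
        "locale": locale,
        "displayName": name,
    }
    return urllib.request.Request(
        url=f"https://{region}.api.cognitive.microsoft.com{API_PATH}",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


def wait_for_job(get_status, interval_s=30.0, timeout_s=3600.0, sleep=time.sleep):
    """Poll get_status() until the job reaches a terminal state or the timeout passes."""
    waited = 0.0
    while True:
        status = get_status()
        if status in TERMINAL_STATES:
            return status
        if waited >= timeout_s:
            raise TimeoutError(f"job still {status!r} after {waited:.0f}s")
        sleep(interval_s)
        waited += interval_s
```

To send the request, pass the result of `build_create_request` to `urllib.request.urlopen`. Giving `wait_for_job` an explicit timeout makes a job that sits in the queue for hours show up as an error instead of a silent wait.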

