Real-time speech-to-text speed improvement

Mohammad Al-Hakim 1 Reputation point
2022-04-17T20:19:16.2+00:00

I am wondering if there is a way to speed up real-time transcription, preferably with synchronous speech recognition, since my use cases involve relatively short audio.

I originally considered building a container so that the model runs locally and latency drops. However, since this is a personal project, I am unlikely to get approval from Azure.

To give you some context about my situation: I am based close to the Australia East servers, with internet speeds of >200 Mbps, and I am using a high-end PC.

My current speed isn't that bad, but it sometimes lags a bit.
If you have any advice for improving the speed I would greatly appreciate it.

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

1 answer

  1. Ramr-msft 17,731 Reputation points
    2022-04-18T15:33:29.56+00:00

    @Mohammad Al-Hakim Thanks for the question. Are your audio files of different sizes? Can you please share the test results for the files that took longer to transcribe?

    You can use the REST API for Speech to Text to transcribe larger files, and you can run a custom model locally in a Docker container.

    https://learn.microsoft.com/en-us/azure/cognitive-services/speech-service/speech-services-quotas-and-limits

    1 person found this answer helpful.
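
To collect the kind of timing results the answer asks for, one option is to time a single REST call per audio file. The following is only a minimal sketch using the Python standard library. The endpoint path and header names are my recollection of the Speech-to-text REST API for short audio and should be checked against the current documentation. The region identifier `australiaeast`, the 16 kHz PCM WAV format, and the helper names `build_short_audio_request` and `timed` are assumptions made for illustration, not part of the original thread.

```python
import time
import urllib.parse
import urllib.request


def build_short_audio_request(region, key, wav_bytes, language="en-US"):
    """Build (but do not send) a POST request for short-audio recognition.

    Assumed endpoint shape and headers; verify against the official docs.
    """
    query = urllib.parse.urlencode({"language": language})
    url = (
        f"https://{region}.stt.speech.microsoft.com"
        f"/speech/recognition/conversation/cognitiveservices/v1?{query}"
    )
    headers = {
        "Ocp-Apim-Subscription-Key": key,
        # Assumes the clip is 16 kHz PCM WAV.
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
    }
    return urllib.request.Request(url, data=wav_bytes, headers=headers, method="POST")


def timed(fn, *args, **kwargs):
    """Call fn and return (result, elapsed_seconds) using a monotonic clock."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


# Hypothetical usage (requires a real key and a local WAV file):
# with open("sample.wav", "rb") as f:
#     req = build_short_audio_request("australiaeast", "<your-key>", f.read())
# response, seconds = timed(urllib.request.urlopen, req, timeout=30)
# print(f"round trip: {seconds:.3f} s")
```

Running this over files of different lengths and recording the elapsed time next to each file size would give the per-file results requested above, and would show whether the lag tracks audio length or network round trip.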