Real-time speech-to-text speed improvement

Mohammad Al-Hakim 1 Reputation point

I am wondering whether there is a way to speed up real-time transcription, preferably with synchronous speech recognition, since each use will be relatively short.

I originally considered building a container so that I could run the model locally to decrease latency. However, given that this is a personal project, it is unlikely that I could get approval from Azure.

To give you some context about my situation: I am based close to the Australia East servers, with an internet speed above 200 Mbps, and I am using a high-end PC.

My current speed isn't that bad, but I feel it sometimes lags a bit.
If you have any advice for improving the speed, I would greatly appreciate it.

Azure AI Speech
An Azure service that integrates speech processing into apps and services.

1 answer

  1. Ramr-msft 17,641 Reputation points

    @Mohammad Al-Hakim Thanks for the question. Are your audio files of different sizes? Can you please share the test results for the files that took longer to transcribe?

    You can use the Speech to Text REST API to transcribe larger files, and you can run a custom model locally in a Docker container.
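For readers who want to try the REST route on short clips, here is a minimal sketch, not taken from the answer itself. It assumes the short-audio endpoint pattern and header names as I understand them from the Azure AI Speech documentation, a 16 kHz PCM WAV input, and the `australiaeast` region mentioned in the question. The helper names `build_recognition_request` and `timed_recognize` are hypothetical. As far as I know, this endpoint is meant for clips of up to about 60 seconds, so larger files would need a different API.

```python
import json
import time
import urllib.parse
import urllib.request

# Assumption: short-audio REST endpoint pattern for Azure AI Speech.
ENDPOINT = (
    "https://{region}.stt.speech.microsoft.com"
    "/speech/recognition/conversation/cognitiveservices/v1"
)


def build_recognition_request(wav_bytes, key, region="australiaeast", language="en-US"):
    """Build (but do not send) a POST request for one short WAV clip."""
    query = urllib.parse.urlencode({"language": language, "format": "simple"})
    url = ENDPOINT.format(region=region) + "?" + query
    headers = {
        # Assumption: subscription-key authentication for the Speech resource.
        "Ocp-Apim-Subscription-Key": key,
        # Assumption: the clip is 16 kHz PCM audio in a WAV container.
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
    }
    return urllib.request.Request(url, data=wav_bytes, headers=headers, method="POST")


def timed_recognize(request, timeout=10.0):
    """Send the request and return (elapsed_seconds, parsed_json_body)."""
    start = time.perf_counter()
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return time.perf_counter() - start, body
```

Calling `timed_recognize(build_recognition_request(open("clip.wav", "rb").read(), key))` on a few clips of different sizes would give per-file timings to share in the thread. Pointing `region` at the nearest region is one way to reduce network round-trip time.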
