I am wondering if there’s a way for me to speed up the process of real-time transcription. Preferably for synchronous speech recognition as my usages are going to be relatively short.
I originally considered building a container so that I can have the model running locally to decrease latency however given that this is a personal project it is likely not possible to get approval from Azure.
To give you some context about my situation: I am based close to the Australian East servers, with internet speeds of >200mbps and using a high end PC.
My current speed isn't that bad but I feel it sometimes lags a bit.
If you have any advice for improving the speed I would greatly appreciate it.