Thanks for reaching out to us. The Whisper model is a good solution, but you need access to Azure OpenAI first.
I would suggest trying the real-time speech to text feature if you want to do live transcription.
Please see the documentation here - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-to-text#real-time-speech-to-text
QuickStart - https://learn.microsoft.com/en-us/azure/ai-services/speech-service/how-to-recognize-speech?pivots=programming-language-python
GitHub samples are here - https://github.com/Azure-Samples/cognitive-services-speech-sdk
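To give a rough idea of what live transcription looks like with the Speech SDK for Python (pip install azure-cognitiveservices-speech), here is a minimal sketch. It assumes a Speech resource key and region stored in the environment variables SPEECH_KEY and SPEECH_REGION, a working default microphone, and en-US audio. The function names, the keep_final_phrase helper, and the fixed 30-second listening window are only illustrative choices, so please adjust them to your scenario and treat the quickstart above as the authoritative reference.

```python
import os
import time


def keep_final_phrase(text: str, phrases: list) -> None:
    """Illustrative helper: store a finalized phrase, skipping empty results."""
    cleaned = text.strip()
    if cleaned:
        phrases.append(cleaned)


def transcribe_from_microphone(key: str, region: str, seconds: float = 30.0) -> list:
    """Listen on the default microphone for `seconds` and return the final phrases."""
    # Imported here so the file still loads where the SDK is not installed.
    import azure.cognitiveservices.speech as speechsdk

    speech_config = speechsdk.SpeechConfig(subscription=key, region=region)
    speech_config.speech_recognition_language = "en-US"  # assumption: English audio
    audio_config = speechsdk.audio.AudioConfig(use_default_microphone=True)
    recognizer = speechsdk.SpeechRecognizer(
        speech_config=speech_config, audio_config=audio_config
    )

    phrases = []

    def on_recognizing(evt):
        # Fires repeatedly with partial hypotheses while the speaker is talking.
        print("partial:", evt.result.text)

    def on_recognized(evt):
        # Fires once per finished utterance; ignore "no match" results.
        if evt.result.reason == speechsdk.ResultReason.RecognizedSpeech:
            keep_final_phrase(evt.result.text, phrases)
            print("final:", evt.result.text)

    def on_canceled(evt):
        # Typically indicates a wrong key/region or a network problem.
        print("canceled:", evt)

    recognizer.recognizing.connect(on_recognizing)
    recognizer.recognized.connect(on_recognized)
    recognizer.canceled.connect(on_canceled)

    recognizer.start_continuous_recognition()
    try:
        time.sleep(seconds)  # callbacks run on SDK threads while we wait
    finally:
        recognizer.stop_continuous_recognition()
    return phrases


if __name__ == "__main__":
    key = os.environ.get("SPEECH_KEY")
    region = os.environ.get("SPEECH_REGION")
    if key and region:
        print(transcribe_from_microphone(key, region))
    else:
        print("Set SPEECH_KEY and SPEECH_REGION to run the microphone demo.")
```

Partial results arrive through the recognizing event and finalized phrases through the recognized event, so for live captions you would usually display the partial text and replace it once the final phrase comes in.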
If you want to use the Whisper model, please make sure you apply for access first. Currently, you must submit an application to access Azure OpenAI Service. To apply for access, complete this form.
I hope this helps. Please give it a try and let us know how it works.
-Please kindly accept the answer if you find it helpful, to support the community. Thanks a lot.