Use of Azure OpenAI Whisper model

Suvi Anju 45 Reputation points
2023-09-20T16:08:15.6933333+00:00

I have a client request to build a solution for a medical team that requires integrating the OpenAI Whisper model. I researched how to connect to the Whisper model but could not get it working. I am unsure where the Whisper model is exposed and how to connect it to my application. Additionally, are there any restrictions on the size of the audio files it can process?

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,943 questions
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
2,793 questions

Accepted answer
  1. ChakaravarthiRangarajanBhargavi-1820 715 Reputation points
    2023-09-21T10:10:30.6733333+00:00

    Hi Suvi Anju,

    Thank you for the interesting question.

    OpenAI Whisper has been announced as coming soon to Azure OpenAI Service and Azure AI Speech, and it is currently in the preview stage. The benefits of running the OpenAI Whisper model in Azure include enterprise-grade security, privacy controls, and data processing capabilities that allow for customized solutions to fit specific business needs. Azure AI Speech extends Whisper transcription by supporting files up to 1 GB in size and by letting you batch up to 1,000 files in a single request.

    Regarding how to access the model: before you create a deployment, please check the following prerequisites.

    · An Azure subscription - Create one for free.

    · Access granted to Azure OpenAI Service in the desired Azure subscription. Currently, access to this service is granted only by application. You can apply for access to Azure OpenAI Service by completing the form at https://aka.ms/oai/access.

    · An Azure OpenAI resource created in the North Central US or West Europe regions with the Whisper model deployed. For more information, see Create a resource and deploy a model with Azure OpenAI.

    · To successfully make a call against Azure OpenAI, you'll need an endpoint and a key.
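
    Once the prerequisites above are in place, a call to the deployed model could look roughly like the sketch below. It is not an official sample: it uses only the Python standard library, the API version string is an assumption (check the Azure OpenAI documentation for the version your resource supports), and the resource name, deployment name, and file name in the usage note are hypothetical.

```python
import json
import urllib.request
import uuid
from pathlib import Path

# Assumption: a preview API version; replace it with one your resource supports.
API_VERSION = "2023-09-01-preview"


def transcription_url(endpoint: str, deployment: str, api_version: str = API_VERSION) -> str:
    """Build the REST URL for the transcription operation of a Whisper deployment."""
    return (
        f"{endpoint.rstrip('/')}/openai/deployments/{deployment}"
        f"/audio/transcriptions?api-version={api_version}"
    )


def encode_multipart(filename: str, audio: bytes, boundary: str) -> bytes:
    """Encode one audio file as a multipart/form-data body with a single 'file' field."""
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + audio + tail


def transcribe(endpoint: str, api_key: str, deployment: str, audio_path: str) -> str:
    """Send the audio file to the deployment and return the transcribed text."""
    boundary = uuid.uuid4().hex
    path = Path(audio_path)
    body = encode_multipart(path.name, path.read_bytes(), boundary)
    request = urllib.request.Request(
        transcription_url(endpoint, deployment),
        data=body,
        headers={
            "api-key": api_key,  # the key from the Azure OpenAI resource
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        # The default response format is JSON with a "text" field.
        return json.loads(response.read())["text"]
```

    For example, `transcribe("https://example-resource.openai.azure.com", key, "whisper", "visit.wav")` would post `visit.wav` to a deployment named `whisper`; all four values are placeholders for your own endpoint, key, deployment name, and audio file.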


    For more information, please refer to the blog post below.

    https://techcommunity.microsoft.com/t5/azure-ai-services-blog/openai-whisper-is-coming-soon-to-azure-openai-service-and-azure/ba-p/3876671

    I hope this helps.

    Regards,

    Chakravathi Rangarajan Bhargavi

    Please accept the answer and vote 'Yes' if you found it helpful, to support the community. Thanks a lot.

     

    2 people found this answer helpful.

1 additional answer

  1. Ramr-msft 17,736 Reputation points
    2023-09-21T09:17:27.14+00:00

    @Suvi Anju Thanks for the question. You can use the Azure OpenAI Whisper model for speech to text. The file size limit for the Azure OpenAI Whisper model is 25 MB. If you need to transcribe a file larger than 25 MB, you can use the Azure AI Speech batch transcription API.
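
    The size rule above can be checked before uploading. The sketch below is only illustrative: the function names and the returned labels are hypothetical, and it assumes that 25 MB means 25 * 1024 * 1024 bytes and that a file of exactly that size is still accepted.

```python
import os

# Assumption: the 25 MB limit is counted in binary megabytes and is inclusive.
WHISPER_MAX_BYTES = 25 * 1024 * 1024


def choose_transcription_service(size_bytes: int) -> str:
    """Pick a service label for an audio file of the given size in bytes."""
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    if size_bytes <= WHISPER_MAX_BYTES:
        return "azure-openai-whisper"
    # Larger files go to the Azure AI Speech batch transcription API.
    return "azure-ai-speech-batch"


def choose_for_file(path: str) -> str:
    """Look up the file size on disk and pick the matching service label."""
    return choose_transcription_service(os.path.getsize(path))
```

    Calling `choose_for_file` on each recording before sending it lets the application route large recordings to batch transcription instead of failing on upload.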

    Prerequisites:

    Currently, the Whisper model is supported only in the North Central US and West Europe regions. You need an Azure OpenAI resource created in one of these regions with the Whisper model deployed. For more information, see Create a resource and deploy a model with Azure OpenAI.

