Use of Azure OpenAI Whisper model

Question

Use of Azure OpenAI Whisper model

Suvi Anju 45

I have a client request where I have to a development to the medical team that requires the OpenAI whisper model to be integrated. I tried researching on connecting the whisper model and failed with the same. I am unsure of where that whisper model gets displayed and how to connect it with the development. Additionally I am trying to understand are there any restriction on the file size of the audio that it can process.

AdityaSa 801 Reputation points

2023-09-21T09:10:01.84+00:00

@Suvi Anju Thanks for the question, here is the blog to get started with whisper model.

https://techcommunity.microsoft.com/t5/azure-ai-services-blog/announcing-the-preview-of-openai-whisper-in-azure-openai-service/ba-p/3928388

Accepted answer

1 additional answer

Your answer

AdityaSa 801 Reputation points

2023-09-21T09:10:01.84+00:00

@Suvi Anju Thanks for the question, here is the blog to get started with whisper model.

https://techcommunity.microsoft.com/t5/azure-ai-services-blog/announcing-the-preview-of-openai-whisper-in-azure-openai-service/ba-p/3928388

Answer 1

Chakaravarthi Rangarajan Bhargavi 1,115 MVP

Hi Suvi Anju,

Thank you for the interesting question.

OpenAI Whisper is Coming Soon to Azure OpenAI Service and Azure AI Speech. It is in Purview stage the benefits of running the OpenAI Whisper model in Azure include enterprise-grade security, privacy controls, and data processing capabilities that allow for customized solutions to fit specific business needs. Whisper transcription by enabling files up to 1GB in size and the ability to process large amounts of files by allowing you to batch up to 1000 files in a single request.

Regarding on how to access the part, before you check for the creation, please recheck the prerequisite of the below

· An Azure subscription - Create one for free.

· Access granted to Azure OpenAI Service in the desired Azure subscription. Currently, access to this service is granted only by application. You can apply for access to Azure OpenAI Service by completing the form at https://aka.ms/oai/access.

· An Azure OpenAI resource created in the North Central US or West Europe regions with the whisper model deployed. For more information, see Create a resource and deploy a model with Azure OpenAI.

· To successfully make a call against Azure OpenAI, you'll need an endpoint and a key.

User's image

More information please refer to below blog post.

https://techcommunity.microsoft.com/t5/azure-ai-services-blog/openai-whisper-is-coming-soon-to-azure-openai-service-and-azure/ba-p/3876671

I hope this helps.

Regards,

Chakravathi Rangarajan Bhargavi

-Please kindly accept the answer and vote 'Yes' if you feel helpful to support the community, thanks a lot.

Nik Jan 5 Reputation points

2023-09-26T13:58:42.51+00:00

This is great news. Is Whisper also connected to Speech Studio, in speech to text tool for testing? Because I am trying it there and the accuracy is still not as good as when using Whisper API directly?

I would like to be sure this is the same results and I am getting from Whisper API at the moment, before I start switching the code to it.

okappy 0

I solved to access azure whisper api with python by steps below.

STEP.1 Create your Azure open ai resource in 'North Central' US or 'West Europe' Region.

'The Whisper model via Azure OpenAI Service is available in the following regions: North Central US and West Europe'

STEP.2 Create Whisper deployment.

STEP.3 Write code(python).

Example:

import requests

api_base = os.getenv("OPENAI_API_BASE")
deployment = os.getenv("OPENAI_API_WHISPER_DEPLOYMENT", 'whisper-1')
api_version = os.getenv("OPENAI_API_WHISPER_VERSION", '2023-09-01-preview')
api_key = os.getenv("OPENAI_API_KEY", '[your api key as alternative to env value])

url = "{}openai/deployments/{}/audio/transcriptions?api-version={}".format(api_base, deployment, api_version)

headers = {
    "content_type": 'multipart/form-data; boundary=----WebKitFormBoundary7MA4YWxkTrZu0gW',  
    "api-key": "XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX"  
}
data = {
        "prompt": "[prompt here for improving accuracy]"
        "language":"ja"
       }
       
file_path = "C:\\[path]\\[to]\\[media]\\[file]\\audio.mp3"
files = [("file", open(file_path, "rb"))]

response = requests.post(url, headers=headers, data=data, files=files)  
print(response.json())

Nik Jan 5 Reputation points

2023-09-27T10:58:23.8633333+00:00

Ok, I can see it, thanks. I have also managed to connect to it via Azure Open AI Studio. However, the Whisper which is deployed under "Azure Open AI" services is pretty basic - it works more accurately than old Microsoft's speech-to-text models, but it does not offer any additional features. It is basic Whisper model, which we are already getting from Open AI. I would like options like diarization, word-level stamping etc.

I am now trying to connect to "Azure AI Services" instead of "Azure Open AI", to see if there is any difference.

This is bit confusing, not sure why it has been set this way. I guess it is some kind of transition environment.

Answer 2

Ramr-msft 17,826

@Suvi Anju Thanks for the question, you can use the Azure OpenAI Whisper model for speech to text. The file size limit for the Azure OpenAI Whisper model is 25 MB. If you need to transcribe a file larger than 25 MB, you can use the Azure AI Speech batch transcription API.

Prerequisites:

Currently the following regions are supported for whisper model. An Azure OpenAI resource created in the North Central US or West Europe regions with the whisper model deployed. For more information, see Create a resource and deploy a model with Azure OpenAI.

okappy 0 Reputation points

2023-09-25T05:12:29.3533333+00:00

Chakravathi Rangarajan Bhargavi,

I have same problem, and thanks for your post, I 've got Endpoint and key.

And then, I tried to access whisper api with python, using this article as refference, but failed.

'resource not found' error occurs.

To simplize the problem, i tried a curl command in the article, but that also fails.

If you have some advice, please help.

Regards,

Okappy
okappy 0 Reputation points

2023-09-25T05:13:01.1633333+00:00

(deleted for double post. sorry.)

Share via

Use of Azure OpenAI Whisper model

1 additional answer

Your answer