When Will the GPT 4o-Audio-Preview Model Be Released?

Question

When Will the GPT 4o-Audio-Preview Model Be Released?

AI X 130

I noticed that the Azure Pricing page mentions the upcoming GPT 4o-Audio-Preview model in the Chat Completions API, which is designed to process and generate audio content, including features like speech recognition and audio synthesis. Could you provide an estimated release date for this model, and any additional details on how it can be accessed once available?

navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-12-10T06:52:48.17+00:00

Just following up to check if my suggestion helped. Please let me know if you have any further queries. I would be happy to help.
navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-12-11T14:32:44.0266667+00:00

Just following up to check if the below answer helped. If that answers your query, do click "Accept the answer” for the same, which might be beneficial to other community members reading this thread. And, if you have any further query do let me know. I would be happy to help.

Accepted answer

0 additional answers

Your answer

navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-12-10T06:52:48.17+00:00

Just following up to check if my suggestion helped. Please let me know if you have any further queries. I would be happy to help.
navba-MSFT 27,550 Reputation points Microsoft Employee Moderator

2024-12-11T14:32:44.0266667+00:00

Just following up to check if the below answer helped. If that answers your query, do click "Accept the answer” for the same, which might be beneficial to other community members reading this thread. And, if you have any further query do let me know. I would be happy to help.

Answer 1

@AI X Welcome to Microsoft Q&A Forum, Thank you for posting your query here!

.

As of now there is no ETA on when the GPT 4o-Audio would be GA (Generally Available).

.

We would be updating our below documentations once the model becomes GA. You may rely on the below documentation and keep the track of this:

.

On a side note:

Azure OpenAI GPT-4o Realtime API for speech and audio is part of the GPT-4o model family that supports low-latency, "speech in, speech out" conversational interactions. The GPT-4o audio realtime API is designed to handle real-time, low-latency conversational interactions, making it a great fit for use cases involving live interactions between a user and a model, such as customer support agents, voice assistants, and real-time translators.

Currently only gpt-4o-realtime-preview version: 2024-10-01-preview supports real-time audio.

The gpt-4o-realtime-preview model is available for global deployments in East US 2 and Sweden Central regions.

.

Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help.

**

Please do not forget to "Accept the answer” and “up-vote” wherever the information provided helps you, this can be beneficial to other community members.

Anand, Atul 0 Reputation points

2025-01-22T06:29:45.8766667+00:00

Can you please provide any new update on this?

Share via

When Will the GPT 4o-Audio-Preview Model Be Released?

0 additional answers

Your answer