@AI X Welcome to Microsoft Q&A Forum, Thank you for posting your query here!
.
As of now there is no ETA on when the GPT 4o-Audio would be GA (Generally Available).
.
We would be updating our below documentations once the model becomes GA. You may rely on the below documentation and keep the track of this:
.
On a side note:
Azure OpenAI GPT-4o Realtime API for speech and audio is part of the GPT-4o model family that supports low-latency, "speech in, speech out" conversational interactions. The GPT-4o audio realtime
API is designed to handle real-time, low-latency conversational interactions, making it a great fit for use cases involving live interactions between a user and a model, such as customer support agents, voice assistants, and real-time translators.
Currently only gpt-4o-realtime-preview
version: 2024-10-01-preview
supports real-time audio.
The gpt-4o-realtime-preview
model is available for global deployments in East US 2 and Sweden Central regions.
.
Hope this helps. If you have any follow-up questions, please let me know. I would be happy to help.
**
Please do not forget to "Accept the answer” and “up-vote” wherever the information provided helps you, this can be beneficial to other community members.