An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
Thank you, this clarifies the GA /openai/v1/realtime behavior.
Your explanation is consistent with our earlier observations:
- using gpt-realtime-whisper as the /realtime connection model returns “The realtime operation does not work with the specified model”;
- using an OpenAI-style dedicated transcription session on a standard realtime session is also rejected.
So we understand that the supported GA pattern is:
- connect to /openai/v1/realtime with a supported realtime model such as gpt-realtime-1.5;
- enable transcription with session.input_audio_transcription using a deployment name.
However, we also observed a different Azure behavior on June 10, 2026 with the preview transcription_sessions flow:
- POST https://<resource>/openai/realtimeapi/transcription_sessions?api-version=2025-04-01-preview succeeds
- Azure returns:
- object = realtime.transcription_session
- id = sess_DpE8UrTWVVY9rJQhPwDlb
- a valid client_secret
- Opening: wss://<resource>/openai/realtime?api-version=2025-04-01-preview&intent=transcription&deployment=gpt-realtime-whisper returns HTTP 302
- Azure redirects to: wss://<resource>/v1/realtime?api-version=2025-04-01-preview&intent=transcription&deployment=gpt-realtime-whisper&api-key=...
- Following that redirected URL returns HTTP 404 Resource not found
(apim-request-id: b270a055-1574-4ed6-83ec-f27ddc6f757c)
Could you please clarify the status of this preview transcription_sessions flow on Azure OpenAI?
- Is /openai/realtimeapi/transcription_sessions currently unsupported or deprecated for gpt-realtime-whisper?
- If it is unsupported, why does it successfully create a realtime.transcription_session and return a client_secret?
- If it is supported, what is the correct WebSocket URL and authentication pattern to use after the session creation call?
- Could you share a minimal official Azure example in Java for the recommended supported websocket transcription flow?
If the supported path is only the GA realtime session with session.input_audio_transcription, we can switch our implementation to that model.