Hi Rosemary Raphael,
Thank you for reaching out on the Microsoft Q&A.
- Model Version Retirement & Transition: The version of
gpt-4o-transcribereleased in March 2025 (2025-03-20) was scheduled for retirement on October 15, 2025. Customers using that version are now being forced to upgrade to a newer build. This transition appears to have introduced instability in language detection. - Known Bug with Language Enforcement: There is an acknowledged bug where
gpt-4o-transcribeignores or inconsistently applies thelanguageparameter. Even when you specify"language": "en", the model sometimes switches to other languages (Chinese, Malay, etc.). This is because the language hint is treated as a “soft preference,” not a strict rule. - Community Reports of Wrong Language Output : Multiple developers have reported that since early October, English audio is being transcribed as Chinese or Malay, and Chinese as other languages. This aligns with your timeline.
- Realtime API Changes: A new variant
gpt-4o-transcribe-latestwas introduced in the Realtime API, but it’s not fully stable yet. Some users report missing or misaligned transcription events when using the new GA session https://community.openai.com/t/realtime-transcription-mismatch-and-gpt-4o-transcribe-latest/1358789
The retirement of the old version and rollout of a new backend likely changed the language detection pipeline. - The model is multilingual by design, and without strict enforcement, it guesses based on acoustic cues. If your audio has background noise or mixed-language phrases, the bug amplifies misclassification. You Can Do Now 1.Check Your Deployment Version In Azure Portal → OpenAI Deployments, confirm if you’re still on
2025-03-20. If yes, you need to update to the latest version once it appears in the dropdown.- Force Language via Prompting Add a strong instruction in the
promptfield:
This helps, but does not fully fix the bug."prompt": "The audio is entirely in English. Transcribe only in English." - Set Temperature Low Use
temperature: 0.2and avoid chunking. Some developers report better consistency with these settings. [community.openai.com] - Fallback Option If accuracy is critical, consider temporarily switching to Whisper-1 for strict language handling until the new
gpt-4o-transcribebuild stabilizes.
- Force Language via Prompting Add a strong instruction in the
References:https://learn.microsoft.com/en-us/azure/ai-foundry/openai/whats-new
https://learn.microsoft.com/en-us/azure/ai-services/translator/reference/known-issues https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-identification?tabs=once&pivots=programming-language-csharp
Feel free to accept this as an answer.
Thank you