Hello abadis,
Welcome to the Microsoft Q&A and thank you for posting your questions here.
Regarding your issues:
Whisper Model not available in Germany West Central
As of now, Whisper is not available in Germany West Central or Germany North in Azure OpenAI. You can check available models by region: https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models#region-availability
- You have some of the workaround options to: Deploy Whisper in a nearby region e.g., West Europe. Use Private Link, VNet Integration, and Customer-Managed Keys CMK to enforce data protection and encryption at rest/transit. Ensure logs and backups are stored in German-compliant storage accounts.
- If the model must run in Germany, consider using confidential VMs with Whisper deployed in a container using ONNX or HuggingFace non-Azure OpenAI Whisper. Use tools like Whisper.cpp or OpenAI Whisper running in AKS + SGX-capable node pool.
- Submit a quota increase plus region request directly to Microsoft: - https://portal.azure.com/#blade/Microsoft_Azure_Support/HelpAndSupportBlade/newsupportrequest
WebSocket Access to GPT-4o
The fact is that Azure OpenAI does not currently support WebSocket access for text completions access to GPT-4o. Only REST endpoints are available. - https://learn.microsoft.com/en-us/azure/ai-services/openai/whats-newhttps://learn.microsoft.com/en-us/azure/ai-services/openai/whats-new
- Workaround options are to:
- OpenAI's hosted version of GPT-4o supports WebSocket - https://platform.openai.com/docs/guides/gpt/chat-websocketshttps://platform.openai.com/docs/guides/gpt/chat-websockets
- This violates DIPA/GDPR for German healthcare unless OpenAI hosts in the EU which they currently don’t.
- Use GPT-4o with
stream=true
in REST endpoint. This allows token-by-token streaming, which is close to real-time. - https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/chatgpt?tabs=console#streaming-responses
So, submit support ticket to request Whisper model in Germany West Central: Azure Support Portal- https://portal.azure.com/#blade/Microsoft_Azure_Support/HelpAndSupportBlade/newsupportreques
- Explore deploying Whisper on Confidential VMs in Germany, if on-premises processing is critical.
- Use
stream=true
for GPT-4o chat completions in Azure to simulate WebSocket streaming. - Follow Azure OpenAI Release Notes on - https://learn.microsoft.com/en-us/azure/ai-services/openai/whats-new
- Reach out to Priority Customer Support - if you can.
I hope this is helpful! Do not hesitate to let me know if you have any other questions or clarifications.
Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.