Hello Guillaume Lameyse,
Welcome to the Microsoft Q&A and thank you for posting your questions here.
I understand that you are having issues with the GPT-4o Deployment in West Europe - Severe Latency Issues.
I'm sorry to hear about the performance issues you're experiencing with your Azure GPT-4o deployment, this is getting common recently. You can enhance the reliability and efficiency of your Azure GPT-4o deployment with some of the steps to diagnose and improve stability:
First, check the Azure Status page https://status.azure.com/en-us/status to monitor real-time service health. While there are no widely reported issues specific to the West Europe region, performance fluctuations may arise due to demand and infrastructure updates. You can stay informed by referring to the Azure OpenAI Service documentation.
To enhance performance, consider monitoring and analyzing usage with Azure Monitor to detect anomalies. Optimizing requests, such as reducing token generation or enabling streaming, can significantly improve response times, as noted in this discussion similar case - https://learn.microsoft.com/en-us/answers/questions/1696807/gpt-4o-slow-to-complete-after-repeated-runs
Also, deploying in multiple regions helps distribute workload and ensures redundancy and keeping your deployment up to date with the latest model versions, as detailed in this link: https://learn.microsoft.com/en-us/azure/ai-services/openai/concepts/models , can also improve performance. If issues persist, reaching out to Azure Support via your Azure Portal will be a better.
I hope this is helpful! Do not hesitate to let me know if you have any other questions or clarifications.
Please don't forget to close up the thread here by upvoting and accept it as an answer if it is helpful.