@Andrew J Thanks for reaching out. Yes, that's correct. The recommended way to monitor throttling for Azure OpenAI services is to use custom logging and metrics. You can configure your Azure OpenAI service to log all requests and responses, including any 429 responses due to throttling. You can then use Azure Monitor to create a custom metric that tracks the number of 429 responses over a given time period. You can configure an alert rule to trigger when the number of 429 responses exceeds a certain threshold. Reference: https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/monitoring#configure-diagnostic-settings https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/monitoring#set-up-alerts
do let me know incase of further queries, I would be happy to assist you.