Hi there,
Upgrading to a higher SKU tier can help, but first check your quota limits in the Azure portal and request an increase if needed. On the client side, try batching requests, cutting unnecessary API calls, and caching frequently used responses. Also, use the rate limit headers that Azure OpenAI returns with its responses to monitor usage patterns and adjust your request rate accordingly. If traffic is unpredictable, a queueing mechanism can smooth out bursts so that requests are sent at a steadier pace.
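To make the caching and header-based throttling ideas a bit more concrete, here is a minimal Python sketch. It is an illustration under assumptions, not official SDK code: `send` is a hypothetical function you would write around your own HTTP call, assumed to return `(status, headers, body)` with lowercase header names, and `call_with_retry` and `CachedClient` are made-up names. It assumes a throttled request comes back as HTTP 429 with a `retry-after` header given in seconds, and falls back to exponential backoff if that header is missing.

```python
import time


def call_with_retry(send, payload, max_retries=5, sleep=time.sleep):
    """Call send(payload); on HTTP 429, wait and retry.

    `send` is a hypothetical callable returning (status, headers, body),
    where `headers` is a dict with lowercase keys.
    """
    for attempt in range(max_retries + 1):
        status, headers, body = send(payload)
        if status != 429:
            return status, headers, body
        if attempt == max_retries:
            break
        # Prefer the server's hint (assumed to be in seconds);
        # otherwise back off exponentially: 1s, 2s, 4s, ...
        try:
            delay = float(headers.get("retry-after"))
        except (TypeError, ValueError):
            delay = float(2 ** attempt)
        sleep(delay)
    raise RuntimeError("rate limited: retries exhausted")


class CachedClient:
    """Serves repeated prompts from memory instead of calling the API again."""

    def __init__(self, send, sleep=time.sleep):
        self._send = send
        self._sleep = sleep
        self._cache = {}

    def ask(self, prompt):
        if prompt in self._cache:
            return self._cache[prompt]
        status, _headers, body = call_with_retry(
            self._send, prompt, sleep=self._sleep
        )
        # Only cache successful responses.
        if status == 200:
            self._cache[prompt] = body
        return body
```

An exact-match cache like this only helps when the same prompt is sent repeatedly, and you would probably want an expiry or size limit in a real service.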
If this helps, kindly accept the answer. Thanks.