Hi avi yashchin, you may consider 2 options here:
- Use Global-Standard deployment, as it provides higher TPM limits for GPT-4, GPT-4o and GPT-4o-mini: https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits#gpt-4o--gpt-4-turbo-global-standard
- Or, if you want to increase quota for a Standard deployment, you can open Azure OpenAI Studio, click Quota, choose your Az subscription and then in the right corner click "Request quota" button (as shown on the attached screenshot). You will be forwarded to online form, where you can specify new quota requirements and if approved, relevant quota increase will be allocated.