Hi 服部 隼也,
Yes, Azure OpenAI usage quotas (such as tokens-per-minute, requests-per-minute, and deployment limits) are initially provisioned at default limits. These can be increased upon request, but the final limits are subject to Microsoft’s internal review based on your use case, region availability, and responsible AI review.
Answers to Your Questions:
1.what extent can the usage quota be increased through requests?
· There is no publicly fixed upper bound for quota increases. However, Microsoft typically reviews the scale based on your:
o Intended use case
o Model size (e.g., GPT-4 Turbo vs. GPT-3.5)
o Token consumption requirements (tokens-per-minute)
o Region availability
o Responsible AI use and business justification
· Token-per-minute (TPM) and Requests-per-minute (RPM) are the main metrics considered.
2.Is there a maximum limit to how much the quota can be increased?
· Yes, practical ceilings exist, but they are not explicitly documented.
· Examples of high-end limits seen in practice:
o GPT-4 Turbo: Up to 300,000 TPM
o GPT-3.5-Turbo: Up to 600,000–1,000,000 TPM
o These are not guaranteed and depend on request review.
· Microsoft’s Internal team will assess:
o Your current usage trends
o Business justification and project scale
o Enterprise agreement or subscription type
o Compliance with Responsible AI Standard
How to Request a Quota Increase
You can follow these steps:
Option1: Via Azure Portal
- Go to the Azure Portal → Your OpenAI Resource.
- Select "Limits" or "Usage + quotas" under the resource settings.
- Click “Request increase” or open a support ticket.
- Provide:
- Subscription details
- Region
- Model (e.g., gpt-4, gpt-35-turbo)
- Required TPM / RPM
- Description of use case and justification
Option2: Using Azure Support
· Navigate to: https://portal.azure.com/#blade/Microsoft_Azure_Support/HelpAndSupportBlade
· Create a new support request:
o Issue Type: Service and subscription limits (quotas)
o Service: Azure OpenAI
o Region and Deployment ID
o Provide justification and urgency
Please refer below Documentations
· Azure OpenAI Quotas & Limits
Hope this helps, if that resolves your query please do up-vote and accept it, so that it will be helpful for others in the community who are having similar issues/query.
Thank You!