An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
Hello Nagrath, Richa,
Thanks for raising this question in Azure Q&A forum.
Azure OpenAI quota tier upgrades are not automatic and follow specific eligibility criteria based on your demonstrated usage, payment history, and business justification. Here are the exact requirements and process:
Automatic Tier Progression (T1 → T2 → T3 → T4)
Microsoft will automatically upgrade your quota tier as you demonstrate sustained usage of your existing allocation. The key requirements are:
- Active usage — you must be consuming your current quota consistently (ideally hitting limits) across supported models
Good payment history — no outstanding balances or payment issues
Minimum spend thresholds — while exact numbers aren't published, Tier 2+ typically requires $100-500+ monthly spend across supported models
Automatic upgrades typically happen within 1-3 business days of meeting the criteria.
Tier 5 and Above — Manual Request Required
For Tier 5 (and higher custom quotas), you must submit a manual quota increase request via the official form. Microsoft explicitly states:
"Quota increase requests are processed in the order they're received, and priority goes to customers who actively use their existing quota allocation. Requests that don't meet this condition might be denied."
Eligibility Criteria for Manual Quota Increases
Your request will be evaluated based on these factors:
1. Current Quota Utilization
Provide evidence of hitting quota limits consistently (screenshots from Azure Monitor → Metrics showing 100% TPM/RPM utilization)
Include historical usage trends over the past 30 days
Show peak usage patterns and growth projections
2. Legitimate Business Case
Enterprise production workload (not experimental/personal projects)
Customer impact — how quota limits are blocking business operations
Technical justification — why you need the specific model/quota (latency, concurrency, etc.)
3. Payment and Account History
No outstanding balances
Paid subscription (not Pay-As-You-Go with payment issues)
Good standing (no abuse flags or violations)
4. Supporting Documentation
Architecture diagrams showing how you'll use the quota
Customer contracts/SLAs demonstrating business need
Load test results justifying the requested capacity
The Request Process
Submit the official quota request form: **Azure OpenAI Quota Request Form**
Include ALL required details — incomplete requests are rejected
Expect 2-5 business days for review (priority based on usage evidence)
Be prepared to follow up — if no response in 5 days, create a support ticket referencing your quota request ID
**Tier-Specific Limits (as of March 2026)**
| Tier | GPT-4o TPM (Global Standard) | GPT-4o-mini TPM | Notes |
|---|---|---|---|
| T1 | 1,000 | 15,000 | Default |
| T1 | 1,000 | 15,000 | Default |
| T2 | 15,000 | 30,000 | Auto-upgrade |
| T3 | 30,000 | 70,000 | Auto-upgrade |
| T4 | 100,000 | 330,000 | Auto-upgrade |
| T5 | 1,000,000+ | 1,000,000+ | Manual request |
Pro Tips for Success
Start with T4 first — requesting T5 immediately often gets denied. Get to T4 via automatic upgrades first
- Request specific models — don't ask for "all models" quota is model-specific
Include metrics — "We hit 95% TPM utilization for gpt-4o-mini for 14/30 days" is compelling evidence
Use the quota request form — support tickets get redirected to the form anyway
If it helps kindly accept the answer.
Best Regards,
Jerald Felix