Unable to provision more PTU for gpt4.1

Question

Unable to provision more PTU for gpt4.1

Jeffrey Lau 0

I am getting this error in my project
{ "error": { "code": "InvalidCapacity", "message": "There's no available capacity to scale out by 50 PTU for the current request." } }

However when I look at the quota and available ptu it say 1750. Not sure where the issue is.

Manas Mohanty 6,115 Reputation points Microsoft External Staff Moderator

2025-05-19T17:04:50.02+00:00

Hi Jeffrey Lau

Agree to Jerald Felix pointer on using other available regions. You can try again later if you want to deploy in same region.

Reference - https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/provisioned-get-started#create-your-provisioned-deployment--capacity-is-not-available

Thank you.
Manas Mohanty 6,115 Reputation points Microsoft External Staff Moderator

2025-05-21T09:21:47.8466667+00:00

Hi Jeffrey Lau

We have not heard from you.

Hope you have been able to use your balance PTU quota in other region or later time.

Thank you.

1 answer

Your answer

Manas Mohanty 6,115 Reputation points Microsoft External Staff Moderator

2025-05-19T17:04:50.02+00:00

Hi Jeffrey Lau

Agree to Jerald Felix pointer on using other available regions. You can try again later if you want to deploy in same region.

Reference - https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/provisioned-get-started#create-your-provisioned-deployment--capacity-is-not-available

Thank you.
Manas Mohanty 6,115 Reputation points Microsoft External Staff Moderator

2025-05-21T09:21:47.8466667+00:00

Hi Jeffrey Lau

We have not heard from you.

Hope you have been able to use your balance PTU quota in other region or later time.

Thank you.

Answer 1

Hello Jeffrey Lau,

The error is not about your quota (i.e., what you're allowed to use) — it's about regional availability of actual compute capacity at the moment you're making the request.

Even though your quota shows 1750 PTU available, the region (e.g., East US or Switzerland Central) currently doesn’t have enough free physical capacity to allocate an additional 50 PTUs for GPT-4.1.

Capacity issues are often temporary. Retry after a few minutes or during non-peak hours. If your workload allows, try deploying in another region (e.g., West US, France Central, or East Asia) that has better capacity.

Please check model summary table for other region selection

Tips to Investigate Further

Go to Azure Portal > Azure OpenAI > Your Resource > Usage + Quotas.

Check both:

Quota Limit (what you're allowed)

Current Usage & Regional Capacity

Also, ensure you're not:

Requesting a non-existent SKU or region combination.
Trying to scale multiple deployments at once without available capacity.

You can find available regional capacity using Capacity API as mentioned in below section

https://learn.microsoft.com/en-us/azure/ai-services/openai/quotas-limits?tabs=REST#regional-quota-capacity-limits

Best Regards,

Jerald Felix

Share via

Unable to provision more PTU for gpt4.1

1 answer

Your answer