Still got Azure OpenAI Insufficient quota error after reducing existing TPM and RPM

L P 0 Reputation points
2025-05-06T06:24:43.3333333+00:00

In Azure AI Foundry, I deploy a GPT4.1 with 20K TPM and 20 RPM

User's image I still have 30K quota remain, and I try to deploy a GPT4.1-mini, but I got error "Insufficient quota for selected options"

User's image

Thank you!

Update: I have tried delete the GPT4.1, still got the same error when deploying GPT4.1-mini

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,632 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Saideep Anchuri 9,500 Reputation points Moderator
    2025-05-06T06:43:19.6733333+00:00

    Hi L P

    It seems that you are encountering an "Insufficient quota" error despite having a remaining quota of 30K.

    1. Model-Specific Quota Limits: Each model has its own maximum Tokens-Per-Minute (TPM) allocation. For the GPT-4.1 model, the default quota limit is 1M TPM, and for the GPT-4.1-mini, it is also 1M TPM. If the combined TPM of your existing deployments exceeds your total quota, you may not be able to create additional deployments.
    2. Requests-Per-Minute (RPM): The RPM is also a limiting factor. For GPT-4.1, the RPM is set at a specific ratio to the TPM. If your current deployments are consuming too much of your RPM allocation, it could prevent new deployments.
    3. Quota Allocation: When you assign TPM to a deployment, it reduces the available quota for that model. If you have already allocated a significant amount of your quota to the GPT-4.1 deployment, it may limit your ability to deploy the GPT-4.1-mini.

    If the current quota is not enough, you can request a quota increase for the specific resources needed for the GPT4.1-mini deployment. You can do this by following these steps:

    • Go to the Azure portal.
    • Select Help + support.
    • Choose New support request.
    • Provide the necessary information, such as the resource type (GPT4.1-mini), the subscription, and the specific quota you need to increase.
    • Submit the request for a quota increase.

    Kindly refer below link: quota

    Thank You.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.