Request for Steps to Increase TPM Limit for Azure OpenAI GPT-4.1

jin 20 Reputation points
2025-04-22T02:41:11.96+00:00

I am trying to use Azure OpenAI's GPT-4.1.

Currently, I would like to use GPT-4.1, but I have discovered that the TPM limit is too low for production use. When I checked the "Azure OpenAI Service: Request for Quota Increase" page, I found that GPT-4.1 was not available under the "13. Global Standard Model" option.

User's image

Could you please guide me on the steps required to increase the TPM limit for GPT-4.1?

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,101 questions
0 comments No comments
{count} votes

Accepted answer
  1. Pavankumar Purilla 8,570 Reputation points Microsoft External Staff Moderator
    2025-04-22T04:23:07.8233333+00:00

    Hi jin,

    The GPT-4.1 model was recently released and is an interesting addition to the range of advanced AI models. However, because this model is new, it is not yet available for quota requests in application forms. Rest assured that this option will be enabled soon. We are doing our best to make it available as soon as possible. Once the update is implemented, you can seamlessly request quotas. Thank you for your patience and understanding in this matter. In the meantime, if you have any further questions or need assistance, please feel free to contact us.

    In the meantime, we recommend that you create a deployment with a lower token per minute (TPM) limit to see if it helps you successfully initialize your deployment.

    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.