Azure OpenAI quota increase request

Mohammad 0 Reputation points


My company plans to use GPT4 engine to construct an application. I'm responsible for selecting the right services that can meet our needs. However, the default quota and limitations (token per minute) won't cover the expectations. I've read that the quota increase request might be refused and the processing of the requests can be paused.

I wanted to know that how easy is it to scale up the model when deployed on Azure? Is it better that we choose the OpenAi services instead of Azure?

Thank you

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
1,414 questions
Azure App Service
Azure App Service
Azure App Service is a service used to create and deploy scalable, mission-critical web apps.
6,030 questions
Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
1,871 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Janarthanan S 540 Reputation points

    Hi @Mohammad

    Quota is assigned to your subscription on a per-region, per-model basis in units of Tokens-per-Minute (TPM). When you onboard a subscription to Azure OpenAI, you'll receive default quota for most available models.

    Quota Tokens-Per-Minute (TPM) allocation is not related to the max input token limit of a model. Model input token limits are defined in the models table and are not impacted by changes made to TPM.

    For increasing Quota limit and request please find documentation.

    Please try out these steps with your data and check if it works. Hope this answer helps you with solution! Please comment below if you need any assistance on the same. Happy to help!

    I hope the solution is useful to you and then accept the answer.


    Janarthanan S

    0 comments No comments