@Oon Chee Yee Thanks for the question, I could see 240 TPM used in your snapshot. The quota is per subscription per region, so each subscription will have full quota per model. In Azure AI Studio under quota section, it states per subscription see in image pasted here. Also. in the documentation (here Manage Azure OpenAI Service quota - Azure Cognitive Services | Microsoft Learn) it states "Quota is assigned to your subscription on a..." so everything is by subscription.
Azure AI Studio showing incorrect quota information
My Azure AI studio quota is showing wrong quota information. The limit is 240, but I only have one model deployed for 120, but it shows that I already hit my limit, and cannot deploy another model in a different resources
Azure AI services
2 answers
Sort by: Most helpful
-
-
Janarthanan S 700 Reputation points
2023-09-06T16:25:53.4633333+00:00 Quota is assigned to your subscription on a per-region, per-model basis in units of Tokens-per-Minute (TPM). When you onboard a subscription to Azure OpenAI, you'll receive default quota for most available models. Then, you'll assign TPM to each deployment as it is created, and the available quota for that model will be reduced by that amount. You can continue to create deployments and assign them TPM until you reach your quota limit. Once that happens, you can only create new deployments of that model by reducing the TPM assigned to other deployments of the same model (thus freeing TPM for use), or by requesting and being approved for a model quota increase in the desired region.
Regards,
Janarthanan S