Low Azure OpenAI Quota Limit

duy phong đào trọng 0 Reputation points
2024-07-28T09:31:03.1366667+00:00

Hi everyone,

My company started using Azure OpenAI services a few days ago, and I've been tasked with deploying them. I created the resource and deployed the development environment in Azure OpenAI Studio, but I'm facing an issue where my quota limit is capped at just 1K tokens per minute.

I've deployed the same service on other Azure accounts, and all of them were automatically assigned much higher quota limits without requiring any additional steps like submitting a quota increase request.

Does anyone know why this might be happening or how to resolve this issue? Any insights or suggestions would be greatly appreciated.

Thanks in advance!

User's image

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,244 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. YutongTie-MSFT 52,866 Reputation points
    2024-07-28T23:45:01.0466667+00:00

    Hello @duy phong đào trọng

    Thanks for reaching out to us, as you just started to use this service, the quota assigned to you may start from a average level for every model.

    You can easily request for more quota according to your business need. There is a button on the right which can direct you to the request form as below screenshot. Please feel free to request it.

    User's image

    The gating team will review on your request and provide the quota accordingly.

    Thanks for reaching out to us again and I hope it helps.

    Regards,

    Yutong

    -Please kindly accept the answer if you feel helpful to support the community, thanks a lot.

    0 comments No comments

  2. Rob Accardi 0 Reputation points
    2024-07-31T16:29:18.97+00:00

    I have the same problem. Is it a problem or is this normal behavior? I don't know. But I've put in a request for a quota increase and am waiting to hear back.

    For what it's worth, I deployed a model yesterday and I did not notice any warnings about low quota on the new deployment form. I did not visit the quotas page yesterday so I cannot say whether that page would have listed a 1K TPM limit for most models. I've since deleted and purged the deployment I performed yesterday. I wonder if there's a bug caused by deleting a deployment where your next deployment will be limited to 1K TPM

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.