How can I request a quota increase for gpt-35-turbo-instruct?
I need to use gpt-35-turbo-instruct
for a research project where I need to use the completions API, not the chat API. This project also requires querying many tokens, approximately ~1 billion total.
According to this azure docs page, the limit for regions eastus
and swedencentral
should be 240k TPM for this model. My quota for these regions is only 30k TPM though, much less than what I need. When I click the link next to my deployment link for "Request for Quota Increase", I get a form with a list of models to choose from. The problem is that gpt-35-turbo-instruct
is not one of the models listed in this dropdown list.
I have submitted multiple quota increases where I explicitly say "Don't increase the quota for the model I requested, instead please increase my quota for gpt-35-turbo-instruct". These requests always get approved, but my quota does not increase for gpt-35-turbo-instruct
, instead my quota gets approved for the model I selected from the dropdown. This happens even when I write "DO NOT APPROVE THIS REQUEST" in multiple text fields, so it seems like nobody is actually reading these forms.
Is there a way I can request a quota increase for gpt-35-turbo-instruct
, or is there a way I there a way I can escalate this request to an Azure support rep who can help me? Thanks.