Can't deploy Azure Open AI models due "No quota is available for this deployment. You can request for more quota."

Guillermo Subirán 36 Reputation points
2023-07-11T17:53:59.6166667+00:00

Today, I wanted to test the gpt-35-turbo-16k model, so I attempted to deploy it. However, I received a message stating that there was no quota available. I initially assumed that it might be due to an existing deployment of the gpt-35-turbo model, so I deleted it. Unfortunately, I encountered the same issue even after removing the previous deployment. Furthermore, I tried redeploying the gpt-35-turbo model but was unable to do so due to the same problem.

User's image

This is my actual quota:
User's image

And those are the models that I have deployed:

User's image

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
3,362 questions
{count} votes

Accepted answer
  1. AshokPeddakotla-MSFT 35,096 Reputation points
    2024-05-16T03:37:37.1733333+00:00

    Guillermo Subirán I'm glad that your issue is resolved and thank you for posting your solution so that others experiencing the same thing can easily reference this!

    Since the Microsoft Q&A community has a policy that the question author cannot accept their own answer, they can only accept answers by others, I'll repost your solution in case you'd like to Accept the answer.

    Error Message:

    I wanted to test the gpt-35-turbo-16k model, so I attempted to deploy it. However, I received a message stating that there was no quota available. I initially assumed that it might be due to an existing deployment of the gpt-35-turbo model, so I deleted it. Unfortunately, I encountered the same issue even after removing the previous deployment. Furthermore, I tried redeploying the gpt-35-turbo model but was unable to do so due to the same problem.

    User's image

    This is my actual quota: User's image

    And those are the models that I have deployed:

    User's image

    Solution:

    As I mentioned earlier, In East US, 0613 model has been temporarily disabled due to the current investigations of customer reported issues. We expect this should be resolved in the next week.

    After few days issue is resolved. In your case, work around was suggested to add a role assignment.

    if you deploy the model with an account with subscription level permissions you will be able to deploy the models. In my case my user has resource group level permissions so I think is unable to "read" if there is enough quota available to do the deploy.

    The documentation is also updated with more details on role clarity.

    User's image

    If you have any other questions, please let me know. Thank you again for your time and patience throughout this issue.

    0 comments No comments

1 additional answer

Sort by: Most helpful
  1. AshokPeddakotla-MSFT 35,096 Reputation points
    2023-07-13T15:48:21.14+00:00

    Guillermo Subirán

    Another weird thing that occurred today was that the gpt-35-turbo-16k model disappeared from the list of deployable models

    I have checked internally on this ask and below is an update.

    In East US, 0613 model has been temporarily disabled due to the current investigations of customer reported issues. We expect this should be resolved in the next week.

    I will update here once I hear anything about the availability. Hope this helps.

    Update: Good news!

    gpt-35-turbo 0613 & gpt-35-turbo-16k models are available now in East US for deployment. Also, the limit related to 8k is resolved as well. Let us know if you are still seeing the error.

    I just tried and able to deploy successfully.

    User's image

    Do let us know if you have any other queries.

    If the response helped, please do click Accept Answer and Yes for was this answer helpful.

    Doing so would help other community members with similar issue identify the solution. I highly appreciate your contribution to the community.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.