Deploying the gpt-4o-mini model with the Global Standard deployment type results in an error when using Azure AI Studio

JP 30 Reputation points
2024-09-09T15:10:48.6333333+00:00

Deploying the gpt-4o-mini model with the Global Standard deployment type results in an error when using Azure AI Studio. What is the best way to resolve this with the least administrative effort?

User's image

Steps to reproduce

  1. Go to Azure OpenAI Studio and click the Deployments navigation menu item on the left side bar
  2. Click the Deploy model button and choose the Deploy base model option
  3. Select gpt-4o-mini from the model options on the left-side column followed by clicking the Confirm button
  4. Enter the Deployment details
    1. Deployment name: sample-gpt-4o-mini
    2. Model version: [select any]
    3. Deployment type: Global standard
    4. Tokens per Minute Rate Limit: [select 1K or above]
    5. Content filter: [select any]
  5. Click the Deploy button
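
For anyone who wants to write down the same deployment details outside the portal, here is a minimal sketch in Python. It only builds a request body as a dictionary and sends nothing. It assumes the usual Azure Resource Manager shape for a deployment (a `sku` with `name` and `capacity`, and a `properties.model` block), that the Global standard type maps to the SKU name `GlobalStandard`, and that capacity is counted in units of 1K tokens per minute. The function name `build_deployment_body` is hypothetical, and the model version is left as a placeholder because the steps above say to select any.

```python
import json


def build_deployment_body(model_name, model_version, sku_name, capacity_k_tpm):
    """Return a deployment request body as a plain dict (nothing is sent)."""
    # Assumption: capacity is expressed in thousands of tokens per minute,
    # so 1 corresponds to the "1K" option in the portal.
    if capacity_k_tpm < 1:
        raise ValueError("capacity must be at least 1 (1K tokens per minute)")
    return {
        "sku": {"name": sku_name, "capacity": capacity_k_tpm},
        "properties": {
            "model": {
                "format": "OpenAI",
                "name": model_name,
                "version": model_version,
            }
        },
    }


# "<model-version>" is a placeholder; substitute the version you selected.
body = build_deployment_body("gpt-4o-mini", "<model-version>", "GlobalStandard", 1)
print(json.dumps(body, indent=2))
```

If the subscription has no quota for this model and deployment type, a request with these values would be expected to fail in the same way the portal does.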

Issue

  1. The Tokens per Minute Rate Limit (thousands) slider is disabled and cannot be moved
  2. The slider can be moved when Global Batch is selected as the Deployment type

Illustrations

  1. Example of Global Standard deployment type
    User's image
  2. Example of Global Batch deployment type
    User's image
  3. Example of
Azure AI services

2 answers

  1. JP 30 Reputation points
    2024-09-09T15:21:31.36+00:00

    I found more information: according to the documentation, a quota increase must be requested. Closing question.

    User's image

    Steps to request a quota increase

    1. Go to Azure OpenAI Studio and click the Quota navigation menu item on the left side bar
    2. Click to expand the Deploy model to verify it contains the deployment you wish to increase the quota for
    3. Select the Request quota icon button to be redirected to the Azure OpenAI Service: Request for Quota Increase page

    Example Azure OpenAI Service: Request for Quota Increase page

    User's image


  2. YutongTie-MSFT 51,611 Reputation points
    2024-09-15T22:22:17.0133333+00:00

    Hello JP,

    Thanks for asking this question. I will repost your answer so that you have a chance to accept it, since the original poster cannot accept their own answer. Please feel free to repost it; we appreciate your answer.

    As JP shared, this issue was caused by insufficient quota for gpt-4o-mini. Requesting a quota increase solved the problem.

    Steps to request a quota increase

    1. Go to Azure OpenAI Studio and click the Quota navigation menu item on the left side bar
    2. Click to expand the Deploy model to verify it contains the deployment you wish to increase the quota for
    3. Select the Request quota icon button to be redirected to the Azure OpenAI Service: Request for Quota Increase page
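
To make the reasoning behind the quota request concrete, here is a small Python sketch of the check involved. It assumes quota is tracked per model and deployment type in units of 1K tokens per minute, and that a deployment can only be created when the requested rate limit fits in the unused part of that quota. The function names and the numbers in the example are hypothetical and are not taken from JP's subscription.

```python
def quota_headroom(limit_k_tpm, used_k_tpm):
    """Unused quota in thousands of tokens per minute, never negative."""
    return max(limit_k_tpm - used_k_tpm, 0)


def needs_quota_increase(limit_k_tpm, used_k_tpm, requested_k_tpm):
    """True when the requested rate limit does not fit in the unused quota."""
    return requested_k_tpm > quota_headroom(limit_k_tpm, used_k_tpm)


# Illustrative values only: with no quota assigned, even the smallest
# deployment (1K tokens per minute) cannot be created.
print(needs_quota_increase(limit_k_tpm=0, used_k_tpm=0, requested_k_tpm=1))

# Illustrative values only: 30K assigned, 10K already used, 20K requested.
print(needs_quota_increase(limit_k_tpm=30, used_k_tpm=10, requested_k_tpm=20))
```

When the first kind of result applies, the rate limit slider has nothing to select from, which would be consistent with the disabled slider described in the question.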

    Credit to JP for sharing it and hope this can help others.

    Regards,

    Yutong

