Cannot set capacity when using the Azure CLI to deploy gpt-4o-mini

Tony Chen 0 Reputation points
2024-08-01T04:29:28.9+00:00

Hi,

I am using the command below to deploy the gpt-4o-mini model, but the deployment is created with a capacity of 10 instead of the requested 450.

az cognitiveservices account deployment create --model-format OpenAI --model-name gpt-4o-mini --model-version 2024-07-18 --name 4omini_test --resource-group 4omini_test --deployment-name gpt-4o-mini --capacity 450 --debug

Does anyone know how I should adjust the command, or have any other suggestions?

Thanks.

Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.

2 answers

  1. Tony Chen 0 Reputation points
    2024-08-01T07:46:15.9733333+00:00

    I added "--sku standard" to the command, and it works now.

    az cognitiveservices account deployment create --model-format OpenAI --model-name gpt-4o-mini --model-version 2024-07-18 --name 4omini_test --resource-group 4omini_test --deployment-name gpt-4o-mini --sku standard --capacity 450 --debug


  2. AshokPeddakotla-MSFT 35,971 Reputation points Moderator
    2024-08-01T08:02:35.8733333+00:00

    Tony Chen, I'm glad that your issue is resolved. Thank you for posting your solution so that others experiencing the same problem can easily reference it!

    Because the Microsoft Q&A community has a policy that question authors cannot accept their own answers, only answers by others, I'll repost your solution in case you'd like to accept the answer.

    Issue:

    I am using the command below to deploy the gpt-4o-mini model, but the deployment is created with a capacity of 10 instead of the requested 450.

    az cognitiveservices account deployment create --model-format OpenAI --model-name gpt-4o-mini --model-version 2024-07-18 --name 4omini_test --resource-group 4omini_test --deployment-name gpt-4o-mini --capacity 450 --debug

    Solution:

    Add "--sku standard" to the command; it works well now.

    az cognitiveservices account deployment create --model-format OpenAI --model-name gpt-4o-mini --model-version 2024-07-18 --name 4omini_test --resource-group 4omini_test --deployment-name gpt-4o-mini --sku standard --capacity 450 --debug

    If you have any other questions, please let me know. Thank you again for your time and patience throughout this issue.
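
For anyone reusing the fix from this thread, here is a minimal sketch that wraps the working command in a small shell function and adds a read-back step to confirm the capacity that was actually applied. The function names `deploy` and `show_sku` and the variables are hypothetical helpers, not part of the Azure CLI. The `--query sku` filter assumes the deployment JSON returned by `az cognitiveservices account deployment show` exposes a top-level `sku` object, so check the raw output if the query returns nothing.

```shell
#!/bin/sh
set -eu

# Values taken from the thread; replace them with your own.
ACCOUNT_NAME="4omini_test"
RESOURCE_GROUP="4omini_test"
DEPLOYMENT_NAME="gpt-4o-mini"
SKU="standard"
CAPACITY=450

# deploy: "$@" is the program to invoke.
# Pass `az` for a real run, or `echo az` to print the command only.
deploy() {
  "$@" cognitiveservices account deployment create \
    --model-format OpenAI \
    --model-name gpt-4o-mini \
    --model-version 2024-07-18 \
    --name "$ACCOUNT_NAME" \
    --resource-group "$RESOURCE_GROUP" \
    --deployment-name "$DEPLOYMENT_NAME" \
    --sku "$SKU" \
    --capacity "$CAPACITY"
}

# show_sku: read the deployment back and print its SKU.
# Assumption: the returned JSON has a top-level "sku" object.
show_sku() {
  "$@" cognitiveservices account deployment show \
    --name "$ACCOUNT_NAME" \
    --resource-group "$RESOURCE_GROUP" \
    --deployment-name "$DEPLOYMENT_NAME" \
    --query sku
}

# Dry run: print both commands without calling Azure.
deploy echo az
show_sku echo az

# Real run (requires a prior `az login`):
# deploy az && show_sku az
```

The dry run prints the exact command lines so they can be reviewed before anything is created. Uncomment the last line to run them against your subscription.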

