Azure OpenAI PTU usage of fine-tuned model

mh0.song 0 Reputation points
2025-02-04T01:00:42.5133333+00:00

Hello,

I would like to calculate PTU estimates for a fine-tuned model,

but fine-tuned models aren’t listed in the capacity calculator.

I also couldn’t find any explanation about PTU usage for fine-tuned models in the documentation at https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/provisioned-throughput-onboarding.

I’m wondering if the PTU usage for a fine-tuned model is the same as for its base model.

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,080 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Vikram Singh 2,585 Reputation points Microsoft Employee Moderator
    2025-02-04T05:07:54.6233333+00:00

    Hi mh0.song,

    Thanks for posting your question on Microsoft Q&A.

    To estimate PTUs (Provisioned Throughput Units) for a fine-tuned model in Azure OpenAI, it's essential to know that PTUs are model-independent. This means the PTU usage for a fine-tuned model is generally the same as its base model. PTUs can be used to deploy any supported model/version in the region. Fine-tuning adjusts the base model's weights to improve performance on specific tasks without significantly changing its architecture or computational needs.

    For more details, refer to these Microsoft documentation links:

    Provisioned Throughput Units Onboarding

    Customize a Model with Fine-Tuning

    If this reply was helpful, please upvote and/or accept it as an answer. Let me know if you have any other questions.

    Thank you!


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.