Clay Casper Greetings!
Is this an accurate understanding of the pricing model and if so, why is the hosting for a fine-tuned model so expensive? Maybe there's some complex storage mechanism for fine-tuned models, idk I'm not an AI expert.
Your understanding is correct. As mentioned in the blogpost, Fine tuned models are subject to an hourly hosting charge, as well as token based pricing for input and output data.
There is a $7/hour hosting charge for GPT3.5-turbo fine-tuned model in Azure OpenAI Service. This hourly cost is for hosting the fine-tuned model on Azure infrastructure, which includes the cost of compute, storage, and networking resources.
The reason for the high cost of hosting a fine-tuned model is due to the complexity of the model and the resources required to run it. Fine-tuning a model involves training it on a large dataset to improve its accuracy for a specific task. This requires a lot of computational power and storage resources. Additionally, the fine-tuned model needs to be hosted on a powerful server to handle the increased workload. All of these factors contribute to the high cost of hosting a fine-tuned model.
We strongly recommend reviewing the pricing information for fine-tuning prior to make sure you are comfortable with the associated costs.
Also, please see Customize a model with fine-tuning (preview) and Plan to manage costs for Azure OpenAI Service for more details.
Do let me know if that helps or have any further queries.
If the response helped, please do click Accept Answer
and Yes
for was this answer helpful.
Doing so would help other community members with similar issue identify the solution. I highly appreciate your contribution to the community.