How to track costs for Whisper and GPT-4O transcription models?

imgkj 45 Reputation points
2025-06-19T07:02:16.4166667+00:00

We are implementing audio to text translation and transcription using Whisper and GPT-4O transcription models. The goal is to track the costs associated with these models. While there is documentation and a price calculator explaining the cost per token, guidance is needed on how to calculate the size of tokens per request. This information would be beneficial for comparing costs between the transcription models and understanding the token calculation process.

Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
4,115 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Pavankumar Purilla 8,665 Reputation points Microsoft External Staff Moderator
    2025-06-19T07:57:13.5466667+00:00

    Hi imgkj,

    When using Whisper and GPT-4O transcription models in Azure OpenAI, the way you track and calculate costs is different for each model.

    For Whisper:

    • You are charged based on the length of your audio file.
    • The cost is calculated per minute of audio. For example, if you transcribe a 10-minute audio file, you pay for 10 minutes.
    • You don’t need to worry about the number of words or characters in the transcript—just the audio duration.
    • You can check the exact pricing here: Azure AI Speech Pricing.

    For GPT-4O Transcription:

    You are charged based on the number of tokens in the transcript.

    A token is about 4 characters of text (for English). For example, the word “hello” is one token.

    To estimate your cost, count the number of characters in your transcript and divide by 4 to get the number of tokens.

    How to Track Costs:

    For Whisper: Just keep track of the total minutes of audio you process.

    For GPT-4O: Keep track of the number of characters in your transcripts, divide by 4 to estimate tokens, and use the token price for your cost calculation.

    For more information:
    Azure OpenAI Service Pricing
    Azure AI Speech Pricing
    Azure OpenAI Models Overview

    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.