Total Tokens Metric Shows No Data Despite Active Requests - Azure OpenAI Metrics

Aidan Mattrick 0 Reputation points
2025-07-04T16:18:57.06+00:00

I'm having an issue with Azure OpenAI metrics where I can see requests are being tracked (showing 9 Azure OpenAI Requests), but when I select "Total Tokens (Sum)" for the same timeframe, it shows no data (displays "--").

What I can see:

  • Azure OpenAI Requests (Sum): requests recorded
  • Total Tokens (Sum): No data (shows "--")
  • Same timeframe (Jul 04 7:12 AM)
  • Same resource (-openai-prod)

What I've tried:

  • Confirmed I'm using "Sum" aggregation for the token metric. Have also tried "Count" and others.
  • Verified I'm looking at the "Azure OpenAI - Usage" category for token metrics
  • Checked that requests are showing data for the same time period

The requests are clearly going through since I can see them in the metrics, but the corresponding token usage isn't appearing (attached image, Screenshot 2025-07-04 at 9.16.20 AM.png). Any guidance on troubleshooting this would be appreciated.

Azure AI services
Azure AI services
A group of Azure services, SDKs, and APIs designed to make apps more intelligent, engaging, and discoverable.
3,632 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Amira Bedhiafi 34,101 Reputation points Volunteer Moderator
    2025-07-05T12:15:20.5866667+00:00

    Hello Aidan !

    Thank you for posting on Microsoft Learn.

    I think you are experiencing a delay or lack of token-level metric emission for certain configurations or request patterns in Azure OpenAI.

    Keep in mind that not all requests emit Total Tokens as only requests that involve completions (chat/completions or completions endpoints) emit token data.

    Embedding or moderation calls may show request counts but not token data.

    If you're using tools like the playground or embedding APIs, token usage may not be tracked yet in the same way.

    Don't forget that Azure may delay token metric publication by up to 15-30 minutes so if you're checking immediately after sending requests, try selecting a wider historical time range.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.