Need help identifying a sudden spike in usage of Azure OpenAI resources. Want to see where the calls are coming from.

Yamini Modipalli 5 Reputation points
2026-01-27T17:31:21.4666667+00:00

The usage and billing for Azure OpenAI services in our Dev and UAT tenants rose sharply starting last Thursday, 01/22.

We checked our system logs and confirmed the load is not from our application. Suspecting bad actors, we regenerated the keys and later deleted them to consolidate the resources.

We would like to know what might have caused the spike.

Azure OpenAI in Foundry Models

2 answers

  1. Anshika Varshney 10,655 Reputation points Microsoft External Staff Moderator
    2026-01-27T19:06:48.4233333+00:00

    Hi Yamini Modipalli,

    Thank you for reaching out on the Microsoft Q&A.

    A sudden spike like this is understandably concerning, especially when you’ve already confirmed it didn’t originate from your application.

    At the platform level, Azure OpenAI itself doesn’t expose per‑caller or per‑IP request attribution. If keys are compromised or reused unintentionally, the service will still process those requests and meter usage normally. This means that once usage is observed, the investigation has to rely on Azure-side monitoring and diagnostics, rather than application logs alone.

    A few things that may help narrow the scope:

    Azure Monitor metrics: Review metrics such as Tokens Processed, Requests, and Latency for the affected OpenAI resource. These can sometimes indicate whether the spike was due to a burst of short requests, a smaller number of very large prompts, or sustained traffic over time. Azure OpenAI exposes these metrics but not caller identities.
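
    As a rough illustration of that distinction, the sketch below compares one spike interval against a few baseline intervals using request and token counts copied out of the metrics view. The function name, the tuple layout, and the 2x threshold are assumptions made for this example and are not part of any Azure API.

```python
from statistics import median


def describe_spike(baseline, spike, factor=2.0):
    """Classify a spike bucket against baseline buckets.

    Each bucket is a hypothetical (label, requests, tokens) tuple taken
    from exported metrics. `factor` is an arbitrary threshold for what
    counts as growth.
    """
    # Typical request count and tokens-per-request in normal periods.
    base_requests = max(median(r for _, r, _ in baseline), 1)
    base_ratio = max(median(t / max(r, 1) for _, r, t in baseline), 1)

    _, requests, tokens = spike
    request_growth = requests / base_requests
    ratio_growth = (tokens / max(requests, 1)) / base_ratio

    if request_growth >= factor and ratio_growth >= factor:
        return "more requests and larger requests"
    if request_growth >= factor:
        return "more requests of similar size"
    if ratio_growth >= factor:
        return "similar request count, larger requests"
    return "no clear spike"


# Made-up numbers for illustration only.
baseline = [("h1", 100, 50_000), ("h2", 120, 60_000), ("h3", 110, 55_000)]
burst = describe_spike(baseline, ("h4", 1000, 500_000))
large = describe_spike(baseline, ("h4", 100, 500_000))
```

    A result of "more requests of similar size" would point toward scripted or automated calls, while "similar request count, larger requests" would suggest a few very large prompts; neither identifies the caller.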

    Diagnostic logs (if enabled): If diagnostic settings were enabled before the spike, you may be able to see request volume and timing patterns exported to Log Analytics, Event Hub, or Storage. If diagnostics were not enabled at the time, unfortunately there’s no historical reconstruction available afterward.
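
    If logs were exported, one way to look at volume and timing is to bucket the records by hour. The sketch below assumes the records have already been loaded as dictionaries with `TimeGenerated` and `OperationName` fields; those field names and the sample values are assumptions for illustration, so adjust them to whatever your export actually contains.

```python
from collections import Counter
from datetime import datetime


def hourly_counts(records):
    """Count records per (hour, operation) pair.

    `records` is an iterable of dicts. The keys "TimeGenerated" (ISO 8601
    timestamp) and "OperationName" are assumed field names.
    """
    counts = Counter()
    for rec in records:
        # Accept a trailing "Z" as UTC so fromisoformat can parse it.
        ts = datetime.fromisoformat(rec["TimeGenerated"].replace("Z", "+00:00"))
        hour = ts.strftime("%Y-%m-%d %H:00")
        counts[(hour, rec.get("OperationName", "unknown"))] += 1
    return counts


def busiest(counts, n=3):
    """Return the n busiest (hour, operation) buckets, highest first."""
    return counts.most_common(n)


# Made-up sample records for illustration only.
sample = [
    {"TimeGenerated": "2026-01-22T10:05:00Z", "OperationName": "ChatCompletions_Create"},
    {"TimeGenerated": "2026-01-22T10:40:00Z", "OperationName": "ChatCompletions_Create"},
    {"TimeGenerated": "2026-01-22T11:02:00Z", "OperationName": "Embeddings_Create"},
]
counts = hourly_counts(sample)
top = busiest(counts, n=1)
```

    Comparing the busiest buckets against your application's own request log for the same hours can show which periods contain traffic you cannot account for.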

    Key regeneration was the right immediate action: Rotating and deleting keys is the correct containment step when unexpected usage is detected. This prevents any further unauthorized access going forward, even if the exact source can’t be identified retroactively.

    If, after reviewing metrics and logs, the source of the spike is still unclear, do let me know.

    For future prevention, consider:

    • Enabling diagnostic logs permanently on Azure OpenAI resources
    • Using separate keys per environment
    • Applying network restrictions or private endpoints, where feasible

    I hope this provides some direction, and thank you for sharing the details.

    Thank you!


  2. Q&A Assist
    2026-01-27T17:31:30.61+00:00
    AI generated content. This question contains an answer created with AI from Q&A Assist.

    To identify the sudden spike in usage of Azure OpenAI resources, you can utilize Azure Monitor, which collects and aggregates metrics and logs from your Azure resources. This service can help you analyze the data to determine where the calls are coming from and potentially identify any unauthorized access or misuse.

    Here are some steps you can take:

    1. Set Up Monitoring: Ensure that Azure Monitor is configured for your OpenAI resources. This will allow you to track metrics related to usage, performance, and availability.
    2. Analyze Metrics: Use the metrics dashboard available in the Azure portal to review the usage patterns. Look for unusual spikes in HTTP requests or token-based usage that could indicate excessive or unauthorized access.
    3. Review Logs: Check the logs for any unusual activity or patterns that could point to the source of the spike. If logging was not enabled prior to the spike, consider enabling it for future monitoring.
    4. Engage Support: If you cannot determine the cause of the spike through your analysis, consider creating a support request with Azure to get assistance in investigating the issue further.

    By following these steps, you should be able to gather insights into the usage spike and take appropriate actions to secure your resources.

