Monitoring Subscription/Resource scope 'cloud uptime' in Azure

Kenneth Huddleston 145 Reputation points
2023-03-24T17:39:19.57+00:00

My business line would like to expand and automate our reporting and handling of Azure outages and incidents, to give us a more reliable 'cloud uptime' metric as it relates to our own Azure ecosystem. We are not concerned (yet) about the uptime of our VMs specifically, what we are looking to do is monitor and log outages and failures to generate an approximate 'cloud uptime'.

I was digging through Microsoft documentation and I see that the Microsoft Resource Health REST API appears to provide tailored outage metrics which can be filtered per subscription or per resource.

  • Can I consider this information to be a fairly reliable snapshot of all outages/downtime microsoft has recorded for our specific ecosystem?
  • Is there a better resource/api to utilize? Or is there another recommended approach to creating a tailored cloud uptime reporting pipeline (again, not taking into account 'local' uptime of VMs, but speaking to larger availability trends)?

Just looking for some input from someone who 'knows' these APIs and tools before I spend a significant amount of time digging myself into a hole. I appreciate any feedback you can provide.

Azure Monitor
Azure Monitor
An Azure service that is used to collect, analyze, and act on telemetry data from Azure and on-premises environments.
3,645 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Sina Salam 22,031 Reputation points Volunteer Moderator
    2023-03-26T06:24:45.8333333+00:00

    @Kenneth Huddleston

    Welcome again to Microsoft Q&A and thank you for posting your questions here!

    I can assure you the best resources so far are Microsoft resources for any of Microsoft technologies. To efficiently monitor and log outages and failures to generate an approximate 'cloud uptime'. You have done so well to have this resourceful link Azure Resource health REST API that contain APIs for Azure REST operation groups.

    Kindly, consider the information to be enough and a reliable best practice.

    You can focus more on Azure Monitor, which you can use to monitor and log outages and failures with associated APIs.

    Wish you good success.

    Kindly let me know if the above helps or you need further assistance or explanation on this issue.

    Best Regards,

    Sina

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.