Improve your reliability with modern operations practices: Learning from failure

DevOps Engineer
Solution Architect
Technology Manager

Incidents will happen—there’s no doubt about that. The key question is whether you will treat them as a learning opportunity to make your operations practice better or just as a loss of time, money, and reputation. Learn the practice and the Azure tools that can help level up your operations practice by learning from failure.

Learning objectives

In this module you will:

  • Discover the importance of learning from incidents
  • Understand the aspects of complex systems that make learning from failure important
  • Learn when and how to conduct a post-incident review
  • Understand the purpose and goals of a post-incident review
  • Learn the components that go into a good post-incident review
  • Explore the Azure tools that can assist with getting started with post-incident reviews
  • Become aware of common traps to avoid
  • Identify helpful practices to conduct a better review