Azure service reliability guides

Below is a list of Azure service reliability guides, organized by service category.

Note

Some service documents are in the process of, or are not yet updated into a single reliability guide format. These may contain more than one document that references reliability guidance.

AI and machine learning

Product Guidance
Azure AI Health Insights Reliability in Azure AI Health Insights
Azure AI Search Reliability in Azure AI Search
Azure Bot Service Reliability in Azure Bot Service
Azure Machine Learning Service Failover for business continuity and disaster recovery

Analytics

Product Guidance
Azure HDInsight Reliability in Azure HDInsight
Azure HDInsight on AKS Reliability in Azure HDInsight on AKS
Azure Machine Learning Service Failover for business continuity and disaster recovery
Azure Stream Analytics Achieve geo-redundancy for Azure Stream Analytics jobs
Azure Event Hubs Reliability in Azure Event Hubs
Azure Data Explorer Business continuity and disaster recovery overview
Azure Data Share Disaster recovery for Azure Data Share
Azure Chaos Studio Reliability in Azure Chaos Studio
Microsoft Fabric Reliability in Microsoft Fabric
Microsoft Purview Reliability in Microsoft Purview

Compute

Product Guidance
Azure App Service Reliability in Azure App Service
Azure Batch Reliability in Azure Batch
Azure Container Apps Reliability in Azure Container Apps
Azure Container Instances Reliability in Azure Container Instances
Azure Functions Reliability in Azure Functions
Azure Kubernetes Service (AKS) Create an Azure Kubernetes Service (AKS) cluster that uses availability zones

High availability and disaster recovery overview for Azure Kubernetes Service (AKS)
Azure Service Fabric Deploy an Azure Service Fabric cluster across Availability Zones

Disaster recovery in Azure Service Fabric
Azure Spring Apps Reliability in Azure Spring Apps
Azure Virtual Machines Reliability in Azure Virtual Machines
Azure Virtual Machine Image Builder Reliability in Azure Virtual Machine Image Builder
Azure Virtual Machine Scale Sets Reliability in Azure Virtual Machine Scale Sets
Azure VMware Solution Deploy disaster recovery using VMware HCX

Containers

Product Guidance
Azure App Configuration How does App Configuration ensure high data availability?

Resiliency and disaster recovery
Azure Container Apps Reliability in Azure Container Apps
Azure Container Instances Reliability in Azure Container Instances
Azure Container Registry Enable zone redundancy in Azure Container Registry for resiliency and high availability

Geo-replication in Azure Container Registry
Azure Kubernetes Service (AKS) Create an Azure Kubernetes Service (AKS) cluster that uses availability zones

High availability and disaster recovery overview for Azure Kubernetes Service (AKS)
Azure Service Fabric Deploy an Azure Service Fabric cluster across Availability Zones

Disaster recovery in Azure Service Fabric

Databases

Product Guidance
Azure SQL Azure SQL - High availability

Disaster recovery guidance - Azure SQL Database
Azure SQL-Managed Instance Failover groups overview & best practices - Azure SQL Managed Instance
Azure Database for MySQL Overview of business continuity with Azure Database for MySQL - Single Server
Azure Database for MySQL - Flexible Server Azure Database for MySQL Flexible Server High availability

Azure Database for MySQL Flexible Server - Restore to latest restore point
Azure Database for PostgreSQL - Flexible Server Reliability in Azure Database for PostgreSQL - Flexible Server
Azure Cosmos DB for NoSQL Reliability in Azure Cosmos DB for NoSQL
Azure Cosmos DB for MongoDB vCore Reliability in Azure Cosmos DB for MongoDB vCore
Azure Cache for Redis Enable zone redundancy for Azure Cache for Redis

Configure passive geo-replication for Premium Azure Cache for Redis instances

Developer tools

Product Guidance
Azure API Center Reliability in Azure API Center

DevOps

Product Guidance
Azure Deployment Environments Reliability in Azure Deployment Environments
Azure DevOps Data availability
Azure Monitor-Log Analytics Enhance data and service resilience in Azure Monitor Logs with availability zones

Log Analytics workspace replication

Hybrid + multicloud

Product Guidance
Azure Operator Nexus Reliability in Azure Operator Nexus

Industry solutions

Product Guidance
Microsoft Community Training Reliability in Microsoft Community Training

Integration

Product Guidance
Azure API for FHIR® Disaster recovery for Azure API for FHIR
Azure API Management Ensure API Management availability and reliability

How to implement disaster recovery using service backup and restore
Azure Data Manager for Energy Reliability in Azure Data Manager for Energy
Azure Data Factory Azure Data Factory data redundancy
Azure Event Grid Reliability in Azure Event Grid
Azure Functions Reliability in Azure Functions
Azure Health Data Services: De-identification service (preview) Reliability in Azure Health Data Services: De-Identification service
Azure Health Data Services: Workspace services (FHIR®, DICOM®, MedTech) Business continuity and disaster recovery considerations
Azure Logic Apps Protect logic apps from region failures with zone redundancy and availability zones

Business continuity and disaster recovery for Azure Logic Apps
Azure Service Bus Best practices for insulating applications against Service Bus outages and disasters

Internet of Things

Product Guidance
Azure Device Registry Reliability in Azure Device Registry
Azure IoT Hub IoT Hub high availability and disaster recovery
Azure Notification Hubs Reliability in Azure Notification Hubs

Media

Product Guidance
Azure Media Services High Availability with Media Services and Video on Demand (VOD)

Management and governance

Product Guidance
Azure Backup Reliability in Azure Backup
Azure Guest Configuration Azure Guest Configuration Availability
Azure Monitor-Log Analytics Enhance data and service resilience in Azure Monitor Logs with availability zones

Log Analytics workspace replication
Azure Site Recovery Set up disaster recovery for Azure VMs

Migration

Product Guidance
Azure Migrate Does Azure Migrate offer Backup and Disaster Recovery?
Azure Site Recovery Set up disaster recovery for Azure VMs

Networking

Product Guidance
Azure Application Gateway (V2) Autoscaling and High Availability
Azure Application Gateway for Containers Reliability in Azure Application Gateway for Containers
Azure Bastion Reliability in Azure Bastion
Azure Communications Gateway Reliability in Azure Communications Gateway
Azure DNS Reliability in Azure DNS
Azure DDoS Protection Reliability in Azure DDoS Protection
Azure ExpressRoute Designing for high availability with ExpressRoute

Designing for disaster recovery with ExpressRoute private peering
Azure Firewall Deploy an Azure Firewall with Availability Zones using Azure PowerShell
Azure Load Balancer Reliability in Azure Load Balancer
Azure Network Watcher Azure Network Watcher service availability and redundancy
Azure Private Link Azure Private Link availability
Azure Public IP Azure Public IP Availability Zone
Azure Route Server Azure Route Server frequently asked questions (FAQ)
Azure Traffic Manager Reliability in Azure Traffic Manager
Azure Virtual Network Virtual networks and availability zones

Virtual Network – Business Continuity
Azure Virtual WAN How are Availability Zones and resiliency handled in Virtual WAN?

Disaster recovery design
Azure VPN Gateway About zone-redundant virtual network gateway in Azure availability zones

Highly Available cross-premises and VNet-to-VNet connectivity
Azure Private 5G Core Reliability in Azure Private 5G Core
Azure Web Application Firewall Deploy an Azure Firewall with Availability Zones using Azure PowerShell

How do I achieve a disaster recovery scenario across datacenters by using Application Gateway?

Security

Product Guidance
Azure Disk Encryption Redundancy options for managed disks
Azure Firewall Deploy an Azure Firewall with Availability Zones using Azure PowerShell
Azure Key Vault Azure Key Vault availability and redundancy
Azure Web Application Firewall Deploy an Azure Firewall with Availability Zones using Azure PowerShell

How do I achieve a disaster recovery scenario across datacenters by using Application Gateway?

Storage

Product Guidance
Azure Backup Reliability in Azure Backup
Azure Blob Storage Choose the right redundancy option

Azure storage disaster recovery planning and failover
Azure Databox How can I recover my data if an entire region fails?
Azure Elastic SAN Reliability in Azure Elastic SAN
Azure NetApp Files Manage disaster recovery using Azure NetApp Files
Azure Storage Actions Reliability in Azure Storage Actions
Azure Storage-Disk Storage Best practices for achieving high availability with Azure virtual machines and managed disks
Azure Storage Mover Reliability in Azure Storage Mover

Web

Product Guidance
Azure AI Search Reliability in Azure AI Search
Azure API Management Ensure API Management availability and reliability

How to implement disaster recovery using service backup and restore
Azure App Service Reliability in Azure App Service
Azure Container Apps Reliability in Azure Container Apps
Azure Notification Hubs Reliability in Azure Notification Hubs
Azure SignalR Service Resiliency and disaster recovery in Azure SignalR Service