RTO for Cosmos DB Service-managed failover (regional outage)

metalheart 361 Reputation points
2022-09-26T08:57:02.53+00:00

In a two-region account with one write region, what is the RTO for a service-managed failover during a regional outage?

Azure Cosmos DB
Azure Cosmos DB
An Azure NoSQL database service for app development.
1,442 questions
0 comments No comments
{count} votes

Accepted answer
  1. GeethaThatipatri-MSFT 27,337 Reputation points Microsoft Employee
    2022-09-26T23:56:12.72+00:00

    Hi, @metalheart Welcome to the Microsoft Q&A platform, and thanks for using Azure Services.
    If I understand correctly you want to know the RTO in case of service-managed failover for the write region.

    Expected and maximum RPOs and RTOs depend on the kind of outage that Cosmos DB is experiencing. For instance, an outage of a single node will have a different expected RTO and RPO than a whole region outage.
    Please refer to this doc for a better understanding High availability in Azure Cosmos DB | Microsoft Learn

    244973-image.png

    The time to complete the failover is essentially dependent on the number of physical partitions for the account. So an account that has 1000 physical partitions will take longer than an account with 500 partitions. The volume of data stored in the partitions does not matter and it's purely a function of the physical partition count.

    That said RTO is also dependent on the time taken to trigger the failover, which is subjective and this is why we don't have published SLAs on the RTOs for Service-Managed failover.

    I hope this information helps, please let me know if you are looking for additional information.

    Regards
    Geetha


0 additional answers

Sort by: Most helpful