RPO/RTO for storage account and Databricks

Bex Starr 81 Reputation points
2022-09-09T17:02:50.347+00:00

I am trying to understand the RPO/RTO for storage accounts for both customer-controlled failure and Microsoft failure:

Standard storage account configured with GRS/RA-GRS or GZRS/RA-GZRS?

Same question for Databricks configured for an active-passive DR strategy, where CI/CD pipelines perform parallel deployment to both primary and secondary regions. Deployments will be for Control Plane metadata. Our DR solution only really needs to accommodate automated processes and not interactive ones.

For high-availability, I assume Microsoft handles all hardware failures for the Data Plane - do hardware failures (nodes in the cluster and data center failures) remain transparent to the customer?

I found this post, but it doesn't answer my question: https://learn.microsoft.com/en-us/answers/questions/440593/what-is-the-commitment-of-blob-storage-rto-rlo-and.html
https://techcommunity.microsoft.com/t5/azure-storage-blog/understanding-azure-storage-redundancy-offerings/ba-p/1431700

Azure Storage
Azure Storage
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
3,542 questions
Azure Blob Storage
Azure Blob Storage
An Azure service that stores unstructured data in the cloud as blobs.
3,201 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,534 questions
0 comments No comments
{count} votes

Accepted answer
  1. SaiKishor-MSFT 17,336 Reputation points
    2022-09-09T19:44:35.607+00:00

    @Bex Starr Thank you for reaching out to Microsoft Q&A. I understand that you want to know the RPO/RTO time for storage accounts.

    As mentioned here- https://learn.microsoft.com/en-us/azure/storage/common/storage-redundancy#redundancy-in-a-secondary-region

    "The Azure Storage platform typically has an RPO of less than 15 minutes, although there's currently no SLA on how long it takes to replicate data to the secondary region."

    Wrt RTO- The time it takes to failover after initiation can vary though typically less than one hour.

    Please refer to- https://learn.microsoft.com/en-us/azure/storage/common/storage-initiate-account-failover?tabs=azure-portal#important-implications-of-account-failover

    Hope this helps. Wrt to Databricks, I will reach out to the Databricks team for more details regarding this. In the meanwhile, if you have any further questions, please do let us know.

    Thank you!

    Remember:

    Please accept an answer if correct. Original posters help the community find answers faster by identifying the correct answer. Here is how.

    Want a reminder to come back and check responses? Here is how to subscribe to a notification.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.