How to choose the Data Archive for 400-700 GB documents for 7 years of cold storage period; Which is best in Azure cloud ?

Thaenraj Packiamani 20 Reputation points
2023-05-17T22:24:44.31+00:00

The customer plans to use either Azure BLOB storage, Azure ADLS Gen2,

or Cosmos DB, or a combination of Azure BLOB and Cosmos DB to store their JSON documents.

They need to retain the data for 7 years at a cold service level.

What are the best practices for archiving and retrieving these JSON documents along

with their metadata, which are used to locate them? The retrieval process should handle pull

requests for a maximum of 50-100 documents with an SLA of 1 day.

Which storage option would be more cost-effective in meeting this SLA?.

Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
3,523 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Carlos Solís Salazar 18,191 Reputation points MVP Volunteer Moderator
    2023-05-18T05:38:50.56+00:00

    Thank you for asking this question on the Microsoft Q&A Platform.

    Archive tier for blob storage can meet your requirements:

    Data in the Archive tier can take up to 15 hours to rehydrate, depending on the priority you specify for the rehydration operation

    But you must consider the following:

    • Data must remain in the Archive tier for at least 180 days or be subject to an early deletion charge.
    • A blob in the Archive tier is offline. That is, it cannot be read or modified until it is rehydrated

    Source: https://learn.microsoft.com/en-us/azure/storage/blobs/archive-blob?tabs=azure-portal

    If you require an online tier, consider any of these depending on the frequency of access to the data:

    • Hot tier - An online tier optimized for storing data that is accessed or modified frequently. The hot tier has the highest storage costs, but the lowest access costs.
    • Cool tier - An online tier optimized for storing data that is infrequently accessed or modified. Data in the cool tier should be stored for a minimum of 30 days. The cool tier has lower storage costs and higher access costs compared to the hot tier.
    • Cold tier - An online tier optimized for storing data that is infrequently accessed or modified. Data in the cold tier should be stored for a minimum of 90 days. The cold tier has lower storage costs and higher access costs compared to the cool tier.

    Source: https://learn.microsoft.com/en-us/azure/storage/blobs/access-tiers-overview?tabs=azure-portal

    Hope this helps!


    Accept Answer and Upvote, if any of the above helped, this thread can help others in the community looking for remediation for similar issues.

    NOTE: To answer you as quickly as possible, please mention me in your reply.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.