Databox more than 600 tb data

Anshal 2,251 Reputation points
2024-07-12T10:22:05.6466667+00:00

Hi friends, we have to migrate over large amounts of data and consider Azure databox heavy after considering various other options. I have the following questions:

  1. Since it is a physical migration, is it a best practice to copy it into the blob, validate it, and then move it into the bronze layer? or it is better to directly move it into Bronz layer?
  2. In validation parts what is a comprehensive and effective validation strategy? Which aspects it should cover?
  3. How long does the process of databox heavy take in days?

If I am missing any key points, please cover that too.

Azure Data Box
Azure Data Box
A family of appliances and solutions for offline data transfer to Azure​.
42 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,872 questions
0 comments No comments
{count} votes

Accepted answer
  1. Azar 22,950 Reputation points MVP
    2024-07-12T10:36:01.8266667+00:00

    Hi there Anshal

    Thanks for using QandA platform

    I would say the Best practice is to copy the data into Azure Blob Storage first, validate it, and then move it to the bronze layer.

    Implement steps lik checksum verification, file count and size matching, sample data validation, and metadata consistency checks.

    The typical duration for the entire Databox Heavy process is 10-20 days, including 2-5 days for preparation, 3-7 days for shipping, and 2-5 days for data upload. my guess.

    If this helps kindly accept the response thanks much.

    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.