Delta Lake Data lake design

Anshal 2,251 Reputation points
2022-12-15T05:22:14.097+00:00

Hi friends, we are designing a data lake with three stages: raw, silver, and gold. My question is: do we also need a staging layer? What are its advantages, and is it compulsory?

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.

Accepted answer
  1. PRADEEPCHEEKATLA 90,661 Reputation points Moderator
    2022-12-15T09:23:47.167+00:00

    Hello @Anshal ,

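    Thanks for the question and for using the Microsoft Q&A platform.

    In data lake terminology, these names are commonly used as synonyms: raw layer = staging = bronze = landing zone. So a separate staging layer is not compulsory in addition to the raw layer.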

    The Bronze layer is where we land all the data from source systems. The table structures in this layer correspond to the source system table structures “as-is,” aside from optional metadata columns that can be added to capture the load date/time, process ID, etc.
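
To illustrate the optional metadata columns mentioned above, here is a minimal sketch in plain Python. The column names `_load_ts` and `_process_id`, the helper `add_load_metadata`, and the sample rows are assumptions made for illustration, not part of any Azure or Delta Lake API.

```python
from datetime import datetime, timezone
import uuid


def add_load_metadata(rows, process_id=None, loaded_at=None):
    """Return copies of the source rows with audit columns appended.

    The source fields are kept as-is; only the two hypothetical
    metadata columns (_load_ts, _process_id) are added.
    """
    process_id = process_id or str(uuid.uuid4())
    loaded_at = loaded_at or datetime.now(timezone.utc)
    stamp = loaded_at.isoformat()
    return [{**row, "_load_ts": stamp, "_process_id": process_id} for row in rows]


# Hypothetical rows as they might arrive from a source system.
source_rows = [
    {"customer_id": 1, "name": "Asha"},
    {"customer_id": 2, "name": "Ben"},
]
bronze_rows = add_load_metadata(source_rows, process_id="run-001")
```

The source columns pass through unchanged, so the Bronze table still mirrors the source structure, and the extra columns only record when and by which run each row was loaded.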

    In most cases, it's a good idea to keep the data in the Bronze layer in Delta format, so that subsequent reads from the Bronze layer for ETL are performant and so that you can apply updates in Bronze to write CDC (change data capture) changes. Sometimes, when data arrives in JSON or XML format, we see customers land it in the original source format and then stage it by converting it to Delta format. In those cases, the logical Bronze layer is split into a physical landing zone and a physical staging zone.
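
As a conceptual illustration of what "writing CDC changes" into Bronze means, the sketch below applies insert, update, and delete events to an in-memory table keyed by primary key. It is plain Python, not the Delta Lake API (in Delta this kind of upsert is typically expressed as a MERGE). The event shape (`op`, `row`) and the function name `apply_cdc` are assumptions for illustration.

```python
def apply_cdc(table, changes, key="id"):
    """Apply ordered change events to a dict keyed by primary key.

    Returns a new dict; the input table is not modified.
    """
    result = dict(table)
    for change in changes:
        op, row = change["op"], change["row"]
        if op in ("insert", "update"):
            # Upsert: merge the new field values over any existing row.
            result[row[key]] = {**result.get(row[key], {}), **row}
        elif op == "delete":
            result.pop(row[key], None)
        else:
            raise ValueError(f"unknown CDC operation: {op!r}")
    return result


# Hypothetical Bronze contents and a batch of change events.
bronze = {1: {"id": 1, "city": "Pune"}, 2: {"id": 2, "city": "Oslo"}}
changes = [
    {"op": "update", "row": {"id": 1, "city": "Mumbai"}},
    {"op": "delete", "row": {"id": 2}},
    {"op": "insert", "row": {"id": 3, "city": "Lima"}},
]
bronze_after = apply_cdc(bronze, changes)
```

A format that supports row-level updates and deletes is what makes this pattern practical at the Bronze layer. With raw JSON or XML files, the same change would mean rewriting whole files.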

    It's important to plan your data structure before you land it into a data lake. Having a plan helps you use security, partitioning, and processing effectively.
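
One way to make such a plan concrete is to fix the folder convention up front. The sketch below builds a date-partitioned path per zone, source system, and entity. The layout (`zone/source/entity/year=/month=/day=`) and all names used are assumptions for illustration, not a prescribed Azure Data Lake Storage structure.

```python
from datetime import date
from pathlib import PurePosixPath


def landing_path(zone, source_system, entity, load_date):
    """Build a date-partitioned folder path for one entity in one zone."""
    return PurePosixPath(
        zone,
        source_system,
        entity,
        f"year={load_date:%Y}",
        f"month={load_date:%m}",
        f"day={load_date:%d}",
    )


# Hypothetical zone, source system, and entity names.
path = landing_path("bronze", "crm", "customers", date(2022, 12, 15))
print(path)  # bronze/crm/customers/year=2022/month=12/day=15
```

Keeping the zone and source system at the top of the path makes it easier to scope access permissions per folder, while the date folders let downstream jobs read only the partitions they need.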

    For more details, refer to Data lake zones and containers.

    Hope this helps. Please let us know if you have any further queries.

