Azure datalake creation methods

Anshal 2,251 Reputation points
2024-07-24T10:24:14.9566667+00:00

Hi friends, we need to create a datalake (ADLS) with bronze, silver, and gold layers. One of our colleagues suggested it is a better idea to create ADLS through programming, but we plan to create it manually. Are there any advantages of each approach and what is the general and most used approach?

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,480 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,823 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. Babafemi Bulugbe 3,705 Reputation points MVP
    2024-07-24T12:28:47.9933333+00:00

    Hello Anshal,

    Thank you for posting your query in the Microsoft Q&A Community.

    Please be informed that approach depends on your organization’s best practices. Most Organization use both. Manual setup are used during initial development and testing (It’s an opportunity to learn about ADLS features, security, and best practices.) while programmatic setup are used for consistency, scalability, and automation.

    You can integrate ADLS setup into your deployment pipelines (e.g., Azure DevOps, Terraform, ARM templates when you leverage on automation

    Let me know if further assistance is needed.

    Babafemi

    0 comments No comments

  2. KarishmaTiwari-MSFT 20,207 Reputation points Microsoft Employee
    2024-07-26T18:48:18.1633333+00:00

    @Anshal

    Manual approach advantages:
    You get direct control over the creation and configuration of your ADLS resources, which can be beneficial for understanding the setup and configuration. Manual creation can be a good learning experience, especially if you're new to Azure and its ecosystem.

    Programmatic Approach advantages:
    Programmatic approaches (using Azure CLI, PowerShell, or ARM templates) allow for automated and repeatable deployments, which is ideal for consistency and scalability. More efficient for large-scale projects as it can handle complex structures and large volumes of data. Easier to integrate with CI/CD pipelines and other automated workflows.

    The most common approach in the industry is to use programming for creating and managing ADLS. This is because it offers greater flexibility, scalability, and efficiency, especially for larger projects. However, a manual approach might be simpler for small-scale or one-time setups.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.