Azure datafactory Datalake or Synapse Datalake

Anshal 2,006 Reputation points
2023-04-25T15:06:48.0833333+00:00

Hi friends, I want to know the better approach between Azure data factory Datalake or Synapse Data Lake in developing a Data lake. In this case, I must choose Azure data factory Datalake or Synapse Data Lake which includes cost, scalability, and security. Is it better to build ADLS with has RAW zone and staging zone in ADF ADLS and store curated data in Synapse? What is the best approach?

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,402 questions
Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,573 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
9,945 questions
{count} votes

Accepted answer
  1. PRADEEPCHEEKATLA-MSFT 83,306 Reputation points Microsoft Employee
    2023-04-27T08:38:21.2966667+00:00

    @Anshal - Thanks for the question and using MS Q&A platform.

    Both Azure Data Factory (ADF) Data Lake and Azure Synapse Analytics Data Lake are excellent options for building a data lake in Azure. The choice between them depends on your specific requirements and use case.

    Azure Synapse Analytics is a fully managed analytics service that brings together big data and data warehousing. It offers seamless integration with Azure Data Factory, Azure Data Lake Storage, Azure Blob Storage, and other Azure services. Synapse Analytics Data Lake provides advanced security features such as Azure Active Directory integration, firewall, and virtual network support. It also offers powerful analytics capabilities like SQL Serverless, Apache Spark, and Power BI. With Synapse Analytics, you can build a data lake that includes raw, curated, and enriched data zones.

    Azure Data Factory, on the other hand, is a cloud-based data integration service that allows you to create, schedule, and orchestrate data pipelines. It provides support for various data sources and destinations, including Azure Data Lake Storage Gen1 and Gen2. ADF Data Lake provides a cost-effective option for building a data lake that includes raw and staging zones. You can also use ADF Data Lake to transform and move data between zones.

    In terms of cost, ADF Data Lake may be a more cost-effective option for building a data lake. However, Synapse Analytics Data Lake provides more advanced analytics capabilities and security features.

    As for the best approach, it depends on your specific requirements and use case. If you need advanced analytics capabilities and security features, Synapse Analytics Data Lake may be the better option. If you need a cost-effective solution for building a data lake that includes raw and staging zones, ADF Data Lake may be the better option.

    Regarding your specific scenario, building a data lake with raw and staging zones in ADF Data Lake and storing curated data in Synapse Analytics Data Lake is a good approach. This allows you to take advantage of the cost-effective data integration capabilities of ADF Data Lake and the advanced analytics capabilities and security features of Synapse Analytics Data Lake.

    Hope this helps. Do let us know if you any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful