@Anshal - Thanks for the question and for using the MS Q&A platform.
Storing data in both Azure Data Lake Storage and Azure Synapse Analytics can indeed lead to duplication of data and increased costs. However, it is not necessarily an incorrect architecture.
Azure Data Lake Storage is a highly scalable and cost-effective data lake solution that can store and process large amounts of data. It is designed for big data analytics workloads and provides features such as hierarchical namespace, POSIX-compliant access control, and support for multiple file formats.
Azure Synapse Analytics, on the other hand, is an analytics service that brings together big data and data warehousing. It provides a unified experience for data ingestion, preparation, management, and serving. It also provides features such as data integration, data warehousing, and big data analytics.
In some cases, it may make sense to store data in both Azure Data Lake Storage and Azure Synapse Analytics. For example, you may want to use Azure Data Lake Storage as a landing zone for raw data, and then use Azure Synapse Analytics to transform and analyze the data. Alternatively, you may want to use Azure Synapse Analytics as a data warehouse for structured data, and use Azure Data Lake Storage for unstructured data such as log files or images.
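To make the landing-zone pattern a bit more concrete, here is a minimal sketch in Python. It does not call any Azure service: a temporary local folder stands in for the Azure Data Lake Storage container, and an in-memory SQLite database stands in for the Synapse data warehouse. The function names, the `sales.csv` file, and the `sales_by_region` table are all hypothetical and only for illustration. The point it tries to show is that the raw file is kept as-is in the lake, while only the cleaned and aggregated result is loaded into the warehouse, so the two stores hold different shapes of the data rather than identical copies.

```python
import csv
import sqlite3
import tempfile
from pathlib import Path


def land_raw_file(lake_root: Path, name: str, rows: list[dict]) -> Path:
    """Write raw rows unchanged into the 'raw' zone of the stand-in data lake."""
    raw_dir = lake_root / "raw"
    raw_dir.mkdir(parents=True, exist_ok=True)
    path = raw_dir / name
    with path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
    return path


def load_curated(warehouse: sqlite3.Connection, raw_path: Path) -> int:
    """Clean and aggregate the raw file, then load only the result into the warehouse."""
    totals: dict[str, float] = {}
    with raw_path.open(newline="") as f:
        for row in csv.DictReader(f):
            if not row["amount"]:
                continue  # skip rows with a missing amount
            region = row["region"]
            totals[region] = totals.get(region, 0.0) + float(row["amount"])
    warehouse.execute(
        "CREATE TABLE IF NOT EXISTS sales_by_region "
        "(region TEXT PRIMARY KEY, total REAL)"
    )
    warehouse.executemany(
        "INSERT OR REPLACE INTO sales_by_region VALUES (?, ?)",
        sorted(totals.items()),
    )
    warehouse.commit()
    return len(totals)


def run_demo() -> list[tuple]:
    """Land a small raw file, load the curated table, and return its rows."""
    rows = [
        {"region": "east", "amount": "10.5"},
        {"region": "west", "amount": "4"},
        {"region": "east", "amount": ""},  # malformed row, stays in raw only
        {"region": "east", "amount": "2.5"},
    ]
    with tempfile.TemporaryDirectory() as tmp:
        raw_path = land_raw_file(Path(tmp), "sales.csv", rows)
        warehouse = sqlite3.connect(":memory:")
        load_curated(warehouse, raw_path)
        result = warehouse.execute(
            "SELECT region, total FROM sales_by_region ORDER BY region"
        ).fetchall()
        warehouse.close()
        return result


if __name__ == "__main__":
    print(run_demo())
```

In a real setup the same split would apply: the lake keeps the full raw history cheaply, and the warehouse holds only the modeled tables that your reports actually query.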
However, if you are duplicating data unnecessarily, it can lead to increased costs and complexity. It is important to carefully consider your data storage and processing requirements, and choose the appropriate solution(s) based on your needs.
Hope this helps. Do let us know if you have any further queries.
If this answers your query, do click Accept Answer and Yes for "Was this answer helpful".