Dynamic Partition based on date value - Split and Post it in data-lake

GOVIND RAJ 41 Reputation points
2022-07-26T13:21:26.143+00:00

Hi, In On-premises, I have a table with a size of around 100GB (5 years of data). I intend to upload the 100GB table as a parquet file to the data lake storage account. While moving data to the storage account, Table data must be split and copied to the parquet file based on the date column in the table. Is it possible to do this dynamically in data factory?

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,624 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Subashri Vasudevan 11,226 Reputation points
    2022-07-26T16:14:39.407+00:00

    Hi @GOVIND RAJ ,

    Thanks for using the MS Q&A portal for posting your query.

    Please check this blog. Instead of round robin partition, you can choose key based partition and mention the date column.

    To know more about partitioning schemes: check here

    Please let us know if you have any question

    Thanks


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.