Load CSV to ADLS by Date

Galaxy 21 Reputation points
2020-10-23T21:32:23.197+00:00

I have a large CSV file that contains a extract of records dating back to year 2016. The requirement is to copy the data into azure data lake but in year/month/day.csv format. What is the best approach to implement this? I am aware of how to achieve the write on the sink (ADLS) but am not sure of how to parse the CSV to obtain the records by day using azure data factory.

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
0 comments No comments
{count} votes

Answer accepted by question author
  1. Kiran-MSFT 696 Reputation points Microsoft Employee Moderator
    2020-10-24T19:15:47.833+00:00

    You can achieve this by dataflow. Split the date column in each row into individual parts using a derived tx. Then use the key partition feature on the sink to split this by year/month/day on storage

    0 comments No comments

2 additional answers

Sort by: Most helpful
  1. HimanshuSinha 19,547 Reputation points Microsoft Employee Moderator
    2020-10-27T00:24:51.827+00:00

    Hello @Galaxy ,

    Just adding to what Kiran said , you need to derive the month,yaar & day from the date colum .
    I am assuming that you want the folder structure as Year-> Month -> Day.file . You can use the below expression to create the expression ('\' is requirted for
    folder structure )

    concat( toString(year(Date)),'/',toString(month(Date)),'/',toString(dayOfMonth(Date)))

    Also on the derived column option -> select key parition -> select the derived column

    On the sink side I have also addded the Setting -> As data in column -> Selected the derived column

    The below gif should be helful .

    35231-partitiondataondate.gif

    Thanks Himanshu
    Please do consider to click on "Accept Answer" and "Up-vote" on the post that helps you, as it can be beneficial to other community members

    1 person found this answer helpful.

  2. Galaxy 21 Reputation points
    2020-11-03T23:43:47.467+00:00

    Thank You @Kiran-MSFT and @HimanshuSinha for your responses. I wish I could accept both answers.

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.