How to get the content MD5 values of merged files?

Ashwin Shankara Lingam 0 Reputation points
2023-12-21T08:36:16.0766667+00:00

Iam trying to read multiple CSV files form a source folder in ADLS and make them into one single file and put in a target folder.

Iam using ADF Copy activity (merge settings) to achieve this. Merged files look proper in terms of data but the Content MD5 of the merged file is empty. Although all the files in source location had Content MD5 values in ADLS Portal.

Source Folder

File1.csv (Have Content MD5)

File2.csv (Have Content MD5)

File3.csv (Have Content MD5)

Target Folder

Merged file.csv (Doesn't have Content MD5)

ADF Copy activity read this, merge them and place it in target folder

This is a big blocker for us to consume the data since the application consume the data require the Content MD5 for various reasons.

Main issue : Files merged using ADF Copy activity doesn't have Content MD5 value in ADLS.

Azure Data Lake Storage
Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
1,559 questions
Azure Storage Accounts
Azure Storage Accounts
Globally unique resources that provide access to data management services and serve as the parent namespace for the services.
3,523 questions
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,608 questions
{count} votes

1 answer

Sort by: Most helpful
  1. ShaikMaheer-MSFT 38,546 Reputation points Microsoft Employee Moderator
    2023-12-27T06:23:12.74+00:00

    Hi
    Ashwin Shankara Lingam
    ,

    Thank you for posting query in Microsoft Q&A Platform.

    I believe all the files in the folder has same schema. Kindly create a dataset which points to folder, don't point to any file, and use same as Source dataset in the copy activity.

    Hope this helps. If not, kindly share your implementation details, such as source and sink dataset details and configurations of copy activity under source and sink tabs.


    Please consider hitting Accept Answer button. Accepted answers help community as well. Thank you.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.