Compare CSV or Parquet files and identify delta records

Dondapati, Navin 291 Reputation points
2021-11-04T03:11:36.407+00:00

Hi Guys,

We load files into Azure Data Lake Store every few hours. We want to compare the current file with the previous one, identify the delta the way a MERGE statement would, and generate a new file containing the primary key and an action-type flag, as below:

0 - insert
1 - update
2 - delete

Something like CDC on file data. Any thoughts?

Bear in mind that loading the data into a database is not an option.

Regards,
Navin
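
For illustration, the comparison described in the question could be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the approach given in the answer: it uses only the Python standard library, it assumes both files are CSV and fit in memory, and it assumes a single primary-key column. The column name `id`, the sample rows, and the helper names `load_by_key`, `compute_delta`, and `write_delta` are all hypothetical. Parquet input would need a separate reader, and reading from the lake itself is not shown.

```python
import csv
import io

# Action flags as defined in the question: 0 = insert, 1 = update, 2 = delete.
INSERT, UPDATE, DELETE = 0, 1, 2


def load_by_key(csv_file, key):
    """Read CSV rows into a dict keyed by the primary-key column."""
    return {row[key]: row for row in csv.DictReader(csv_file)}


def compute_delta(previous, current):
    """Return sorted (key, action) pairs describing how current differs from previous."""
    delta = []
    for k, row in current.items():
        if k not in previous:
            delta.append((k, INSERT))   # key only exists in the new file
        elif row != previous[k]:
            delta.append((k, UPDATE))   # key exists in both, some column changed
    for k in previous:
        if k not in current:
            delta.append((k, DELETE))   # key disappeared from the new file
    return sorted(delta)                # keys are strings, so the order is lexicographic


def write_delta(delta, out_file, key):
    """Write the delta as a CSV with the primary key and the action flag."""
    writer = csv.writer(out_file)
    writer.writerow([key, "action"])
    writer.writerows(delta)


# Hypothetical sample data standing in for the previous and current files.
previous_csv = "id,name\n1,alice\n2,bob\n3,carol\n"
current_csv = "id,name\n1,alice\n2,robert\n4,dave\n"

previous = load_by_key(io.StringIO(previous_csv), "id")
current = load_by_key(io.StringIO(current_csv), "id")

delta = compute_delta(previous, current)
out = io.StringIO()
write_delta(delta, out, "id")
print(out.getvalue())
```

With the sample data, key 2 is flagged as an update, key 3 as a delete, and key 4 as an insert, while the unchanged key 1 is omitted. For files too large for memory, the same key-based full outer comparison would have to run in a distributed engine instead.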

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

1 answer

  1. MarkKromer-MSFT 5,226 Reputation points Microsoft Employee Moderator
    2021-11-04T17:15:44.363+00:00
    1 person found this answer helpful.
