A community member has associated this post with a similar question:
Remove duplicate rows from xlsx in Azure Data Factory

Only moderators can edit this content.

Remove Duplicate Row from xlsx using azure adf

Dinesh Prajapati 126 Reputation points
2023-03-10T15:33:54.9533333+00:00

hi team,

I want to remove duplicate rows from xlsx via azure adf. It should work like if the data of all the columns of row 1 matches with all the data of all the columns of row2, then any one of the row should be removed. I read on ms site and it says use aggregrate activity to do that, but how can I use dynamically. As it states that I have to select column in group by manually (in debug mode), which I do not want to, as there are multiple files that would be running through the same pipeline.

Following error occurs which is valid as I have off the debug mode in data flow:
User's image

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
{count} votes