A community member has associated this post with a similar question:
Remove duplicate rows from xlsx in Azure Data Factory
Only moderators can edit this content.
Remove Duplicate Row from xlsx using azure adf
hi team,
I want to remove duplicate rows from xlsx via azure adf. It should work like if the data of all the columns of row 1 matches with all the data of all the columns of row2, then any one of the row should be removed. I read on ms site and it says use aggregrate activity to do that, but how can I use dynamically. As it states that I have to select column in group by manually (in debug mode), which I do not want to, as there are multiple files that would be running through the same pipeline.
Following error occurs which is valid as I have off the debug mode in data flow: