Remove Duplicate Row from xlsx using azure adf

Question

A community member has associated this post with a similar question:
Remove duplicate rows from xlsx in Azure Data Factory

Only moderators can edit this content.

Remove Duplicate Row from xlsx using azure adf

Dinesh Prajapati 126

hi team,

I want to remove duplicate rows from xlsx via azure adf. It should work like if the data of all the columns of row 1 matches with all the data of all the columns of row2, then any one of the row should be removed. I read on ms site and it says use aggregrate activity to do that, but how can I use dynamically. As it states that I have to select column in group by manually (in debug mode), which I do not want to, as there are multiple files that would be running through the same pipeline.

Following error occurs which is valid as I have off the debug mode in data flow:
User's image

Deleted

This comment has been deleted due to a violation of our Code of Conduct. The comment was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.

Deleted

This comment has been deleted due to a violation of our Code of Conduct. The comment was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.

Share via

Remove Duplicate Row from xlsx using azure adf