ADF pipeline lookup distinct rows from csv

arkiboys 9,696 Reputation points
2022-04-26T10:16:12.533+00:00

1- lookup1 reads csv rows --> .csv may have duplicate rows
2- foreach1 is fed from lookup1
3- inside foreach1, there is dataflow1 which and copy activity which processes data
...

Question:
How is it possible to get only the distinct rows from .csv and then proceed?
is it possible for the lookup to get distinct rows only?

Thanks

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,986 questions
0 comments No comments
{count} votes

Accepted answer
  1. Nandan Hegde 32,831 Reputation points MVP
    2022-04-26T10:23:51.117+00:00

    Hey,
    Based on my understanding, you would need to use dataflow for that distinct rows since your lookup is a file rather than SQL .
    https://www.cloudfronts.com/azure/adfs-mapping-data-flows-how-do-you-get-distinct-rows-and-rows-count-from-the-data-source/

    @Nasreen Akter : any thoughts on this ?

    0 comments No comments

1 additional answer

Sort by: Most helpful
  1. Samy Abdul 3,376 Reputation points
    2022-04-26T10:23:14.153+00:00
    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.