Share via

Azure Data Flow compare two string with lookup

youssef125 271 Reputation points
2021-12-02T11:24:42.67+00:00

I'm using Azure Data flow to do some transformation on the data but I'm facing some challenges. I have a use case where I have two streams, these two streams have some common data, and what I'm looking for is to output the common data between these two streams. I do matching data with some common fields(product_name(string) and brand(string)), I have not got ID. to do the matching, I picked lookup activity and I tried to compare the brand in the two streams, but THE RESULT IS NOT CORRECT because for example:

left stream : the brand = Estēe Lauder

right stream. : the brand = Estée Lauder

for me this is the same brand, but they have different text format, I wanted to use 'like' operator but lookup activity does not support it, I'm using '==' operator to compare.

is there a way to override this problem please ?
154442-capture-decran-2021-12-02-a-121651.png

Azure Databricks
Azure Databricks

An Apache Spark-based analytics platform optimized for Azure.

Azure Data Factory
Azure Data Factory

An Azure service for ingesting, preparing, and transforming data at scale.


Answer accepted by question author

Saurabh Sharma 23,866 Reputation points Microsoft Employee Moderator
2021-12-22T02:36:27.4+00:00

Hi @youssef125 ,
Sorry for late reply. Here is the information I have received internally which could try for your scenario -
If you’re looking to add columns from the right side to the left side and need to use a custom expression (i.e., like) then use a Join instead with a custom Join expression.
Or, if you are looking simply to see if the value exists on the right side or not, then use either Exists or Assert. Both also support custom expressions.
Please let me know how it goes or if you have any questions.

Thanks
Saurabh

Was this answer helpful?


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.