Mapping data flow vs. Spark pool

arkiboys 9,711 Reputation points
2021-11-11T09:14:54.85+00:00

Hi,
I understand that Spark can be used for the same tasks as mapping data flows.
To me, using Spark inside a Synapse workspace is similar to using C# inside SSIS.
Questions:
1- If someone knows mapping data flows really well, do they still need to learn Spark?
2- Is my assumption correct that Spark in Synapse is similar to C# in SSIS?

Any other thoughts?

Thank you

Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

Answer accepted by question author

MarkKromer-MSFT 5,231 Reputation points Microsoft Employee Moderator
2021-11-11T20:13:11.61+00:00

That's a pretty decent analogy. In Synapse pipelines (ADF), you can author a no-code data transformation job using Data Flows, or you can write custom code in a Spark notebook and then execute it from a Notebook activity in a pipeline (the control flow).
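To make the two options concrete, here is a rough sketch of how a pipeline definition might reference both a data flow and a Spark notebook. It is written in Python only to build and print the JSON. The activity type names (`ExecuteDataFlow`, `SynapseNotebook`) and reference types follow my understanding of the Synapse pipeline JSON format and should be checked against a pipeline exported from your own workspace. The pipeline, data flow, notebook, and parameter names are all hypothetical.

```python
import json


def data_flow_activity(name, data_flow_name):
    """Build an activity that runs a no-code mapping data flow.

    The type strings are assumptions based on the Synapse pipeline JSON format.
    """
    return {
        "name": name,
        "type": "ExecuteDataFlow",
        "dependsOn": [],
        "typeProperties": {
            "dataflow": {
                "referenceName": data_flow_name,
                "type": "DataFlowReference",
            },
        },
    }


def notebook_activity(name, notebook_name, parameters=None):
    """Build an activity that runs custom code in a Spark notebook.

    Parameters are passed as strings here to keep the sketch simple.
    """
    return {
        "name": name,
        "type": "SynapseNotebook",
        "dependsOn": [],
        "typeProperties": {
            "notebook": {
                "referenceName": notebook_name,
                "type": "NotebookReference",
            },
            "parameters": {
                key: {"value": value, "type": "string"}
                for key, value in (parameters or {}).items()
            },
        },
    }


def pipeline(name, activities):
    """Wrap a list of activities in a minimal pipeline definition."""
    return {"name": name, "properties": {"activities": activities}}


# Hypothetical pipeline: one data flow step and one notebook step.
example = pipeline(
    "TransformOrders",
    [
        data_flow_activity("CleanOrders", "df_clean_orders"),
        notebook_activity("ScoreOrders", "nb_score_orders", {"run_date": "2021-11-11"}),
    ],
)

if __name__ == "__main__":
    print(json.dumps(example, indent=2))
```

Both steps are just activities in the same control flow, which is why the SSIS comparison holds up: the data flow is the visual designer, and the notebook is the place for custom code.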
