Sequential Order In Data Flow

Anonymous
2020-10-14T08:49:55.167+00:00

Hi Support Team,

In the azure data flow, I have a scenario as below, where the data is been read parallelly from the source, but for me it should read sequentially in order to have a proper data in the target table.

  1. Reading the data from the two or more tables and writing into a one target table.
  2. Reading the data from the two or more tables and (also exclude the data which is already written in the step 1 process ) and then write the data into same target table.

For example:
Assume A1, A2, A3 as source table and T1 as target table.

Now in the 1st flow we will read the data from A1 and A2 and load the data into T1 and in the second flow we should read the data from A2 and A3 (and also exclude already loaded data in T1 ) and then load the data into the T1.

What is happening here is when we run the data flow, simultaneously parallel reads will happen i.e.

  1. Reading the data A1 and A2
  2. Reading the data A2 and A3 ( exclude which are not in T1).

So, in the step 2 there wont be any records to which are to be excluded from the T1 table.

Question here is, how to read the data in Sequential order from the source in the data flow like we have custom Sink order?

Please let me know if this question is able to understand and provide me the solution for the same.

Reach out to me for any clarifications
Mail id : ******@thomsonreuters.com

Thanks
Bharath

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,624 questions
{count} votes

2 answers

Sort by: Most helpful
  1. MarkKromer-MSFT 5,226 Reputation points Microsoft Employee Moderator
    2020-10-14T18:43:17.043+00:00

    Setting "Sink Ordering" is on the data flow general properties and may work for you here.

    https://learn.microsoft.com/en-us/azure/data-factory/data-flow-sink#custom-sink-ordering


  2. Kiran-MSFT 696 Reputation points Microsoft Employee
    2020-10-18T00:24:03.513+00:00

    You will need make 2 disconnected flows within the same dataflow. This can be done by making another source A2' from the same table and specifying sink order.

    A1, A2 -> T1

    A2', A3, T1 -> T1

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.