How to handle an Excel sheet with multiple datasets in it using ADF?

Rohit Rajendra Sawane 96 Reputation points
2022-11-11T11:56:37.3+00:00

Hi Team,

Could you help me out with below Scenario of ADF?

I have a single Excel file with just one tab, as displayed below with approx. same format and that tab contains three datasets. We want to move this data to a SQL table using ADF/data flow. With the following conditions, if we trigger the data by considering any Month that is present on the sheet. Then only that column should transfer to the SQL table.

For example, while triggering, if we pass the 22-Sep (Column Present in datasets), then only the 22-Sep column should be moved from the entire dataset to the given columns of SQL table as shown in attached image.

let me know if you required any further inputs.
259546-sqltablesample.jpg
259547-exceldatasetformat-sample.jpg

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,623 questions
{count} votes

Accepted answer
  1. AnnuKumari-MSFT 34,556 Reputation points Microsoft Employee Moderator
    2022-11-14T20:19:25.137+00:00

    Hi @Anonymous ,
    Thankyou for using Microsoft Q&A platform and thanks for posting your question here.
    As I understand your query, you want to fetch the data from three datasets within same excel sheet based on the input dates. Please let me know if my understanding about the ask is incorrect.
    To achieve the requirement, we need to use two dataflows where we need to use .csv files as intermediate storage . You can follow the below steps in mapping dataflow:
    1. Add source transformation and point the dataset to the excel sheet having schema imported and preview the data.
    2. Add filter transformation to filter out the rows having 'Note' values as it is not required as well as the dataset name i.e companyno, empno,deptno. Use this expression: not(like(Column_1,'Note%')) || not(isNull(toDate(Column_2)))
    3. Add surrogate key transformation to create an identity column Id which will assign an incremental unique value to corresponding rows.
    4. Add conditional split transformation to bifurcate the data into multiple groups based on the below conditions: Id>=1 && Id<= 4 , Id>=5 && Id<= 9 and default that will have Id>=10

    260254-split1.gif

    5. Add select transformation and deselect Id column from all three streams.
    6. Add sink transformation to each of the streams and select inline dataset so that it doesn't create too many physical datasets. Provide folderpath , filename in outputtosinglefile option and set single partition.

    260216-split22.gif

    In second dataflow,

    1. Add these three sources as inline datasets. In data preview tab, click on map drifted option which will automatically create a derived column transformation to redefine the schema
    2. Add unpivot transformation to all three branches and ungroup by using _c0 column. Provide columnnames in unpivot key tab and unpivoted column tab.
    3. Create a parameter in the dataflow named inputdate with string datatype. Add an alter row transformation and select 'Update if' condition having this expression: dates==$inputdate
    4. Add sink transformation having SQL dataset. In the settings tab, desect 'Allow insert' and select 'Allow update' . In key columns, select 'state' . In mapping tab, uncheck automapping and map the columns properly. For first branch, map _c0 --> state, datacol--> companytotal . Similarly, for second branch, map _c0 --> state, datacol--> employee and for third branch, _c0 --> state, datacol--> depttotal

    260393-splitgif1.gif

    260299-splitgiff2.gif

    Now, create an ADF pipeline and call both the dataflows.

    1. Create a pipeline parameter : Inputdate having string datatype. In dataflow2, parameter tab, provide value for the dataflow parameter from the pipeline parameter as @pipeline().parameters.Inputdate and check the expression checkbox.
    2. Run the dataflow by providing the date for which you want to extract the data.
    3. Check the SQL table.

    260348-splitgiff3.gif

    Hope this will help. Please let us know if any further queries.

    ------------------------------

    • Please don't forget to click on 130616-image.png or upvote 130671-image.png button whenever the information provided helps you.
      Original posters help the community find answers faster by identifying the correct answer. Here is how
    • Want a reminder to come back and check responses? Here is how to subscribe to a notification
    • If you are interested in joining the VM program and help shape the future of Q&A: Here is how you can be part of Q&A Volunteer Moderators
    1 person found this answer helpful.
    0 comments No comments

1 additional answer

Sort by: Most helpful
  1. Rohit Rajendra Sawane 96 Reputation points
    2022-11-18T13:21:18.687+00:00

    Hi @AnnuKumari-MSFT

    Thank you for your answer. I am grateful for your support and explanation in very simple words. I have accepted the answer. Consider my rating as 5 stars. Sorry for replying late.

    Thanks a lot.

    Regards,

    Rohit S.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.