ADF Inline dataset parameterization of folderpath

Prakash14 121 Reputation points
2022-10-13T14:25:36.97+00:00

Using ADF dataflow to copy databricks delta files to sql database. Under Source Options of dataflow , file system and folder path are not able to be parameterized. My objective is get the filesystem and folderpath value from previous activity.

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,702 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. KranthiPakala-MSFT 46,492 Reputation points Microsoft Employee
    2022-10-14T16:53:17.657+00:00

    Hello @Prakash14 ,

    Thanks for the question and using MS Q&A platform.

    In order to parameterize the fileSystem and folderPath of your delta source, you will have to follow the below procedure of steps:

    1. First create a dataflow parameters for both FileSystem and FolderPath as shown below

    250603-image.png

    1. Then go to your source options section, in Folder path properties, select the data flow parameters for fileSystem and folderPath you have created in step 1 using the expression builder.

    250577-image.png

    1. Then go your pipeline level settings, and click on data flow activity, then you will see the dataflow parameters below which you have configured in earlier steps. Next check box the Expression field and then under value property, click on pipeline expression and add the dynamic expression to pass the output of your previous activity from which you want to pass the values. Below is a sample GIF for the same. 250597-image.png

    GIF:

    250622-df-parameters.gif

    Hope this will help. Please let us know if any further queries.


    • Please don't forget to click on 130616-image.png or upvote 130671-image.png button whenever the information provided helps you. Original posters help the community find answers faster by identifying the correct answer. Here is how

  2. Prakash14 121 Reputation points
    2022-11-23T03:04:29.987+00:00

    df_container is the ADLS gen2 container name.
    df_folderpath is the ADLS gen2 folder path

    df_container : source , df_folderpath : input/data/

    Inside the source container - source/input/data/ there will be multiple directories and delta format files(databricks) will be there under each partition .

    date=2017-01-01/abc.parquet
    date=2017-03-01/xyz.parquet
    date=2017-04-01/pqr.parquet

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.