adf completly unefficient

Jean-Luc Andrei 1 Reputation point
2022-12-27T10:54:55.347+00:00

i created an etl job with adf

something simple

  • Extract : list of files from blobstorage format csv
  • Transform : for each source file, create a new csv with 2 columns mapped including one simple calculated column
  • Load : push new files into 2 target blobstorages

For one file of 215B (15 rows) to transform, it take 5 min.
how it is possible to be so unefficient? for something so simple, it should take at worst, 5 sec

ADF is supposed to be able proceed large amount of datas.
In next etl job i will need to proceed, i will have more than 400k rows...

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,644 questions
0 comments No comments
{count} votes

1 answer

Sort by: Most helpful
  1. Nandan Hegde 36,151 Reputation points MVP Volunteer Moderator
    2022-12-27T11:35:10.487+00:00

    Hey, ADF by default takes some time to create the dataflow cluster at run time. So that would be same irrespective of the number of rows in your file.
    This might be the case why it took 5 mins.

    similar thread:
    https://stackoverflow.com/questions/63933355/azure-data-factory-transformation-pipeline-take-more-than-5-minutes-to-execute-w


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.