Exact no of DIU calculation

Samy Abdul 3,376 Reputation points
2021-08-06T12:14:46.067+00:00

Hi All, I understand that a Data Integration Unit (DIU) is a representative measure of a combination of CPU, memory and network resource allocation and the default DIU setting on all copy tasks is 4, the minimum you can configure is 2 and the maximum that you can set it up is 256. How do we come to conclude on exact no of DIU needed for particular pipeline? Is any formula that determines the exact DIU to be set it up ? Thankful for your inputs.

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,426 questions
0 comments No comments
{count} votes

Accepted answer
  1. Vaibhav Chaudhari 38,901 Reputation points
    2021-08-06T14:43:04.747+00:00

    There is no formula available as such to calculate DIU. Some exercise & some test runs have to be performed.. try copying small / medium / large amount of data and every time scale up the DIU number & also Parallel copies config in copy activity, see what DIU works best and note down that performance and going forward refer the DIU used from captured metrics.

    Same suggestion is given in docs as well - https://learn.microsoft.com/en-us/azure/data-factory/copy-activity-performance#performance-tuning-steps

    ----------

    Please don't forget to Accept Answer and Up-vote if the response helped -- Vaibhav


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.