An Azure service for ingesting, preparing, and transforming data at scale.
Is integration runtime shared across dataflow runs?
In ADF, an IR can indeed be used across multiple dataflows or pipelines, but its resources may be allocated differently based on whether the IR is auto-resolved or set to a fixed region, as in your case with North Europe. When using a fixed-region IR, resources are allocated within that region and shared among dataflows or pipelines, which can impact performance if multiple heavy data transformations run simultaneously.
Does each pipeline receive an equal share of resources when running concurrently?
If you are running multiple pipelines on a single IR, the resources (like CPU cores and memory) are effectively shared. For example, if your IR is configured with 4 cores and 4 pipelines are running simultaneously, they may each receive a portion of those resources, resulting in a reduced share for each. This means each pipeline might experience a performance hit, as available cores are distributed across them, potentially causing the process time to increase if multiple pipelines compete for resources. Unlike dedicated clusters, IR resources are dynamically allocated, which can lead to varying performance outcomes based on concurrent usage.
Does running multiple pipelines on the same IR impact vCore-hour billing?
Your concern about vCore-hour costs when running pipelines concurrently or sequentially is valid. Generally, vCore-hour billing applies to the amount of computational power consumed, irrespective of whether pipelines run concurrently or sequentially. In other words, if you have 3 pipelines each consuming 10 vCore-hours, you will still incur a total cost of 30 vCore-hours, regardless of whether they run simultaneously on a shared IR or one after another. The vCore-hour metric is based on the actual processing power used, so there would be no cost savings from adjusting the pipeline run order on a single IR.
Would running pipelines sequentially or in parallel on the same IR influence costs if TTL settings were disregarded?
If we set aside TTL considerations, the cost remains the same in both scenarios, as ADF charges are based on vCore-hour usage tied to the processing needs of each dataflow or pipeline. Running them sequentially might result in longer processing times but wouldn’t impact the total vCore-hours billed. However, TTL configurations could influence costs slightly, as they determine when IRs spin up or down, affecting minor additional costs, but this factor is disregarded in this scenario as per your question.