Share via

Azure data factory billing question

HimanshuSinha 19,597 Reputation points Microsoft Employee Moderator
2026-04-09T19:10:35.39+00:00

Hello ,
Just wanted to know a bit more on how billing is done for ADF ?
Thanks
Himanshu

Azure Data Factory
Azure Data Factory

An Azure service for ingesting, preparing, and transforming data at scale.

0 comments No comments

Answer accepted by question author

  1. Q&A Assist
    2026-04-09T19:10:49.7433333+00:00
    AI generated content. This question contains an answer created with AI from Q&A Assist. Learn more

    Azure Data Factory (ADF) uses a consumption-based billing model. There is no fixed-size compute to provision; charges accrue only when ADF operations run or ADF-related infrastructure is active.

    Billing is primarily driven by these meters:

    1. Orchestration activity runs
      • Charged per activity run that ADF orchestrates in pipelines.
      • Includes pipeline activity execution and external pipeline activity execution.
    2. Data Integration Unit (DIU) hours (Copy activities on Azure Integration Runtime)
      • For copy activities that run on Azure Integration Runtime, charges are based on:
        • Number of DIUs used, and
        • Execution duration (in hours).
      • Example: moving data from sources like AWS S3 to Azure Data Lake via copy activity consumes DIU-hours.
    3. vCore hours (Mapping Data Flows)
      • For data flow execution and debugging, charges are based on:
        • Compute type,
        • Number of vCores, and
        • Execution duration.
    4. SSIS Integration Runtime (IR)
      • If using Azure-SSIS IR, charges accrue based on:
        • Instance type, and
        • Duration that the SSIS IR is running.
    5. Data Factory operations and management
      • Creation, editing, retrieving, and monitoring of ADF artifacts incur Data Factory operations charges.

    At the end of the billing cycle, all these meters are summed and appear as separate line items under Azure Data Factory on the invoice. ADF is serverless and elastic, so cost scales with actual usage.

    Additional points:

    • Estimating costs: Use the Azure Data Factory pricing calculator to estimate costs by plugging in expected activity runs, DIU-hours, vCores, and durations. A recommended approach is to run a small proof-of-concept (for example, copy 100 GB instead of 1 TB), measure DIU-hours and activity runs, then extrapolate.
    • Cost tracking and attribution:
      • By default, billing is summarized by factory.
      • Factory settings allow switching to Show billing report: by pipeline, which emits separate billing entries per pipeline for more granular chargeback/FinOps.
      • Tags can be applied to the data factory and related resources to group and analyze consolidated costs. SSIS IRs inherit the factory tag (restart required after tag changes).
    • Monitoring consumption:
      • In ADF Studio → Monitor, each pipeline run has a Consumption view showing usage per ADF meter (orchestration, DIU-hours, data flow vCores, etc.) for that run.
      • This helps identify which pipelines and activities are driving costs.
    • Account and prepayment:
      • Azure Prepayment (monetary commitment) can be used to pay ADF charges, but not third-party Marketplace services.

    References:

    Was this answer helpful?

1 additional answer

Sort by: Most helpful
  1. Manoj Kumar Boyini 14,015 Reputation points Microsoft External Staff Moderator
    2026-04-09T22:02:55.2333333+00:00

    Hi @HimanshuSinha

    Adding a few additional considerations that can influence Azure Data Factory costs beyond the core billing meters already covered:

    Integration Runtime choice Azure Integration Runtime is serverless, but with Self-hosted Integration Runtime, ADF itself doesn’t charge for compute the cost comes from the underlying VM/on-prem infrastructure. This is important for hybrid scenarios where costs shift outside ADF.

    Network and data movement factors Copy activity costs can also be influenced by cross-region or cross-cloud data movement, data egress charges, and the placement of the integration runtime relative to source and destination. These costs are billed separately but can significantly impact the overall solution cost.

    Data Flow cluster behavior Mapping Data Flows use managed Spark clusters, and billing includes cluster startup (warm-up) time, active debug sessions, and Time To Live (TTL), which keeps clusters warm (reducing startup delays but continuing billing while active).

    Concurrency and pipeline design Parallel execution (ForEach parallelism, multiple concurrent pipelines) can increase overall consumption, leading to higher DIU consumption, increased vCore usage, and more orchestration activity runs—even if total data volume remains the same.

    Cost optimization tips Use Data Flows only when required, monitor and stop unused debug sessions, tune parallelism carefully, and validate costs with smaller test runs before scaling to production workloads.

    Was this answer helpful?

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.