Unexpected Pipeline Slowness / Increase in Data Flow vCore Consumption

Angus White 0 Reputation points
2024-08-07T16:58:48.01+00:00

Hi,

We have been running the same pipelines on a weekly basis and have seen consistent results for run time, vCore consumption, and cost. A couple of weeks ago, however, we noticed that our pipelines were running extremely slowly. As a result, we have consumed a huge amount of vCore processing and our Azure cost has more than doubled.

All our pipelines are impacted, but here is a single example:

Pipeline duration:

07/14/2024 (regular): 30s
07/20/2024 (irregular): 10m 6s

Dataflow vCore consumption:

07/14/2024 (regular): 0.0722
07/20/2024 (irregular): 2.614
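
For scale, the two runs above can be compared directly. The following is a minimal sketch that only restates the figures quoted in this post; the helper name `to_seconds` and the dictionary layout are illustrative assumptions, not anything exported by Azure.

```python
import re

def to_seconds(text: str) -> int:
    """Convert a duration such as '10m 6s' or '30s' into seconds."""
    units = {"h": 3600, "m": 60, "s": 1}
    return sum(int(n) * units[u] for n, u in re.findall(r"(\d+)\s*([hms])", text))

# Figures copied from the post above (07/14/2024 vs 07/20/2024).
regular = {"duration": "30s", "vcore": 0.0722}
irregular = {"duration": "10m 6s", "vcore": 2.614}

duration_ratio = to_seconds(irregular["duration"]) / to_seconds(regular["duration"])
vcore_ratio = irregular["vcore"] / regular["vcore"]

print(f"duration increase: {duration_ratio:.1f}x")  # 20.2x
print(f"vCore increase:    {vcore_ratio:.1f}x")     # 36.2x
```

On these two runs, the vCore figure grew by more than the wall-clock duration did (roughly 36x against roughly 20x).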

Any ideas on what is causing the impact to our pipeline performance? Nothing has changed in our data volumes or flows, we've made no changes to the pipelines, and we are still processing the same number of records.

One theory is that the Azure resource / VM associated with the runtime was low on space. Would this have an impact?

Thanks,

Angus

Azure Cost Management
A Microsoft offering that enables tracking of cloud usage and expenditures for Azure and other cloud providers.
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

1 answer

  1. Bhargava-MSFT 31,261 Reputation points Microsoft Employee Moderator
    2024-08-08T17:36:11.41+00:00

    Hello Angus White,

    Welcome to the Microsoft Q&A forum.

    It seems like you're experiencing a performance issue with your pipelines, which has led to increased vCore consumption and higher Azure costs. Given that there have been no changes to your data volumes or pipelines, and you're processing the same number of records, there are a few potential factors that could be causing this issue:

    • If your Azure resource or VM associated with the runtime was low on resources, it could indeed impact performance.
    • There might be issues with the underlying infrastructure that your pipelines are running on. This could include network latency, hardware failures, or other resource contention issues.
    • If your data is stored in Azure Data Lake Storage, Azure SQL Database, or other cloud storage, check for any performance issues or I/O bottlenecks during the impacted period.
    • Check network connectivity. Connectivity issues between the integration runtime that executes your Data Factory pipelines and the data source or destination could cause the pipeline to run slower.

    I recommend reviewing Azure Monitor logs and metrics around the time of the slowdown to see if there are any additional details or patterns that could explain the performance degradation. This might help identify whether the issue is related to resource usage, network issues, or something else.
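
To narrow down when the slowdown began, one option is to export the run history and compare each run against an earlier baseline. The sketch below is only an illustration under stated assumptions: the record layout (`date`, `seconds`), the function name `flag_slow_runs`, and the 3x threshold are all hypothetical, and the two earliest rows are placeholders rather than data from this thread. It does not call any Azure API.

```python
from statistics import median

def flag_slow_runs(runs, baseline_count=3, threshold=3.0):
    """Return runs slower than `threshold` times the median of the earliest runs."""
    ordered = sorted(runs, key=lambda r: r["date"])  # ISO dates sort as strings
    baseline = median(r["seconds"] for r in ordered[:baseline_count])
    return [r for r in ordered[baseline_count:] if r["seconds"] > threshold * baseline]

# Hypothetical export of pipeline run durations, one row per run.
runs = [
    {"date": "2024-06-30", "seconds": 31},   # placeholder, not from the thread
    {"date": "2024-07-07", "seconds": 29},   # placeholder, not from the thread
    {"date": "2024-07-14", "seconds": 30},   # regular run quoted in the question
    {"date": "2024-07-20", "seconds": 606},  # irregular run quoted in the question
]

for run in flag_slow_runs(runs):
    print(f"{run['date']}: {run['seconds']}s exceeds the baseline threshold")
```

The first flagged date gives a starting point for the time window to inspect in the logs and metrics.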
