Data Factory activities stuck in Queued
A Copy activity invoked from a child pipeline intermittently stays in the "Queued" state for over an hour. The behavior resembles a bug identified in version 5.2.
Environment:
Azure Data Factory
Linked Service: Postgres (vnet access)
Linked Service: ADLS Gen 2
Linked Service: Azure SQL Database
IR Type: Self Hosted
IR Version: 5.23.8355.1
Scenario:
The parent pipeline has a Lookup followed by a ForEach loop containing an Execute Pipeline activity.
The child pipeline has a Lookup followed by a ForEach loop containing a series of Copy and Script activities.
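To make the nesting above easier to follow, here is a rough sketch of the two pipelines as Python dicts that mirror the pipeline JSON layout. All names (ParentPipeline, LookupChild, CopyStep, and so on) are placeholders, and the Lookup, Copy, and Script activities are trimmed to name and type only, so this is not deployable as-is and is not the actual pipeline definition.

```python
import json


def foreach_over_lookup(name, lookup_name, inner_activities):
    """Build a ForEach activity that iterates over the rows returned by a Lookup."""
    return {
        "name": name,
        "type": "ForEach",
        "dependsOn": [
            {"activity": lookup_name, "dependencyConditions": ["Succeeded"]}
        ],
        "typeProperties": {
            "items": {
                "value": f"@activity('{lookup_name}').output.value",
                "type": "Expression",
            },
            "activities": inner_activities,
        },
    }


# Child: Lookup, then ForEach over Copy and Script steps (placeholder names).
child_pipeline = {
    "name": "ChildPipeline",
    "properties": {
        "activities": [
            {"name": "LookupChild", "type": "Lookup"},
            foreach_over_lookup(
                "ForEachChild",
                "LookupChild",
                [
                    {"name": "CopyStep", "type": "Copy"},
                    {"name": "ScriptStep", "type": "Script"},
                ],
            ),
        ]
    },
}

# Parent: Lookup, then ForEach that invokes the child once per row.
parent_pipeline = {
    "name": "ParentPipeline",
    "properties": {
        "activities": [
            {"name": "LookupParent", "type": "Lookup"},
            foreach_over_lookup(
                "ForEachParent",
                "LookupParent",
                [
                    {
                        "name": "RunChild",
                        "type": "ExecutePipeline",
                        "typeProperties": {
                            "pipeline": {
                                "referenceName": child_pipeline["name"],
                                "type": "PipelineReference",
                            },
                            "waitOnCompletion": True,
                        },
                    }
                ],
            ),
        ]
    },
}

if __name__ == "__main__":
    print(json.dumps(parent_pipeline, indent=2))
```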
When debugging the child pipeline with static parameters, it runs well most of the time.
When debugging the parent pipeline, the child pipeline runs well right after a fresh reboot of the IR. After about a day, the third activity (a Copy activity) stays in the Queued state for longer and longer periods. We have over-provisioned the IR host with resources to work around this, but the problem continues.
The problem happens regardless of the "queue length" on the IR: it occurs both when other activities are running and when the IR is otherwise idle.
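One way to check whether queue time really grows with IR uptime is to pair each completed Copy run's queue duration with the hours elapsed since the last IR restart. The sketch below assumes activity run records have been exported as dicts with an "activityRunStart" ISO timestamp and an "output" dict, and that the Copy output carries "durationInQueue" with an "integrationRuntimeQueue" value in seconds. Those field names are my assumption about the export, and the helper names are made up for illustration.

```python
from datetime import datetime


def queue_seconds(activity_output):
    """Return the IR queue time in seconds from a Copy output dict, or None if absent.

    Assumes the shape {"durationInQueue": {"integrationRuntimeQueue": <seconds>}}.
    """
    return (activity_output.get("durationInQueue") or {}).get("integrationRuntimeQueue")


def queue_by_hours_since_restart(runs, ir_restart):
    """Return sorted (hours_since_restart, queue_seconds) pairs.

    `runs` is a list of exported activity run dicts (assumed field names);
    `ir_restart` is a naive datetime in the same time zone as the timestamps.
    Runs without a queue duration in their output are skipped.
    """
    rows = []
    for run in runs:
        secs = queue_seconds(run.get("output") or {})
        if secs is None:
            continue
        started = datetime.fromisoformat(run["activityRunStart"])
        hours = (started - ir_restart).total_seconds() / 3600
        rows.append((round(hours, 1), secs))
    return sorted(rows)
```

If the second value climbs steadily as the first increases, that would support the "worse after about a day" pattern independently of how busy the IR is.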
Pipeline Run id: 4f25f845-2edf-4811-9d6d-0df6d0d5df17
Activity run id: 2dea7b9f-9211-4360-b44e-23f832c4373a