Azure Data Factory - Copy activity using REST connector - Pagination vs Retry

Balachandran, Karthikeyan 25 Reputation points
2023-08-13T06:45:18.22+00:00

Hi team,

I use ADF pipeline comprising copy activity to pull data from a remote endpoint using REST connector and then place the data on the blob. I use the pagination feature available with REST connector as the response is paginated.

I have a problem here - whenever retry is fired (say because of 50x at the source endpoint), retry does NOT resume from the call that failed but starts all over again from 1st page. How do I fix this? Please advise.

Thank you.

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,623 questions
{count} votes

1 answer

Sort by: Most helpful
  1. ShaikMaheer-MSFT 38,546 Reputation points Microsoft Employee Moderator
    2023-08-14T17:18:46.6+00:00

    Hi Balachandran, Karthikeyan,

    Thank you for posting query in Microsoft Q&A Platform.

    yes, you are right. Retry option will actually re-run entire activity again. That means copy activity will re-run all API calls again. So, in case of failure of copy activity if partial data already moved to blob, then don't consider using retry of copy activity. Instead, you can have different activities to run on failure of copy activity, to identify up to what page data loaded and loaded from there next pages.

    Basically, it's like have an extension of process in the pipeline on failure of copy activity, to identify till what page data loaded and process next steps from there.
    Hope this helps. Please let me know how it goes.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.