Skip incompatible rows (Fault tolerance) doesn't work

Sofía Neithardt 0 Reputation points
2023-08-07T20:10:10.5066667+00:00

Hi,

I have a dataset with semicolon delimiter but i also have a semicolon on a string field, so it cause some rows have more columns than the correct ones. Because of that i decided to activated the Fault Tolerance on Copy Activity, but unfortunately when i run the pipeline some rows are skipped, but a few ones stays on file causing errors in subsequent processes.

When i proceess again the output file without skipped rows, the other ones that hadn't been omitted are skipped now, so i don't understand what is happening.

I thought it was because of a limit of rows per file or a limit of rows skipped, but i made some test and it doesn't.

Have someone experimented something like that?

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,651 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Bhargava-MSFT 31,261 Reputation points Microsoft Employee Moderator
    2023-08-08T18:29:58.36+00:00

    Hello @Sofía Neithardt ,

    Welcome to the Microsoft Q&A forum.

    When the Fault Tolerance feature is enabled, it skips rows that don't match the schema of the destination dataset. In your case, the semicolon delimiter is causing some rows to have more columns than the correct ones, which is why they are being skipped.

    Did you examine the logs on the Azure blob storage (incompatible rows) to see the cause of the failure?

    It seems like the rows skipped in the first run were not fixed in the data source, which caused them to be skipped again in the second run. Please check the log files to see if this is the case.

    Also, you can try to handle the semicolon delimiter issue by using the correct delimiter settings on the source dataset.

    Reference document:

    https://learn.microsoft.com/en-us/azure/data-factory/copy-activity-fault-tolerance#copying-tabular-data

    I hope this helps. Please let me know if you have any further questions.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.