Error while reading file abfss:REDACTED_LOCAL_PART@miglandingzonelake.dfs.core.windows.net - source folder deleted

GB 6

Hello, I am getting this intermittent error when running an ADF pipeline. After receiving this error the storage account blob container folder is emptied of all files! I presume this what redacted means? I have deleted the DataSet and recreated with the same name and the error returned after working for a while? Recreating with a new name seems to have avoided the error thus far? Is there a way to monitor the cache mentioned? To have an error that deletes source files is a little scary to say the least.

Hopefully someone can help.

Operation on target <data flow name> failed: {"StatusCode":"DFExecutorUserError","Message":"Job failed due to reason: Error while reading file abfss:REDACTED_LOCAL_PART@miglandingzonelake.dfs.core.windows.net/<path/filename.csv>. It is possible the underlying files have been updated. You can explicitly invalidate the cache in Spark by running 'REFRESH TABLE tableName' command in SQL or by recreating the Dataset/DataFrame involved."

The DataSet used is as follows. It is used a source and sink in the data flow.
{
"name": "<ldz_name>",
"properties": {
"linkedServiceName": {
"referenceName": "<LandingZoneDataLake>",
"type": "LinkedServiceReference"
},
"folder": {
"name": "ldz_name"
},
"annotations": [],
"type": "DelimitedText",
"typeProperties": {
"location": {
"type": "AzureBlobFSLocation",
"fileSystem": "ldz_name"
},
"columnDelimiter": ",",
"escapeChar": "\",
"firstRowAsHeader": true,
"quoteChar": "\""
},
"schema": []
}
}

KranthiPakala-MSFT 46,442 Reputation points Microsoft Employee

2021-02-12T23:15:57.133+00:00

Hi @GB ,

Welcome to Microsoft Q&A forum and thanks for bringing this our notice.

For deeper investigation on this issue, I have initiated a private message with you requesting few additional details to share over email with us. Could you please share those details as mentioned in the private message.

Looking forward to your response.

Thank you
KranthiPakala-MSFT 46,442 Reputation points Microsoft Employee

2021-02-19T00:17:53.673+00:00

Hi @GB ,

Following up to see if you have got a chance to file a support ticket? If you don't have a support plan, please let us know.

Thank you
GB 6 Reputation points

2021-02-19T11:15:14.613+00:00

A ticket is being raised by our 1st line support partners- thanks for help thus far.
KranthiPakala-MSFT 46,442 Reputation points Microsoft Employee

2021-02-19T17:12:33.867+00:00

Hi @GB ,

Thanks for the update. Could you please share the SR# for internal tracking?

Regards,
Kranthi
Damien Sonnerat 1 Reputation point

2021-06-04T06:07:42.61+00:00

I had the exact same error with the files removed but using parquet files. The issue was due to the fact that data flow (spark behind it) where creating a new parquet file in a folder which contains a folder which itself had already a parquet file.
It seems that the parquets files have to be at the very end of the folder structures (in the leaves).

Not allowed:
A -> B -> c.parquet
-> d.parquet

Allowed:
A->B -> c.parquet
->E -> d.parquet

This is the only change I made to make it work.
Maybe you can check and figure out if you case is similar.

Share via

Error while reading file abfss:REDACTED_LOCAL_PART@miglandingzonelake.dfs.core.windows.net - source folder deleted