I have an error when calling a Synapse Analytics notebook via Azure Data Factory to concurrently read Parquet tables. The Parquet files are stored in Azure Data Lake Storage Gen2; some tables report errors and some don't, and there is no problem with manual execution

Niu, Ivan 20 Reputation points
2023-10-21T18:23:25.2633333+00:00

Hello.

When Azure Data Factory calls a Synapse Analytics notebook to read Parquet tables concurrently, some tables fail with an error and others do not. The Parquet files are stored in Azure Data Lake Storage Gen2. The logs show that the Parquet file does not exist in the cache, or that the file has already been read to the end but is still being read.

Strangely enough, running the notebook manually works fine, and the Parquet files themselves are fine and can be read normally.

When reading 10 Parquet tables at a time, both concurrent and non-concurrent reads report errors.

Is there any way to solve this problem? It did not occur before, but it has been happening since the end of September.

Azure Data Lake Storage
Azure Synapse Analytics
Azure Data Factory