Azure Databricks - InvalidAbfsRestOperationException

Sharma, Abhinav 21 Reputation points
2022-12-27T08:46:07.883+00:00

I am trying to run Azure Databricks Notebook using ADB job clusters. The notebook language is pyspark. I am trying to access a container located in azure storage account. Container contains text files.

Notebook code uses command spark.read.format('text).load('filePath). While trying to read data from azure storage account container, command gets stuck and after few hours, ~5-6 hours, it gets timed out and throw below exception.

"Py4JJavaError: An error occurred while calling o1778.load. : Status code: -1 error code: null error message: InvalidAbfsRestOperationExceptionjava.net.SocketTimeoutException: connect timed out"

Can anyone please let me know whether this exception pertains to network, storage account, spark cluster or some other resource and what could be possible solution.

Please let me know if any more information is required on this.

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,305 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.