The spark driver has stopped unexpectedly and is restarting. Your notebook will be automatically reattached.

Radu Sitaru (ACCENTURE) 20 Reputation points Microsoft Employee
2023-09-12T07:46:04.69+00:00

I have a scheduled pipeline in Azure Data Factory that runs Databricks notebooks.

For some of the notebooks, I get the following error at the last step of the notebook, which is the step where data is saved to our data lake: "The spark driver has stopped unexpectedly and is restarting. Your notebook will be automatically reattached."

This error only occurs when the notebooks are triggered by the pipeline from ADF.

When I trigger the notebooks manually, everything works.

What is the problem and the solution?

Regards,

Azure Data Lake Storage
An Azure service that provides an enterprise-wide hyper-scale repository for big data analytic workloads and is integrated with Azure Blob Storage.
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

Accepted answer
  1. Bhargava-MSFT 31,016 Reputation points Microsoft Employee
    2023-09-14T16:36:05.61+00:00

    Issue: When Databricks notebooks are triggered by the ADF pipeline, they fail with the error "The spark driver has stopped unexpectedly and is restarting. Your notebook will be automatically reattached."

    Solution:

    "The spark driver has stopped unexpectedly" is typically caused by insufficient driver memory.

    We observed that whenever the error occurs in the ADF pipeline, the notebooks are running concurrently, which likely increases the memory pressure on the driver.

    The error was resolved after changing the pipeline so that the notebooks run sequentially instead of concurrently.
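
One way to make notebook activities run sequentially in ADF is to chain them with `dependsOn`, so that each activity starts only after the previous one succeeds. The Python sketch below generates such a chain of `DatabricksNotebook` activity definitions as they would appear in the pipeline JSON. The activity names, notebook paths, and linked service name are hypothetical placeholders, not taken from the pipeline in the question.

```python
import json


def build_sequential_activities(notebook_paths, linked_service="AzureDatabricksLS"):
    """Build ADF activity definitions where each notebook waits for the previous one.

    The linked service name and the generated activity names are assumptions
    made for illustration only.
    """
    activities = []
    previous_name = None
    for index, path in enumerate(notebook_paths, start=1):
        name = f"RunNotebook{index}"
        activity = {
            "name": name,
            "type": "DatabricksNotebook",
            "dependsOn": [],
            "typeProperties": {"notebookPath": path},
            "linkedServiceName": {
                "referenceName": linked_service,
                "type": "LinkedServiceReference",
            },
        }
        if previous_name is not None:
            # Start this notebook only after the previous one has succeeded.
            activity["dependsOn"].append(
                {"activity": previous_name, "dependencyConditions": ["Succeeded"]}
            )
        activities.append(activity)
        previous_name = name
    return activities


# Hypothetical notebook paths, used only to show the shape of the output.
activities = build_sequential_activities(
    ["/Shared/load_raw", "/Shared/transform", "/Shared/save_to_lake"]
)
print(json.dumps(activities, indent=2))
```

If the notebooks are launched from a ForEach activity instead, setting its `isSequential` property to `true` should have a similar effect.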


1 additional answer

  1. Radu Sitaru (ACCENTURE) 20 Reputation points Microsoft Employee
    2023-09-14T11:41:01.7+00:00

    Problem solved by not running the notebooks concurrently. My pipeline had been running them concurrently.

