Job aborted due to stage failure: Task 0 in stage 1486.0 failed 4 times, most recent failure: Lost task 0.3 in stage 1486.0

2023-02-09T09:39:23.7833333+00:00

While running a merge load from the bronze layer to the silver layer, I get the error below:

org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 1486.0 failed 4 times, most recent failure: Lost task 0.3 in stage 1486.0 (TID 1665) (10.116.129.142 executor 0): org.apache.spark.SparkException: Failed to store executor broadcast spark_join_relation_469_-315473829 in BlockManager.

Please suggest which checks I should perform to diagnose this error.

Azure Databricks

1 answer

  1. KranthiPakala-MSFT 46,612 Reputation points Microsoft Employee
    2023-02-23T09:03:12.94+00:00

    Hi Abhishek Dutta (******@tcs.com),

    Welcome to the Microsoft Q&A forum, and thanks for posting your query.

    This error message comes from Spark's TorrentBroadcast class, whose source code includes a comment explaining how the broadcast mechanism works. Based on that mechanism, I suspect the error is caused by the driver running out of memory. To work around it, please try increasing the size of the driver node and check whether that resolves the error.


    If increasing the size of the driver node does not resolve the issue, I would recommend filing a support ticket for deeper investigation.
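One further check that may help narrow things down, offered as a sketch and not as part of the diagnosis above: the `spark_join_relation_...` name in the error suggests the broadcast belongs to a join, so temporarily disabling automatic broadcast joins can show whether the merge succeeds without it. The helper name `apply_conf` and the dictionary `BROADCAST_DIAGNOSTIC_CONF` are hypothetical; `spark.sql.autoBroadcastJoinThreshold` is the standard Spark SQL setting, and `spark` is assumed to be the session that a Databricks notebook provides.

```python
# Diagnostic settings to try. This is an assumption for troubleshooting,
# not a recommended permanent configuration.
BROADCAST_DIAGNOSTIC_CONF = {
    # -1 disables automatic broadcast joins for subsequently planned queries.
    "spark.sql.autoBroadcastJoinThreshold": "-1",
}


def apply_conf(spark, conf):
    """Apply each setting to a SparkSession and return the previous values.

    The returned dict can be passed back to this function to restore them.
    """
    previous = {}
    for key, value in conf.items():
        previous[key] = spark.conf.get(key)  # remember the current value
        spark.conf.set(key, value)           # apply the diagnostic value
    return previous
```

In a notebook this could be used as `previous = apply_conf(spark, BROADCAST_DIAGNOSTIC_CONF)`, followed by re-running the merge and then `apply_conf(spark, previous)` to restore the original value. If the merge succeeds with broadcast joins disabled, that points at the size of the broadcast relation; if it still fails, a support ticket remains the next step.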

    Hope this info helps. Let us know how it goes.

    Thank you
