How to pass Spark Config to Synapse Pipeline Spark Job Definition

Matthias Tauber 0 Reputation points
2023-06-15T07:48:53.4166667+00:00

Hi,

I need to pass Spark config options to a Spark Job Definition in a Synapse Pipeline. I added the configuration in the Spark Job Definition. When I submit the Spark job directly from there, it appears to work fine; however, when I reference the job definition in my pipeline using the Spark Job Definition object, the config does not seem to be inherited.

There is also no way to specify the config in the Synapse Pipeline Spark Job Definition using the UI, and when I try to add it directly in the underlying JSON, my changes are not saved.

I need to add the following parameters for my Spark Job:

spark.driver.userClassPathFirst: "true",
spark.executor.userClassPathFirst: "true"
Azure Synapse Analytics

2 answers

  1. PRADEEPCHEEKATLA 90,641 Reputation points Moderator
    2023-06-15T13:06:28.6733333+00:00

    @Matthias Tauber - Thanks for the question and using MS Q&A platform.

    Spark configuration: Specify values for Spark configuration properties listed in the topic: Spark Configuration - Application properties. Users can use default configuration and customized configuration.

    To pass Spark config options to a Spark Job Definition in a Synapse Pipeline, you can add the configuration in the "Advanced" section of the Spark Job Definition object in your pipeline. Here are the steps to do this:

    [Screenshot]

    When you run your pipeline, the Spark Job Definition object will inherit the Spark Config options you specified in the "Advanced" section.
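    One way to confirm whether the options actually reached the job when it runs from the pipeline is to compare the effective Spark configuration against the expected values at the start of the job. The following is a minimal sketch, not part of the original answer: the helper names `find_conf_mismatches` and `effective_conf_from_spark` are hypothetical, and it assumes the job is a PySpark job so that `SparkContext.getConf().getAll()` is available.

```python
# Settings the question asks for; values are strings, as Spark stores them.
EXPECTED_CONF = {
    "spark.driver.userClassPathFirst": "true",
    "spark.executor.userClassPathFirst": "true",
}


def find_conf_mismatches(effective, expected=EXPECTED_CONF):
    """Return {key: (expected, actual)} for each setting that is missing or different."""
    return {
        key: (value, effective.get(key))
        for key, value in expected.items()
        if effective.get(key) != value
    }


def effective_conf_from_spark():
    """Read the running application's configuration as a plain dict.

    PySpark is imported lazily so that find_conf_mismatches can be used
    (and tested) on machines without Spark installed.
    """
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.getOrCreate()
    # getAll() returns a list of (key, value) tuples.
    return dict(spark.sparkContext.getConf().getAll())
```

    Inside the job, calling `find_conf_mismatches(effective_conf_from_spark())` and logging the result shows which settings, if any, were not inherited; an empty dict means both options are set as expected.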

    [Screenshot]

    For more details, refer to Quickstart: Transform data using Apache Spark job definition.

    Hope this helps. Do let us know if you have any further queries.




  2. Matthias Tauber 0 Reputation points
    2023-06-21T12:29:42.0266667+00:00

    @PRADEEPCHEEKATLA After some more effort, we managed to solve the issue by baking the config directly into the Synapse Spark Job Definition object. Thank you for your support.
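    The thread does not show what "baking the config in" looked like. As a rough illustration only, the sketch below merges the two options into an exported job definition JSON before it is published. It assumes the Spark settings live under `properties.jobProperties.conf` in that JSON, which should be checked against your own exported definition; the function name `with_spark_conf` and the sample definition are hypothetical.

```python
import copy
import json


def with_spark_conf(job_definition, conf):
    """Return a copy of the job definition with `conf` merged into its Spark settings.

    Assumption: Spark settings are stored under properties.jobProperties.conf.
    The input dict is left unmodified.
    """
    updated = copy.deepcopy(job_definition)
    job_properties = updated.setdefault("properties", {}).setdefault("jobProperties", {})
    # Existing keys are kept; keys in `conf` override them on conflict.
    merged = dict(job_properties.get("conf") or {})
    merged.update(conf)
    job_properties["conf"] = merged
    return updated


if __name__ == "__main__":
    # Hypothetical, heavily trimmed job definition used only for demonstration.
    definition = {
        "name": "example-job",
        "properties": {"jobProperties": {"conf": {"spark.dynamicAllocation.enabled": "false"}}},
    }
    patched = with_spark_conf(
        definition,
        {
            "spark.driver.userClassPathFirst": "true",
            "spark.executor.userClassPathFirst": "true",
        },
    )
    print(json.dumps(patched, indent=2))
```

    Working on a copy keeps the original export available for comparison if the published definition does not behave as expected.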

