Azure Synapse CI/CD Pipeline - Override Apache Spark Pool Name in Notebooks

Question

Azure Synapse CI/CD Pipeline - Override Apache Spark Pool Name in Notebooks

Tsu Kernik 0

I'm working on setting up a CI/CD pipeline for an Azure Synapse workspace, and I need to create custom parameters in the deployment process to handle environment-specific values. Specifically, I need to override the Apache Spark pool name used in the development environment (synDEVSPark) to the production Apache Spark pool name (synPROSPark1).

I've followed the instructions from the Microsoft documentation, but I'm encountering an error during the synapse deployment task.

Here are the steps I've taken:

Created a template-parameters-definition.json file to define the parameters (shown in the image below).
Set up the override parameter in the Synapse deployment task (shown in the image below:
Created a pipeline variable to hold the production Apache Spark pool name (shown in the below image). Despite these efforts, the deployment fails with the following error (shown in the image below): Failed to fetch the deployment status {"code":"400","message":"CreateOrUpdateNotebook failed: [statusCode from ADF:BadRequest, ErrorMessage:{\"code\":\"BadRequest\",\"message\":\"The document creation or update failed because of invalid reference 'synDEVSPark'... How can I correctly override the Apache Spark pool name in the Synapse deployment task to ensure it uses the production Spark pool (synPROSPark1) instead of the development Spark pool (synDEVSPark)? Is there a specific way to reference the Apache Spark pool in the parameters, considering it is not a linked service? Any help or guidance on what I might be doing wrong or how to properly configure the override parameters would be greatly appreciated. Thank you!

Smaran Thoomu 32,530 Reputation points Microsoft External Staff Moderator

2024-08-02T16:54:13.5766667+00:00

@Tsu Kernik Following up to see if the below suggestion was helpful. And, if you have any further query do let us know.

1 answer

Your answer

Smaran Thoomu 32,530 Reputation points Microsoft External Staff Moderator

2024-08-02T16:54:13.5766667+00:00

@Tsu Kernik Following up to see if the below suggestion was helpful. And, if you have any further query do let us know.

Answer 1

Hi @Tsu Kernik

Thanks for the question and using MS Q&A platform.

Please correct me if my understanding is wrong. You are looking to parameterize the Notebook parameter, to replace your Spark pool name with a different value (without the default value) when deploying to a higher environment. You can use the below code in your template-parameter-definition file. This code will expose the notebook parameters.

"Microsoft.Synapse/workspaces/notebooks": {

        "properties": {

            "bigDataPool": {

                "referenceName": "="

            },

             "metadata": {

                "a365ComputeOptions": {

                        "id": "=",

                         "name": "=",

                        "endpoint": "="

                }

            }

        }

     }

on the next step, go to the Workspace Deployment task in your Release Pipeline and add these new parameters in the “OverrideParameters” section. Here I have added Notebook 1_properties_bigDataPool_referenceName = DemoUAT ( to change the sparkpool name from demo to DemoUAT)

User's image

I hope this helps. Please let me know if you have any further questions.

Smaran Thoomu 32,530 Reputation points Microsoft External Staff Moderator

2024-08-05T01:10:55.3033333+00:00

@Tsu Kernik Following up to see if the above answer was helpful. If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

Share via

Azure Synapse CI/CD Pipeline - Override Apache Spark Pool Name in Notebooks

1 answer

Your answer