Calling an isolated Databricks cluster from Synapse or Azure Data Factory

alexander grajales vanegas 6 Reputation points
2024-08-20T18:06:27.27+00:00

Screenshot 2024-08-20 122534

How can I create a job in Databricks from Synapse or Azure Data Factory with the cluster set to isolated? I cannot find any option that lets me pass this value as a parameter, and without it I have no access to my Unity Catalog in Databricks.

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,858 questions
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
10,566 questions

1 answer

  1. Vinodh247 17,941 Reputation points
    2024-08-22T00:27:35.83+00:00

    Can you confirm if you have tried the following:

    1. Create a Linked Service:
      • Click on + New and choose Azure Databricks as the compute option.
      • Fill in the required fields, including the Databricks workspace, and select New job cluster to ensure isolation. Each run then gets its own cluster, so jobs run in isolation without affecting shared clusters.
    2. Configure the Databricks Notebook Activity:
      • Within your pipeline, add the Databricks Notebook Activity.
      • In the properties pane, you can specify parameters to be passed to the Databricks notebook. For example, you can define a parameter named input and set its value using an expression like @pipeline().parameters.name to pass values dynamically from ADF to Databricks.
    3. Trigger and Monitor the Pipeline:
    4. Access Unity Catalog:
      • Ensure that the Databricks workspace is configured to allow access to Unity Catalog. This may require granting the appropriate permissions and roles within Databricks so that your isolated jobs can reach the catalog.
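
Steps 1 and 2 above can also be expressed as JSON definitions rather than clicks in the UI. The following is a minimal sketch in Python that only builds the two definitions as dictionaries; it does not deploy anything. The workspace URL, runtime version, node type, and the names `AzureDatabricksJobCluster` and `RunNotebook` are placeholder assumptions, and authentication settings are deliberately left out.

```python
import json


def databricks_linked_service(workspace_url: str, spark_version: str, node_type: str) -> dict:
    """Build an ADF linked service definition that uses a new job cluster."""
    return {
        "name": "AzureDatabricksJobCluster",  # hypothetical name
        "properties": {
            "type": "AzureDatabricks",
            "typeProperties": {
                "domain": workspace_url,
                # Authentication (access token, managed identity, ...) is omitted here.
                "newClusterVersion": spark_version,
                "newClusterNodeType": node_type,
                "newClusterNumOfWorker": "1",
            },
        },
    }


def notebook_activity(linked_service_name: str, notebook_path: str) -> dict:
    """Build a Databricks Notebook activity that forwards a pipeline parameter."""
    return {
        "name": "RunNotebook",  # hypothetical name
        "type": "DatabricksNotebook",
        "linkedServiceName": {
            "referenceName": linked_service_name,
            "type": "LinkedServiceReference",
        },
        "typeProperties": {
            "notebookPath": notebook_path,
            "baseParameters": {
                # The notebook receives this value under the name "input".
                "input": {
                    "value": "@pipeline().parameters.name",
                    "type": "Expression",
                },
            },
        },
    }


if __name__ == "__main__":
    service = databricks_linked_service(
        "https://adb-0000000000000000.0.azuredatabricks.net",  # placeholder URL
        "14.3.x-scala2.12",
        "Standard_DS3_v2",
    )
    activity = notebook_activity(service["name"], "/Shared/example_notebook")
    print(json.dumps({"linkedService": service, "activity": activity}, indent=2))
```

Inside the notebook, the value passed as `input` is typically read with `dbutils.widgets.get("input")`.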

    Note:

    • Cluster isolation is generally managed within Databricks. It is part of the cluster configuration rather than a parameter passed at runtime from ADF or Synapse.
    • If you cannot pass the isolation setting as a parameter from ADF or Synapse, make sure the job in Databricks is always associated with an isolated cluster that has access to Unity Catalog.
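
One way to act on the note above is to define the job on the Databricks side, with the access mode fixed in its cluster specification, and have the pipeline only trigger it. The sketch below builds request bodies in the shape used by the Databricks Jobs API 2.1 (`jobs/create` and `jobs/run-now`); it makes no network calls. It assumes that the "isolated" setting in the question corresponds to the cluster field `data_security_mode`, and uses `SINGLE_USER` purely as an illustration; the job name, notebook path, user, and job ID are hypothetical.

```python
import json


def job_definition(job_name: str, notebook_path: str, user_name: str,
                   spark_version: str, node_type: str) -> dict:
    """Body for jobs/create: a notebook task on a job cluster with a fixed access mode."""
    return {
        "name": job_name,
        "tasks": [
            {
                "task_key": "main",
                "notebook_task": {
                    "notebook_path": notebook_path,
                    # Default value; can be overridden per run.
                    "base_parameters": {"input": ""},
                },
                "new_cluster": {
                    "spark_version": spark_version,
                    "node_type_id": node_type,
                    "num_workers": 1,
                    # Assumption: the access mode lives here, not in a runtime parameter.
                    "data_security_mode": "SINGLE_USER",
                    "single_user_name": user_name,
                },
            }
        ],
    }


def run_now_request(job_id: int, input_value: str) -> dict:
    """Body for jobs/run-now: only the notebook parameters vary per run."""
    return {"job_id": job_id, "notebook_params": {"input": input_value}}


if __name__ == "__main__":
    job = job_definition(
        "uc-notebook-job",             # hypothetical job name
        "/Shared/example_notebook",    # hypothetical notebook path
        "someone@example.com",         # hypothetical user
        "14.3.x-scala2.12",
        "Standard_DS3_v2",
    )
    print(json.dumps(job, indent=2))
    print(json.dumps(run_now_request(123, "value-from-pipeline"), indent=2))
```

With this split, the pipeline sends only the `run-now` body, for example from a Web activity, while the cluster settings stay in the job definition. Which access mode your workspace needs for Unity Catalog should be checked against your own configuration.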
