Integrating HDInsight Zeppelin Notebook in Azure Data Factory

ClarissaJacquline 200 Reputation points
2024-02-09T04:59:56.8566667+00:00

What's the optimal approach for invoking a Zeppelin notebook from an HDInsight cluster within Azure Data Factory? I have a Zeppelin notebook on HDInsight, and I want to initiate it through Azure Data Factory. Additionally, what's the most efficient method for crafting Spark code and triggering it within Data Factory? I'm seeking guidance on seamlessly integrating these elements for effective data processing.

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,624 questions
{count} votes

3 answers

Sort by: Most helpful
  1. Richard Swinbank 527 Reputation points MVP
    2024-02-09T09:52:39.7233333+00:00

    You can call Databricks notebook using ADF's Databricks Notebook activity, but there is no equivalent activity for Zeppelin notebooks in HDInsight. I believe it is still not possible to invoke Zeppelin notebooks in ADF.

    You can call Spark scripts or JARs in HDInsight using ADF's Spark activity, in the HDInsight group in ADF Studio.

    0 comments No comments

  2. Richard Swinbank 527 Reputation points MVP
    2024-02-09T09:55:17.8066667+00:00

    (Removed duplicate post -- ignore this)

    0 comments No comments

  3. AnnuKumari-MSFT 34,556 Reputation points Microsoft Employee Moderator
    2024-02-13T07:09:44.23+00:00

    Hi ClarissaJacquline , Thankyou for using Microsoft Q&A platform and thanks for posting your query here.

    Unfortunately , there is no direct way to achieve this requirement as within HDinsights , there is no notebook activity.

    However, you can try converting your zeppline notebook code to python code and run it using databricks notebook.

    Kindly check, You might find this below documentation helpful: https://github.com/rdblue/jupyter-zeppelin

    Hope it helps . Kindly accept the answer by clicking on Accept answer button. Thankyou

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.