How do I install ydata-profiling on an Azure Synapse spark cluster?

didier caron 20 Reputation points
2023-02-25T08:46:10.7466667+00:00

I am trying to use the ydata-profiling package from PyPI, which requires Spark 3.3. When I run %pip install ydata-profiling [notebook,pyspark] I get an "interpreter died" error. The install steps are described here: https://ydata-profiling.ydata.ai/docs/master/pages/getting_started/installation.html

Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.

Accepted answer
  1. PRADEEPCHEEKATLA 90,641 Reputation points Moderator
    2023-03-08T07:59:37.48+00:00

    @didier caron

    Thanks for the question and for using the Microsoft Q&A platform.

    While reproducing this, I experienced the same issue when I used %pip, and I was able to resolve it by using plain pip instead.
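As an illustration of installing without the %pip magic, here is a minimal sketch that calls pip through the notebook's own Python interpreter. The helper names `build_pip_command` and `install` are hypothetical, and whether this behaves the same as the plain pip command used in the answer on a given Synapse pool is an assumption. The extras are attached directly to the package name so that the whole requirement is passed to pip as a single argument.

```python
import subprocess
import sys


def build_pip_command(package, extras=()):
    """Build a `python -m pip install` command for the current interpreter.

    Extras are joined directly onto the package name, for example
    ydata-profiling[notebook,pyspark], so pip receives one requirement.
    """
    requirement = package
    if extras:
        requirement += "[" + ",".join(extras) + "]"
    return [sys.executable, "-m", "pip", "install", requirement]


def install(package, extras=()):
    """Run the install and return pip's exit code (0 means success)."""
    command = build_pip_command(package, extras)
    return subprocess.run(command, check=False).returncode


# Hypothetical usage in a notebook cell (left commented out so that
# nothing is installed just by defining the helpers):
# install("ydata-profiling", extras=("notebook", "pyspark"))
```

A package installed this way is scoped to the current session, so the cell would need to run again in each new session.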


    Hope this helps. Do let us know if you have any further queries.


    If this answers your query, please click Accept Answer and Yes for "Was this answer helpful". If you have any further queries, do let us know.


1 additional answer

  1. Vinodh247 34,661 Reputation points MVP Volunteer Moderator
    2023-02-26T11:12:12.38+00:00

    Hi

    Thanks for reaching out to Microsoft Q&A.

    Please go through the links below, which are close to what you have asked for. I think one of them should help.

    https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-manage-pool-packages

    https://learn.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-azure-portal-add-libraries

    https://techcommunity.microsoft.com/t5/analytics-on-azure-blog/install-third-party-whl-packages-within-apache-spark-cluster/ba-p/3667135
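Since the question states that ydata-profiling requires Spark 3.3, it may be worth confirming the pool's Spark version before installing at either the session or the pool level. The following is a minimal sketch; the helper names `spark_major_minor` and `meets_minimum` are hypothetical, and the `spark` session variable mentioned in the comment is assumed to be the one a Synapse notebook normally provides.

```python
import re


def spark_major_minor(version):
    """Extract (major, minor) from a Spark version string such as '3.3.1'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"Unrecognized Spark version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version, minimum=(3, 3)):
    """Return True if the version is at least the given (major, minor)."""
    return spark_major_minor(version) >= minimum


# Hypothetical usage in a notebook cell, assuming a session named `spark`:
# print(meets_minimum(spark.version))
```

If the check returns False, the pool's Spark runtime version would need to change before the package can be expected to work.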

    Please upvote and accept this as the answer if the reply was helpful. This will help other community members.

