Different Version in cluster and workspace

Krishnamoorthy, Rajamannar 61 Reputation points
2021-10-28T01:21:43.54+00:00

I have installed NumPy version 1.20.0 on my cluster, but when I check it from the workspace it shows a different version, 1.19.5. I have attached both the cluster and workspace screenshots. Also, installing the packages through the cluster libraries takes more than 5 minutes, but the scheduled job starts before the packages are installed. Is there any way to speed up the package installation, or to start the job only after the packages are installed?

Cluster Version
144240-image.png

Workspace Version
144280-image.png

Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

Accepted answer
  1. PRADEEPCHEEKATLA 90,641 Reputation points Moderator
    2021-10-28T09:29:38.747+00:00

    Hello @Krishnamoorthy, Rajamannar ,

    Welcome to the Microsoft Q&A platform.

    Make sure to restart the cluster to reflect the new packages that you have installed.

    144532-image.png

    For the repro, I installed the numpy==1.20.0 package. The cluster comes with version 1.19.2 by default, and after installing 1.20.0 you need to restart the cluster for the newer package version to take effect.

    144470-image.png
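
To confirm which version a notebook actually sees after the restart, a small check can be run in a cell attached to the cluster. This is only a sketch: the helper name `installed_version` is made up for illustration, and it assumes the cluster runs Python 3.8 or later, where `importlib.metadata` is part of the standard library.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# Run in a notebook cell attached to the cluster, after the restart.
# If this still prints the old version, the new library has not taken effect.
print("numpy:", installed_version("numpy"))
```

Reading the version from package metadata avoids importing the package itself, so the check also works for packages that are slow or expensive to import.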

    Regarding the installation of packages in the cluster libraries taking more than 5 minutes: this is expected behaviour, and the time depends on the package you are trying to install.

    Hope this helps. Please let us know if you have any further queries.


1 additional answer

  1. Shalvin 161 Reputation points
    2021-10-30T23:29:45.437+00:00

    Hello @Krishnamoorthy, Rajamannar ,

    I have also seen similar behaviour, and the solution was to install the packages myself at the point where I need them. The steps below have always worked for me.

    1. Create a requirements.txt file with the package versions you want.
    2. Upload the file to DBFS (e.g. /FileStore/requirements.txt).
    3. Use one of the options below to install the requirements:

    i. Make the first code cell of your job notebook run the command %pip install -r /dbfs/FileStore/requirements.txt (I use this approach).
    ii. Use a cluster init script to run the command pip install -r /dbfs/FileStore/requirements.txt
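
Whichever option is used, the job can verify that the pinned versions are really the ones installed before doing any work. The following is a minimal sketch, not part of the original steps: `parse_pins` and `find_mismatches` are hypothetical helper names, only simple `name==version` lines are handled (environment markers, extras and other specifiers are ignored), and Python 3.8+ is assumed for `importlib.metadata`.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_pins(requirements_text):
    """Extract {name: version} from lines of the form 'name==version'."""
    pins = {}
    for raw_line in requirements_text.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and whitespace
        if "==" not in line:
            continue  # skip blank lines and anything that is not an exact pin
        name, pinned = line.split("==", 1)
        pins[name.strip().lower()] = pinned.strip()
    return pins


def find_mismatches(pins, lookup=version):
    """Return {name: (pinned, installed_or_None)} for every pin not satisfied."""
    mismatches = {}
    for name, pinned in pins.items():
        try:
            installed = lookup(name)
        except PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != pinned:
            mismatches[name] = (pinned, installed)
    return mismatches
```

In a notebook, the file contents could be read from /dbfs/FileStore/requirements.txt, passed through `parse_pins`, and the job stopped with an error if `find_mismatches` returns a non-empty dictionary. That way a job that starts before the libraries are ready fails fast instead of silently running against the default package versions.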

    Thanks,
    Shalvin

