Azure Databricks | Scala on High Concurrency Cluster Mode

Lokesh 211 Reputation points
2022-07-13T05:40:30.657+00:00

We are using Scala notebooks with High Concurrency clusters.
From what we have read so far, Scala notebooks do not benefit from High Concurrency clusters. Is that true?

These clusters are supposed to be shared with various other teams.


.NET
Microsoft Technologies based on the .NET software framework.
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.

1 answer

  1. HimanshuSinha-msft 19,381 Reputation points Microsoft Employee
    2022-07-14T21:43:30.55+00:00

    Hello @Lokesh ,
    Thanks for the question and for using the MS Q&A platform.

    As we understand it, you are asking whether it is true that Scala notebooks do not benefit from High Concurrency clusters. Please let us know if that is not accurate.
    Your understanding is correct: Scala does not benefit from High Concurrency clusters. This is called out here: https://learn.microsoft.com/en-us/azure/databricks/clusters/configure#--high-concurrency-clusters

    The reason is that in High Concurrency mode the cluster has to run the workloads of different users, but Scala code is executed inside the Spark JVM (one per machine), which is shared between all users. Scala code can therefore access everything inside that JVM, including state that belongs to other users.
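
To make the shared-JVM point concrete, here is a minimal sketch in plain Scala. It is only an analogy and not Databricks internals: the object `SharedJvmDemo`, its two methods, and the property name `demo.userA.token` are all hypothetical, and the two "users" are simply two functions running in one JVM process.

```scala
// Hypothetical illustration: two "users" whose code runs in the same JVM.
// This is plain Scala, not a Databricks API.
object SharedJvmDemo {
  // Stand-in for code run by user A: stores a value in JVM-wide state.
  def userAWritesSecret(): Unit =
    System.setProperty("demo.userA.token", "s3cr3t")

  // Stand-in for code run by user B: reads the same JVM-wide state.
  // Returns None if the property has not been set in this JVM.
  def userBReadsSecret(): Option[String] =
    Option(System.getProperty("demo.userA.token"))

  def main(args: Array[String]): Unit = {
    userAWritesSecret()
    // Both calls ran in one JVM, so user B can see user A's value.
    println(userBReadsSecret())
  }
}
```

System properties are only one example of JVM-wide state. Static fields and singleton objects behave the same way, which is why code that runs inside one shared JVM cannot be isolated per user the way code running in separate processes can.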

    Please let me know if you have any queries.
    Thanks
    Himanshu

