Cost Breakdown of Databricks by job or user

tms345 116 Reputation points
2021-03-25T21:50:16.723+00:00

Hi,

I'm new to Azure and Databricks. I have a cluster up and running. However, multiple users are currently using it.

In Azure cost analysis, it seems you can only track cost against the resource group. Is there a way to track Databricks cost at a job or user level when the cluster already exists?

If not, should every "job" create its own cluster (with custom tags) and delete it once done? Or are there better options?

Thank you!

Azure Cost Management
A Microsoft offering that enables tracking of cloud usage and expenditures for Azure and other cloud providers.
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.

Accepted answer
  1. PRADEEPCHEEKATLA-MSFT 76,836 Reputation points Microsoft Employee
    2021-03-26T06:07:07.697+00:00

    Hello @tms345 ,

    Welcome to the Microsoft Q&A platform.

    Yes, you are correct: you can only track cost against the resource group. Unfortunately, there is no way to track Databricks cost at a job or user level while the cluster is running.

    What is the difference between jobs compute and All-Purpose compute workloads?

    The Jobs Compute workload is defined as a job that both starts and terminates the cluster on which it runs. For example, a workload may be triggered by the Databricks job scheduler, which launches a new Apache Spark cluster solely for the job and automatically terminates the cluster after the job is complete.

    The All-Purpose Compute workload is any workload that is not an automated workload, for example, running a command within Databricks notebooks. These commands run on Apache Spark clusters which may persist until manually terminated. Multiple users can share a cluster to perform interactive analysis collaboratively.

    For more details, refer to the Azure Databricks pricing FAQ.

    To monitor cost and accurately attribute Azure Databricks usage to your organization’s business units and teams (for chargebacks, for example), you can tag workspaces (resource groups), clusters, and pools.
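As a rough illustration of tagging at the cluster level, the sketch below builds a job definition whose job cluster carries custom tags, so each run's compute can be attributed to a team. It assumes the Databricks Jobs API 2.1 payload shape (`tasks`, `new_cluster`, `custom_tags`). The helper `build_job_payload`, the tag keys, the notebook path, and the `spark_version` and `node_type_id` values are hypothetical placeholders, not values from this thread. It only constructs the payload and does not call any API.

```python
import json


def build_job_payload(job_name, notebook_path, team, cost_center):
    """Return a Jobs API 2.1-style payload whose job cluster carries cost tags.

    Hypothetical helper: names and values below are illustrative only.
    """
    return {
        "name": job_name,
        "tasks": [
            {
                "task_key": "main",
                "notebook_task": {"notebook_path": notebook_path},
                # A job cluster is created for the run and terminated afterwards.
                "new_cluster": {
                    "spark_version": "13.3.x-scala2.12",  # placeholder runtime
                    "node_type_id": "Standard_DS3_v2",    # placeholder VM size
                    "num_workers": 2,
                    # Tags set on the cluster, intended for filtering in cost reports.
                    "custom_tags": {
                        "team": team,
                        "cost_center": cost_center,
                        "job": job_name,
                    },
                },
            }
        ],
    }


payload = build_job_payload(
    job_name="nightly-etl",
    notebook_path="/Shared/etl/nightly",
    team="analytics",
    cost_center="cc-1234",
)
print(json.dumps(payload, indent=2))
```

A payload like this could be submitted when creating the job; a shared all-purpose cluster, by contrast, carries one set of tags for everyone who uses it, which is why per-user attribution is not available there.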

    Note: Once tags are applied, you can filter by tag in cost analysis to get the charges for a specific cluster.

    Reference: Monitor usage using cluster, pool, and workspace tags.
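If the cost data is exported for offline analysis, grouping by a tag could look roughly like the sketch below. This is a minimal example under stated assumptions: the rows are made-up sample data, and the column names `Tags` (a JSON string) and `Cost` are hypothetical. A real cost export may use different column names and a different tag encoding, so check the actual file before adapting this.

```python
import json
from collections import defaultdict


def cost_by_tag(rows, tag_key):
    """Sum the cost of each row under the value of one tag.

    Assumes each row has a 'Tags' JSON string and a 'Cost' numeric string
    (hypothetical column names). Rows without the tag go to 'untagged'.
    """
    totals = defaultdict(float)
    for row in rows:
        tags = json.loads(row["Tags"]) if row["Tags"] else {}
        totals[tags.get(tag_key, "untagged")] += float(row["Cost"])
    return dict(totals)


# Made-up sample rows standing in for an exported cost report.
rows = [
    {"Tags": '{"team": "analytics"}', "Cost": "12.50"},
    {"Tags": '{"team": "ml"}', "Cost": "30.00"},
    {"Tags": '{"team": "analytics"}', "Cost": "7.50"},
    {"Tags": "", "Cost": "4.00"},
]

totals = cost_by_tag(rows, "team")
for team, cost in sorted(totals.items()):
    print(f"{team}: {cost:.2f}")
```

Keeping an explicit `untagged` bucket makes it easy to spot spend from clusters that were created without tags.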

    Hope this helps. Do let us know if you have any further queries.


    2 people found this answer helpful.
