Azure Databricks - Usage dashboard for showing the costs by person

Moira Fascioli ADM 0 Reputation points
2025-05-26T13:54:38.0866667+00:00

Hi all, I am using the USAGE DASHBOARD in Databricks in order to show the costs by person. We have Databricks on Azure.

I am using a report within the Usage Dashboard in which I can deep dive the costs by "RUN AS", I have a very high quote for "NULL". I see very low costs tied to the persons. Can you help me to understand?

Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,495 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Venkat Reddy Navari 2,555 Reputation points Microsoft External Staff Moderator
    2025-05-26T15:00:53.2166667+00:00

    Hi @Moira Fascioli ADM
    Pritam Kabiraj explanation is on the right track "RUN AS: NULL" typically appears when Databricks jobs or cluster activities are initiated without a specific user context. This can happen in scenarios such as:"

    • Jobs or pipelines triggered by service principals
    • Scheduled or automated tasks (like those started via APIs or orchestration tools)
    • Shared or legacy clusters where individual user attribution isn’t configured

    This is likely why you’re seeing high costs under "NULL" and relatively low costs under named users — much of the compute usage may be driven by non-interactive, system-level processes.

    To improve visibility and cost attribution, here are some steps you can take:

    1. Review job configurations – Check whether jobs are running using a service principal or system identity, and whether Run As user is explicitly set.
    2. Use tagging – Apply cluster and job-level tags (e.g., owner, department, team) to help attribute costs correctly.
    3. Enforce cluster policies – You can use cluster policies to require tagging or assign default user context.
    4. Audit shared cluster usage – Consider moving from shared interactive clusters to job clusters or single-user clusters to improve tracking.
    5. Check audit or diagnostic logs – Azure Diagnostic Logs or Databricks audit logs can help trace the source of usage that's appearing as "NULL".

    I hope this information helps. Please do let us know if you have any further queries.

    Kindly consider upvoting the comment if the information provided is helpful. This can assist other community members in resolving similar issues.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.