Databricks monitoring using Azure Monitor

Rohit 61 Reputation points
2020-08-04T09:24:26.763+00:00

Hi Team,

I want to monitor azure datababricks metrics and other info like quota, cluster capacity, no of nodes and I wanna put all this information to azure dashboard. How to put the databricks logs to azure monitor without grafana..

Thanks & @Regards
Rohit Boddu.

Azure Monitor
Azure Monitor
An Azure service that is used to collect, analyze, and act on telemetry data from Azure and on-premises environments.
2,963 questions
Azure Databricks
Azure Databricks
An Apache Spark-based analytics platform optimized for Azure.
2,043 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. PRADEEPCHEEKATLA-MSFT 84,051 Reputation points Microsoft Employee
    2020-08-10T06:48:10.347+00:00

    Hello @Rohit , @Ayyappan, Jayarajkumar ,

    In additional to @Leon Laude response.

    You can find a Guide on Monitoring Azure Databricks on the Azure Architecture Center, explaining the concepts used in this article - Monitoring And Logging In Azure Databricks With Azure Log Analytics And Grafana.

    To provide full data collection, we combine the Spark monitoring library with a custom log4j.properties configuration. The build of the monitoring library for Spark 2.4 and the installation in Databricks is automated through the scripts referenced in the tutorial and available at https://github.com/algattik/databricks-monitoring-tutorial/.

    Azure Databricks quota limitation found at Subscription level.

    Select your subscription => Under settings => Usage + Quotas.

    16530-image.png

    Can we get the utilization % of our nodes at different point of time?

    You can use Ganglia metrics to get utilization % for nodes at different point of time.

    To access the Ganglia UI, navigate to the Metrics tab on the cluster details page. CPU metrics are available in the Ganglia UI for all Databricks runtimes. GPU metrics are available for GPU-enabled clusters running Databricks Runtime 4.1 and above.

    To view live metrics, click the Ganglia UI link.

    16683-image.png

    To view historical metrics, click a snapshot file. The snapshot contains aggregated metrics for the hour preceding the selected time.

    16570-image.png

    Cost of each jobs or databricks units in Azure monitor.

    You cannot get cost for each job in Azure Databricks.

    Azure Databricks cost can be found at Subscription level.

    Select your subscription => Cost Analysis => Resource type = "microsoft.databricks/workspaces".

    16596-image.png

    Hope this helps. Do let us know if you any further queries.

    ----------------------------------------------------------------------------------------

    Do click on "Accept Answer" and Upvote on the post that helps you, this can be beneficial to other community members.

    3 people found this answer helpful.

  2. Leon Laude 85,716 Reputation points
    2020-08-04T09:37:03.773+00:00

    Hi,

    Something here might help:

    Monitoring Azure Databricks
    https://learn.microsoft.com/en-us/azure/architecture/databricks-monitoring

    Send Azure Databricks application logs to Azure Monitor
    https://learn.microsoft.com/en-us/azure/architecture/databricks-monitoring/application-logs

    For all the supported Azure Monitor metrics, see the list here:
    https://learn.microsoft.com/en-us/azure/azure-monitor/platform/metrics-supported

    ----------

    (If the reply was helpful please don't forget to accept as answer, thank you)

    Best regards,
    Leon

    1 person found this answer helpful.