Monitor pipelines

Monitor Lakeflow Spark Declarative Pipelines using the built-in pipeline UI, the event log, query history, and custom event hooks. These features track update progress, data quality, lineage, and streaming metrics.

Topic Description
Monitor using the UI Observe the progress and status of pipeline updates, and alert on the success or failure. View metrics for streaming sources, like Apache Kafka and Auto Loader.
Event log Extract detailed information on pipeline updates such as data lineage, data quality metrics, and resource usage using the pipeline event log.
Additionally, see the schema for the event log.
Query history Inspect and diagnose query performance by looking at the query history.
Custom monitoring Define custom actions to take when specific events occur using event hooks.

Additionally, there are troubleshooting topics for specific scenarios.

Topic Description
Recover a pipeline from streaming checkpoint failure Recover a pipeline that has an invalid or corrupted streaming checkpoint.
Fix high initialization times in pipelines Fix high initialization times for a pipeline by splitting and load balancing flows across pipelines.