Implement a data lakehouse analytics solution with Azure Databricks

6 hr 37 min
Learning Path
6 Modules

At a glance

Level

Intermediate
Skill

 
Product

Azure Databricks
Role

Data Engineer
Subject

Data engineering

Learn how to harness the power of Apache Spark and powerful clusters running on the Azure Databricks platform to run large data engineering workloads in the cloud.

Prerequisites

None

Start

Modules in this learning path

Explore Azure Databricks

48 min
Module
8 Units

Azure Databricks is a cloud service that provides a scalable platform for data analytics using Apache Spark.

Start

Introduction 2 min
Get started with Azure Databricks 3 min
Identify Azure Databricks workloads 3 min
Understand key concepts 3 min
Data governance using Unity Catalog and Microsoft Purview 3 min
Exercise - Explore Azure Databricks 30 min
Knowledge check 3 min
Summary 1 min

Perform data analysis with Azure Databricks

54 min
Module
7 Units

Learn how to perform data analysis using Azure Databricks. Explore various data ingestion methods and how to integrate data from sources like Azure Data Lake and Azure SQL Database. This module guides you through using collaborative notebooks to perform exploratory data analysis (EDA), so you can visualize, manipulate, and examine data to uncover patterns, anomalies, and correlations.

Use Apache Spark in Azure Databricks

1 hr 14 min
Module
9 Units

Azure Databricks is built on Apache Spark and enables data engineers and analysts to run Spark jobs to transform, analyze and visualize data at scale.

Manage data with Delta Lake

1 hr 8 min
Module
9 Units

Delta Lake is a data management solution in Azure Databricks providing features including ACID transactions, schema enforcement, and time travel ensuring data consistency, integrity, and versioning capabilities.

Build data pipelines with Delta Live Tables

1 hr 24 min
Module
7 Units

Building data pipelines with Delta Live Tables enables real-time, scalable, and reliable data processing using Delta Lake's advanced features in Azure Databricks

Deploy workloads with Azure Databricks Workflows

1 hr 9 min
Module
8 Units

Deploying workloads with Azure Databricks Workflows involves orchestrating and automating complex data processing pipelines, machine learning workflows, and analytics tasks. In this module, you learn how to deploy workloads with Databricks Workflows.

The future is yours

Implement a data lakehouse analytics solution with Azure Databricks

At a glance

Prerequisites

Modules in this learning path