Data engineering with Azure Databricks

Intermediate
Data Engineer
Databricks

Learn how to harness the power of Apache Spark and powerful clusters running on the Azure Databricks platform to run large data engineering workloads in the cloud.

Prerequisites

None

Modules in this learning path

Azure Databricks is a cloud service that provides a scalable platform for data analytics using Apache Spark.

Azure Databricks is built on Apache Spark and enables data engineers and analysts to run Spark jobs to transform, analyze and visualize data at scale.

Delta Lake is an open source relational storage area for Spark that you can use to implement a data lakehouse architecture in Azure Databricks.

Azure Databricks provides SQL Warehouses that enable data analysts to work with data using familiar relational SQL queries.

Using pipelines in Azure Data Factory to run notebooks in Azure Databricks enables you to automate data engineering processes at cloud scale.