Use Apache Spark in Microsoft Fabric

Intermediate
Data Analyst
Data Engineer
Microsoft Fabric

Apache Spark is a core technology for large-scale data analytics. Microsoft Fabric provides support for Spark clusters, enabling you to analyze and process data in a Lakehouse at scale.

Learning objectives

In this module, you'll learn how to:

  • Configure Spark in a Microsoft Fabric workspace
  • Identify suitable scenarios for Spark notebooks and Spark jobs
  • Use Spark to connect to data sources and ingest data
  • Use Spark dataframes to analyze and transform data
  • Use Spark SQL to query data in tables and views
  • Visualize data in a Spark notebook

Prerequisites

Before starting this module, you should be familiar with the Microsoft Fabric interface and core concepts.