
Use Apache Spark in Microsoft Fabric

Level: Intermediate
Roles: Data Analyst, Data Engineer
Product: Microsoft Fabric

Apache Spark is a core technology for large-scale data analytics. Microsoft Fabric supports Spark clusters, so you can analyze and process data in a lakehouse at scale.

Learning objectives

In this module, you'll learn how to:

  • Configure Spark in a Microsoft Fabric workspace

  • Identify suitable scenarios for Spark notebooks and Spark jobs

  • Use Spark dataframes to analyze and transform data

  • Use Spark SQL to query data in tables and views

  • Visualize data in a Spark notebook


Prerequisites

Before starting this module, you should be familiar with the Microsoft Fabric interface and core concepts.

Module assessment

Assess your understanding of this module. Sign in and answer all questions correctly to earn a pass designation on your profile.
