Perform data engineering with Azure Synapse Apache Spark Pools

Intermediate
Data Engineer
Azure Synapse Analytics

Apache Spark is a highly scalable distributed processing engine for big data analytics and transformation. In Azure Synapse Analytics, you run Spark workloads on Spark pools.

Prerequisites

Before starting this learning path, you should be familiar with Azure Synapse Analytics. Consider completing the Introduction to Azure Synapse Analytics module first.

Modules in this learning path

Apache Spark is a core technology for large-scale data analytics. Learn how to use Spark in Azure Synapse Analytics to analyze and visualize data in a data lake.

Data engineers commonly need to transform large volumes of data. Apache Spark pools in Azure Synapse Analytics provide a distributed processing platform that they can use to accomplish this goal.

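Delta Lake is an open-source storage layer for Spark that adds relational table semantics, such as transactional updates and schema enforcement, to files in a data lake. You can use it to implement a data lakehouse architecture in Azure Synapse Analytics.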