Read Parquet files using Azure Databricks

This article shows you how to read data from Apache Parquet files using Azure Databricks.

What is Parquet?

Apache Parquet is a columnar file format with optimizations that speed up queries. It's a more efficient file format than CSV or JSON.

For more information, see Parquet Files.

Options

Use the .option() and .options() methods of DataFrameReader and DataFrameWriter to configure Parquet data sources. For a complete list of supported options, see DataFrameReader Parquet options and DataFrameWriter Parquet options.

Notebook example: Read and write to Parquet files

The following notebook shows how to read and write data to Parquet files.

Reading Parquet files notebook

Get notebook