Data sources

This section describes the Apache Spark data sources you can use in Azure Databricks. Many include a notebook that demonstrates how to use the data source to read and write data.

The following data sources are either directly supported in Databricks Runtime or require simple shell commands to enable access:

In addition, Azure Databricks supports Delta Lake and makes it easy to create Delta tables from multiple data formats.

For more information about Apache Spark data sources, see Generic Load/Save Functions and Generic File Source Options.
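As a rough illustration of the generic load/save pattern those pages cover, the sketch below reads a file-based source with `spark.read.format(...).load(...)` and writes it back out with `df.write.format(...).save(...)`. The helper name `copy_dataset`, its parameters, and the default formats are assumptions made for this example; `spark` is assumed to be the SparkSession that a notebook already provides.

```python
def copy_dataset(spark, src_path, dst_path, src_format="json", dst_format="parquet"):
    """Read src_path in src_format and write it to dst_path in dst_format.

    Hypothetical helper for illustration; `spark` is an existing SparkSession.
    """
    # Generic load: the format name selects which data source reads the files.
    df = spark.read.format(src_format).load(src_path)

    # Generic save: "overwrite" replaces any existing output at dst_path.
    df.write.format(dst_format).mode("overwrite").save(dst_path)
    return df
```

In a notebook this might be called as `copy_dataset(spark, "/tmp/in", "/tmp/out")`, where both paths are placeholders. Source-specific settings can be passed with `.option(...)` before `load` or `save`.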

To learn how to access metadata for file-based data sources, see File metadata column.
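A minimal sketch of what reading that metadata can look like, assuming a runtime version that exposes the hidden `_metadata` column for file-based sources. The column is not returned by `select("*")` alone, so it is requested explicitly. The helper name `with_file_metadata` and its parameters are hypothetical, and `spark` is assumed to be an existing SparkSession.

```python
def with_file_metadata(spark, path, fmt="csv"):
    """Load files at `path` and append the per-file `_metadata` struct column.

    Hypothetical helper for illustration; `spark` is an existing SparkSession.
    """
    df = spark.read.format(fmt).load(path)

    # `_metadata` is hidden by default, so it must be named in the select.
    return df.select("*", "_metadata")
```

Individual fields of the struct, such as `_metadata.file_path`, can be selected the same way once the column is available.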

The following storage data sources require you to configure the connection to storage. Some also require that you create an Azure Databricks library and install it on a cluster: