Overview of SharePoint ingestion setup

Learn about the supported authentication methods for the managed SharePoint connector in Lakeflow Connect. The managed SharePoint connector supports ingestion of unstructured files (such as PDFs and DOCX) as well as structured formats (such as CSV, JSON, and Excel) into Delta tables. For custom pipelines using Spark and SQL APIs with full control over parsing and transformations, see Ingest files from SharePoint.

Important

This feature is in Beta. Workspace admins can control access to this feature from the Previews page. See Manage Azure Databricks previews.

Choose your SharePoint connector

Lakeflow Connect offers two SharePoint connectors. They both access data in SharePoint, but differ in their level of management.

Connector Description
Managed SharePoint connector A fully-managed connector. Simple, low-maintenance connector for enterprise applications that ingests data into Delta tables and keeps them in sync with the source.
Standard SharePoint connector Build custom ingestion pipelines with SQL, PySpark, or Lakeflow Spark Declarative Pipelines using batch and streaming APIs such as read_files, spark.read, COPY INTO, and Auto Loader. Offers the flexibility to perform complex transformations during ingestion, while giving you greater responsibility for managing and maintaining your pipelines.

Tip

Databricks recommends the managed SharePoint connector for most use cases.

Which authentication method should I choose?

The managed SharePoint connector supports the following authentication methods:

This method requires no Azure app registration. Databricks manages OAuth configuration and token refresh automatically. For complete setup steps, see Configure OAuth U2M: Databricks-managed for Microsoft SharePoint ingestion.