Salesforce ingestion connector

Databricks Lakeflow Connect provides a built-in connector for ingesting data directly from the Salesforce Platform into Azure Databricks. Data teams can easily build efficient, incremental pipelines at scale, and businesses can derive rich insights by unifying all their data and AI assets on the Data Intelligence Platform.

An organization might want to use Salesforce data to predict customer churn. The following video demonstrates how a retailer can do this by ingesting their customer order data, analyzing it, and then combining it with customer interactions across other channels for a holistic customer view.

Feature availability

Feature Availability
UI-based pipeline authoring Green check icon Supported
API-based pipeline authoring Yes
Declarative Automation Bundles Yes
Incremental ingestion Yes - By default, formula fields require full snapshots. To enable incremental ingestion for formula fields, see Ingest Salesforce formula fields incrementally.
Unity Catalog governance Yes
Orchestration using Databricks Workflows Yes
SCD type 2 Yes
API-based column selection and deselection Yes
API-based row filtering Yes
Automated schema evolution: New and deleted columns Yes
Automated schema evolution: Data type changes No
Automated schema evolution: Column renames Yes - Treated as a new column (new name) and deleted column (old name).
Automated schema evolution: New tables N/A
Maximum number of tables per pipeline 250

Authentication methods

Authentication method Availability
OAuth U2M Green check icon Supported
OAuth M2M Red X icon Not supported
OAuth (manual refresh token) Red X icon Not supported
Basic authentication (username/password) Red X icon Not supported
Basic authentication (API key) Red X icon Not supported
Basic authentication (service account JSON key) Red X icon Not supported

What to know before you start

Topic Why it matters
Azure Databricks user persona The workflow depends on your Azure Databricks user persona:
  • Single-user: An admin user creates a Unity Catalog connection and an ingestion pipeline.
  • Multi-user: An admin user creates a connection for non-admin users to create pipelines with.
Authentication method The steps to create a connection depend on the authentication method you choose.
Interface The steps to create a pipeline depend on the interface.
Ingestion frequency The pipeline schedule depends on your latency and cost requirements.
Common patterns Depending on your ingestion needs, the pipeline might use configurations like history tracking, column selection, and row filtering. Supported configurations vary by connector. See Feature availability.

Start ingesting from Salesforce

The following table provides an overview of the end-to-end Salesforce ingestion flow, based on user type:

User Steps
Admin Either:
  • Use Catalog Explorer to create a connection to Salesforce so that non-admins can create pipelines. See Salesforce.
  • Use the data ingestion UI to create a connection and a pipeline at the same time. See Ingest data from Salesforce.
Non-admin Use any supported interface to create a pipeline from an existing connection. See Ingest data from Salesforce.