Ingest data into your Warehouse using data pipelines

Applies to: Warehouse in Microsoft Fabric

Data pipelines offer a graphical alternative to running the COPY command yourself. A data pipeline is a logical grouping of activities that together perform a data ingestion task. Pipelines let you manage extract, transform, and load (ETL) activities as a group instead of managing each one individually.
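For comparison, here is a minimal sketch of the manual alternative that a pipeline abstracts away. It assumes a destination table named [dbo].[bing_covid_19] and a placeholder source URL; neither is part of this tutorial's assistant flow, which generates the equivalent command for you.

```sql
-- A minimal manual load with the T-SQL COPY statement (illustrative only).
-- The table name and source URL below are placeholders, not tutorial values.
COPY INTO [dbo].[bing_covid_19]
FROM 'https://<storage-account>.blob.core.windows.net/<container>/bing_covid-19_data.csv'
WITH (
    FILE_TYPE = 'CSV',
    FIRSTROW = 2  -- skip the header row
);
```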

In this tutorial, you'll create a new pipeline that loads sample data into a Warehouse in Microsoft Fabric.

Note

Some features from Azure Data Factory aren't available in Microsoft Fabric, but the concepts are interchangeable. To learn more about Azure Data Factory and pipelines, see Pipelines and activities in Azure Data Factory and Azure Synapse Analytics. For a quickstart, see Quickstart: Create your first pipeline to copy data.

Create a data pipeline

  1. To create a new pipeline, navigate to your workspace, select the +New button, and then select Data pipeline.

    Screenshot of the top section of the user's workspace showing the New button, and with the options Warehouse, Data pipeline, and Show All.

  2. In the New pipeline dialog, provide a name for your new pipeline and select Create.

  3. You'll land in the pipeline canvas area, where you see three options to get started: Add a pipeline activity, Copy data, and Choose a task to start.

    Screenshot showing the three options to select for starting ingestion.

    Each of these options offers different alternatives to create a pipeline:

    • Add a pipeline activity: this option launches the pipeline editor, where you can build pipelines from scratch by using pipeline activities.
    • Copy data: this option launches a step-by-step assistant that helps you select a data source and a destination, and configure data load options such as column mappings. On completion, it creates a new pipeline activity with a Copy Data task already configured for you.
    • Choose a task to start: this option launches a set of predefined templates to help get you started with pipelines based on different scenarios.

    Pick the Copy data option to launch the Copy assistant.

  4. The first page of the Copy data assistant helps you pick your own data from various data sources, or choose one of the provided samples to get started. For this tutorial, we'll use the COVID-19 Data Lake sample. Select it, and then select Next.

    Screenshot showing choices to use sample data or other data sources.

  5. On the next page, you can select a dataset, choose the source file format, and preview the selected dataset. Select Bing COVID-19 and the CSV file format, and then select Next.

    Screenshot showing different dataset options for the COVID-19 sample, file formats, and a grid showing a preview of the data.

  6. The next page, Data destinations, lets you configure the destination type. We'll load data into a warehouse in our workspace, so select the Warehouse tab and the Data Warehouse option, and then select Next.

    Screenshot showing different destination options.

  7. Now it's time to pick the warehouse to load data into. Select your warehouse from the dropdown list, and then select Next.

    Screenshot showing a dropdown list with a warehouse selected.

  8. The last step to configure the destination is to provide a name for the destination table and configure the column mappings. Here you can choose to load the data into a new table or an existing one, specify the schema and table names, rename columns, remove columns, or change their mappings. You can accept the defaults, or adjust the settings to your preference; if you plan to target an existing table, see the sketch after this step.

    Screenshot showing the options to load data to an existing table or to create a new one.

    When you're done reviewing the options, select Next.
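    If you choose to load into an existing table, you would create it ahead of time. A minimal sketch follows; the column names and types are illustrative guesses at a subset of the sample's schema, not its exact definition.

    ```sql
    -- Hypothetical destination table for the "load to existing table" path.
    -- Column names and types are illustrative, not the sample's exact schema.
    CREATE TABLE [dbo].[bing_covid_19]
    (
        [id]             INT,
        [updated]        DATE,
        [confirmed]      INT,
        [deaths]         INT,
        [country_region] VARCHAR(256)
    );
    ```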

  9. The next page gives you the option to use staging, or to set advanced options for the data copy operation (which uses the T-SQL COPY command, as sketched below). Review the options without changing them, and then select Next.
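    For reference, the advanced options on this page surface as WITH-clause options on the generated COPY statement. The sketch below shows how a few common CSV settings might map; the values and source URL are illustrative, and the assistant builds the real command for you.

    ```sql
    -- Illustrative mapping of the assistant's CSV options to COPY options.
    -- Values and source URL are placeholders, not tutorial values.
    COPY INTO [dbo].[bing_covid_19]
    FROM 'https://<storage-account>.blob.core.windows.net/<container>/bing_covid-19_data.csv'
    WITH (
        FILE_TYPE = 'CSV',
        FIELDTERMINATOR = ',',   -- column delimiter
        ROWTERMINATOR = '0x0A',  -- row delimiter (line feed)
        FIRSTROW = 2             -- skip the header row
    );
    ```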

  10. The last page in the assistant offers a summary of the copy activity. Select the option Start data transfer immediately and select Save + Run.

    Screenshot showing the option to start the data transfer operation immediately, and the buttons Back and Save + Run.

  11. You are directed to the pipeline canvas area, where a new Copy Data activity is already configured for you. The pipeline starts to run automatically. You can monitor the status of your pipeline in the Output pane:

    Screenshot showing the pipeline canvas with a Copy activity in the center, and the pipeline execution status showing the current status In progress.

  12. After a few seconds, your pipeline finishes successfully. Navigate back to your warehouse and select your table to preview the data and confirm that the copy operation completed.

    Screenshot showing a warehouse with the bing_covid_19 table selected, and a grid showing a preview of the data in the table.
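    You can also verify the load from the SQL query editor. The quick checks below assume the default bing_covid_19 table name suggested by the Copy assistant.

    ```sql
    -- Sanity checks after the pipeline run: total rows and a sample of rows.
    SELECT COUNT(*) AS row_count
    FROM [dbo].[bing_covid_19];

    SELECT TOP 10 *
    FROM [dbo].[bing_covid_19];
    ```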

For more on data ingestion into your Warehouse in Microsoft Fabric, visit:

Next step