Ingest data into your Warehouse using the COPY statement

Applies to: ✅ Warehouse in Microsoft Fabric

The COPY statement is the primary way to ingest data into Warehouse tables. COPY performs high-throughput data ingestion from an external Azure storage account, with options to configure the source file format, specify a location to store rejected rows, skip header rows, and more.

This tutorial shows data ingestion examples for a Warehouse table using the T-SQL COPY statement. It uses the NYC Taxi & Limousine Commission yellow taxi trip records sample data from Azure Open Datasets. For details about this data, including its schema and usage rights, see NYC Taxi & Limousine Commission - yellow taxi trip records.

Note

Warehouse also supports the BULK INSERT statement for data ingestion. COPY INTO is the recommended statement for new ingestion code, while BULK INSERT lets you reuse code that you already use in SQL Server or Azure SQL Database.

To learn more about the T-SQL COPY statement including more examples and the full syntax, see COPY (Transact-SQL).
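The options mentioned above (source file format, rejected-row location, header skipping) map to arguments in the WITH clause. The following is a minimal sketch for a CSV source, not part of this tutorial's steps. The table name dbo.SampleCsvTable, the storage account, the container, and the folder paths are hypothetical placeholders that you would replace with your own values.

```sql
-- Hypothetical example: dbo.SampleCsvTable and every <placeholder> in the URLs
-- are assumptions for illustration and do not exist in this tutorial.
COPY INTO dbo.SampleCsvTable
FROM 'https://<storage-account>.blob.core.windows.net/<container>/<folder>/*.csv'
WITH (
    FILE_TYPE = 'CSV',          -- source file format
    FIRSTROW = 2,               -- start at row 2, skipping one header row
    FIELDTERMINATOR = ',',      -- column delimiter
    ROWTERMINATOR = '0x0A',     -- line feed as row delimiter
    MAXERRORS = 10,             -- tolerate up to 10 rejected rows before failing
    ERRORFILE = 'https://<storage-account>.blob.core.windows.net/<container>/rejected/'  -- where rejected rows are written
);
```

If the storage account is not publicly accessible, the statement also needs credential arguments. See COPY (Transact-SQL) for the supported authentication options.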

Create a table

Before you use the COPY statement, you must create the destination table. To create the destination table for this sample, use the following steps:

In your Microsoft Fabric workspace, find and open your warehouse.
Switch to the Home tab and select New SQL query.

To create the table used as the destination in this tutorial, run the following code:

    CREATE TABLE dbo.TaxiTrips
    (
        doLocationId            varchar(MAX)      NULL,
        endLat                  float             NULL,
        endLon                  float             NULL,
        extra                   float             NULL,
        fareAmount              float             NULL,
        improvementSurcharge    varchar(MAX)      NULL,
        mtaTax                  float             NULL,
        passengerCount          int               NULL,
        paymentType             varchar(MAX)      NULL,
        puLocationId            varchar(MAX)      NULL,
        puMonth                 int               NULL,
        puYear                  int               NULL,
        rateCodeId              int               NULL,
        startLat                float             NULL,
        startLon                float             NULL,
        storeAndFwdFlag         varchar(1)        NULL,
        tipAmount               float             NULL,
        tollsAmount             float             NULL,
        totalAmount             float             NULL,
        tpepDropoffDateTime     datetime2(6)      NULL,
        tpepPickupDateTime      datetime2(6)      NULL,
        tripDistance            float             NULL,
        vendorId_str            varchar(MAX)      NULL,
        vendorId_lpep           int               NULL
    );
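To confirm that the table was created with the expected columns before loading data, one option is to query the INFORMATION_SCHEMA views. This is an optional check, sketched here under the assumption that the table was created in the dbo schema exactly as shown above.

```sql
-- List the columns of dbo.TaxiTrips in definition order.
SELECT COLUMN_NAME, DATA_TYPE, IS_NULLABLE
FROM INFORMATION_SCHEMA.COLUMNS
WHERE TABLE_SCHEMA = 'dbo'
  AND TABLE_NAME = 'TaxiTrips'
ORDER BY ORDINAL_POSITION;
```

The query should return one row for each of the 24 columns in the CREATE TABLE statement.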

Ingest Parquet data using the COPY statement

In this example, we load data from a Parquet source. Because this data is publicly available, no authentication details are needed. You only need to specify the source, the destination, and the FILE_TYPE argument.

Use the following code to run the COPY statement with a Parquet source:

    COPY INTO dbo.TaxiTrips
    FROM 'https://azureopendatastorage.blob.core.windows.net/nyctlc/yellow'
    WITH (
        FILE_TYPE = 'PARQUET'
    );
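If the full dataset is more than you need, COPY also accepts wildcards in the source path. The sketch below loads a single month and is meant to be run instead of the full load, not in addition to it, because running both would duplicate rows. It assumes the dataset folder is partitioned as puYear=<year>/puMonth=<month>, which is an assumption about the storage layout and is not stated in this tutorial.

```sql
-- Assumption: the source folder uses a puYear=<year>/puMonth=<month> layout.
-- Adjust the path if the actual layout differs.
COPY INTO dbo.TaxiTrips
FROM 'https://azureopendatastorage.blob.core.windows.net/nyctlc/yellow/puYear=2019/puMonth=1/*.parquet'
WITH (
    FILE_TYPE = 'PARQUET'
);
```

A partial load like this produces a much smaller row count than the full load.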

Check the results

The COPY statement completes by ingesting 1,571,671,152 rows into your new table. You can confirm the operation ran successfully by running a query that returns the total number of rows in your table:

    SELECT COUNT_BIG(*) FROM dbo.TaxiTrips;
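Beyond the total count, a breakdown by year can help spot gaps or unexpected values in the loaded data. This is an optional sanity check that uses the puYear column from the table definition; it is a sketch, not part of the original steps.

```sql
-- Count ingested trips per pickup year to check the distribution of the load.
SELECT puYear, COUNT_BIG(*) AS trip_count
FROM dbo.TaxiTrips
GROUP BY puYear
ORDER BY puYear;
```

The per-year counts should sum to the total returned by the COUNT_BIG(*) query.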

Data ingestion options

Other ways to ingest data into your warehouse include:


Last updated on 2026-03-18