What is the difference between external data and an integration dataset in Azure Synapse?

Hamid Banafunzi 25 Reputation points
2023-07-18T00:46:48.7766667+00:00

What is the difference between external data and an integration dataset in Azure Synapse?

User's image

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
4,109 questions
{count} votes

2 answers

Sort by: Most helpful
  1. KranthiPakala-MSFT 46,402 Reputation points Microsoft Employee
    2023-07-19T00:30:54.67+00:00

    @Hamid Banafunzi Welcome to Microsoft Q&A forum and thanks for reaching out here.

    Below is the difference between and integration dataset and an external data.

    INTEGRATION DATASET:

    An integration dataset is a way to define the structure and location of your data within a data store. It is used in pipeline activities and data flows to specify where your data is located and how it is organized.

    For example, let's say you have a CSV file stored in Azure Blob Storage that contains customer data. You can create an integration dataset that defines the structure of the CSV file (e.g., the column names and data types) and the location of the file in Blob Storage. You can then use this integration dataset in a Synapse pipeline activity or data flow to read or write data to/from the CSV file.

    In other words, an integration dataset is a way to define the "what" and "where" of your data within a data store. It specifies the structure of your data (e.g., column names and data types) and the location of your data (e.g., Blob Storage or SQL Server).

    User's image

    EXTERNAL DATA:

    An External data in Azure Synapse Workspace refers to data that is stored outside of the Synapse Workspace environment, which can be accessed and queried from within Synapse Workspace using external tables. An external table is a table that is defined in Synapse Workspace but is backed by data that is stored in an external data store/source.

    Once an external data connection is created, the underlying data of that connection will be available for analysis in the Data hub or for pipeline activities in the Integrate hub. Please note this is limited to below data sources only.

    User's image

    For better understanding of the external data and its use case scenarios, please refer to this video demonstration: Connecting to External Data with Azure Synapse

    Hope this helps. Let me know if you have further questions.


    Please don’t forget to Accept Answer and Yes for "was this answer helpful" wherever the information provided helps you, this can be beneficial to other community members.

    1 person found this answer helpful.
    0 comments No comments

  2. Vinodh247 9,071 Reputation points
    2023-07-18T07:31:39.1333333+00:00

    \Thanks for reaching out to Microsoft Q&A.

    !User's image

    User's image

    Please Upvote and Accept as answer if the reply was helpful, this will be benefitting the other community members who go through the same issue.

    0 comments No comments