Datasets, regions, and sinks supported by Microsoft Graph Data Connect

Microsoft Graph Data Connect supports a variety of datasets, data regions, and storage locations in Microsoft Azure. This article describes the supported datasets and how to access the dataset schemas, the Microsoft 365 and Microsoft Azure regions that are supported, and the storage locations that Microsoft Graph Data Connect utilizes through Azure Synapse or Azure Data Factory.

Datasets

Microsoft Graph Data Connect currently supports the following datasets. To view the schemas for each dataset, create a new dataset in Azure Synapse or Azure Data Factory and use the Schema tab to view it.

Azure Active Directory

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.User_v0 Contains user information (DisplayName, UserPrincipalName, and other information). Free
BasicDataSet_v0.User_v1 Contains user information. Free
BasicDataSet_v0.Manager_v0 Contains user information for the manager of each user. Free
BasicDataSet_v0.DirectReport_v0 Contains user information about the employees that directly report to each user. Free

Outlook and Exchange Online

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.CalendarView_v0 Contains the events from the Calendar view. Paid
BasicDataSet_v0.Contact_v0 Contains the available information from each user’s address book. Paid
BasicDataSet_v0.Contact_v1 Contains contact information from each user's address book. Paid
BasicDataSet_v0.Event_v0 Contains the information from a user’s calendar events. Paid
BasicDataSet_v0.Event_v1 Contains the events in each user's calendar. Paid
BasicDataSet_v0.Inbox_v1 Contains the mail folders from each user's inbox. Paid
BasicDataSet_v0.MailboxSettings_v0 Contains the mailbox settings of each user. Free
BasicDataSet_v0.MailFolder_v0 Contains the mail folders from each user's mailbox. Paid
BasicDataSet_v0.Message_v0 Contains the email messages from a user’s mailbox. Paid
BasicDataSet_v0.Message_v1 Contains the email message in each user's mailbox. Paid
BasicDataSet_v0.SentItem_v0 Contains the messages sent from each user's mailbox. Paid
BasicDataSet_v0.SentItem_v1 Contains the message sent from each user's mailbox. Paid
BasicDataSet_v0.TodoTaskFolders_v0 Contains all task folders in the user's mailbox. Paid
BasicDataSet_v0.TodoTasks_v0 Contains all the tasks in the signed-in user's mailbox. Paid
BasicDataSet_v0.OutlookGroupConversations_v0 Contains the message sent from each user's mailbox. Paid

Microsoft Teams

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.TeamChat_v1 Contains Teams chat messages for one-on-one and group chat messages. This dataset excludes chat messages explicitly deleted by users. Paid
BasicDataSet_v0.TeamsCallRecords_v1 Contains activity records from Teams calls and meetings. Paid
BasicDataSet_v0.TeamsChannelDetails_v0 Contains details about Channels in a team. Paid
BasicDataSet_v0.TeamsStandardChannelMessages_v0 Contains channel posts and messages from Standard Channels in Teams. Paid
BasicDataSet_v0.TeamsTranscript_v1 Contains transcripts from calls and meetings in Teams when the transcript is enabled for a meeting or a call. Paid

Microsoft Groups

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.GroupDetails_v0 Contains a Teams Chat Group details. Free
BasicDataSet_v0.GroupMembers_v0 Contains a Teams Chat Group member details. Free
BasicDataSet_v0.GroupOwners_v0 Contains the Teams Chat Group owners' details. Free

OneDrive and SharePoint Online

Please consider that these datasets are available after 48 hours (about two days). For example, you can query data for 07/01 starting on 07/03. 

The data available is from the last 21 days (about three weeks). For example, if you would like to query data for 07/01, you can do so from 07/03 to 07/24. 

Dataset name Description Sample and Schema Billing
DocumentSharingDataset_v0_Preview Contains information about sharing permissions of documents. Free
SharePointSitesDataset_v0_Preview Contains information about SharePoint sites. Free
SharePointGroupsDataset_v0_Preview Contains SharePoint group information, including details about group members. Free

Viva Insights

Dataset name Description Sample and Schema Billing
VivaInsightsDataset_v0 Contains Viva Insights metrics. Requires Viva Insights license

Regions

Microsoft Graph Data Connect supports extracting data from a variety of Microsoft 365 regions. To successfully move data from the Microsoft 365 datacenter into your Microsoft Azure storage, the Azure Synapse or Azure Data Factory instance and the Azure storage location must both map to a supported region for the location of the Microsoft 365 data.

The following table indicates which Microsoft 365 regions are supported and the corresponding Azure regions required for data movement.

Office region Azure region
Asia-Pacific
  • East Asia
  • Southeast Asia
Australia
  • Australia East
  • Australia Southeast
Europe
  • North Europe
  • West Europe
North America
  • Central US
  • East US
  • East US 2
  • North Central US
  • South Central US
  • West Central US
  • West US
  • West US 2
United Kingdom
  • UK South
  • UK West
Canada (CAN)
  • Canada Central
  • Canada East
Japan (JPN)
  • Japan West
  • Japan East
India (IND)
  • South India
  • Central India
Korea (KOR)
  • Korea Central
  • Korea South
Switzerland (CHE)
  • Switzerland North
Germany (DEU)
  • Germany West Central
Norway (NOR)
  • Norway East
France (FRA)
  • France Central
UAE (UAE)
  • UAE North

Sinks

Sinks are the output location that Azure Synapse or Azure Data Factory uses to place data in Azure storage. Microsoft Graph Data Connect supports the following sink storage types:

The following characteristics apply to sinks:

  • Service Principal authentication is the only supported authentication mechanism for all sink types in a copy activity with Microsoft 365 as the source.

  • When using Azure Storage Blob as the sink, you must ensure that your application has Storage Blob Data Contributor access to the Azure Storage Blob location.

  • For copy activity, the output files are formatted as JSON. This format is fixed and modifying the format is not supported. However, you can use Azure Synapse or Azure Data Factory to copy the result of a Microsoft Graph Data Connect pipeline into another storage mechanism (such as Azure SQL Database).

  • Mapping data flows: Copy and transform data from Microsoft 365 (Office 365) - Azure Data Factory & Azure Synapse | Microsoft Learn |

    • Output can be in parquet format. For details about the supported data transformations, see Flatten transformation in mapping data flow.

    • Microsoft Graph Data Connect on mapping data flows supports direct output of the data into Azure SQL DB.

 

The following table indicates the areas that are supported for the corresponding copy activity and mapping data flows.

Area Copy Activity Mapping data flows
Output data formats supported JSON JSON, Parquet
Data transformation (normalization/flattening/etc.) Requires additional transformation step in the ADF/Synapse pipeline Supports inline transformations
Supported data sinks ADLS gen2, Azure Blob ADLS gen2, Azure Blob, Azure SQL DB
Azure VNET IR Not supported Supported

See also