Datasets, regions, and sinks supported by Microsoft Graph Data Connect

Microsoft Graph Data Connect supports a variety of datasets, data regions, and storage locations in Microsoft Azure. This article describes the supported datasets and how to access the dataset schemas, the Microsoft 365 and Microsoft Azure regions that are supported, and the storage locations that Microsoft Graph Data Connect utilizes through Azure Synapse or Azure Data Factory.

Datasets

Microsoft Graph Data Connect currently supports the following datasets. To view the schemas for each dataset, create a new dataset in Azure Synapse or Azure Data Factory and use the Schema tab to view it.

Azure Active Directory

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.User_v0 Contains user information (DisplayName, UserPrincipalName, and other information). Free
BasicDataSet_v0.User_v1 Contains user information. Free
BasicDataSet_v0.Manager_v0 Contains user information for the manager of each user. Free
BasicDataSet_v0.DirectReport_v0 Contains user information about the employees that directly report to each user. Free

Outlook and Exchange Online

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.CalendarView_v0 Contains the events from the Calendar view. Paid
BasicDataSet_v0.Contact_v0 Contains the available information from each user’s address book. Paid
BasicDataSet_v0.Contact_v1 Contains contact information from each user's address book. Paid
BasicDataSet_v0.Event_v0 Contains the information from a user’s calendar events. Paid
BasicDataSet_v0.Event_v1 Contains the events in each user's calendar. Paid
BasicDataSet_v0.Inbox_v1 Contains the mail folders from each user's inbox. Paid
BasicDataSet_v0.MailboxSettings_v0 Contains the mailbox settings of each user. Free
BasicDataSet_v0.MailFolder_v0 Contains the mail folders from each user's mailbox. Paid
BasicDataSet_v0.Message_v0 Contains the email messages from a user’s mailbox. Paid
BasicDataSet_v0.Message_v1 Contains the email message in each user's mailbox. Paid
BasicDataSet_v0.SentItem_v0 Contains the messages sent from each user's mailbox. Paid
BasicDataSet_v0.SentItem_v1 Contains the message sent from each user's mailbox. Paid
BasicDataSet_v0.OutlookGroupConversations_v0 Contains the message sent from each user's mailbox. Paid

Microsoft Teams

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.TeamChat_v1 Contains Teams chat messages for one-on-one and group chat messages. This dataset excludes chat messages explicitly deleted by users. Paid
BasicDataset_v0.TeamStandardChannelMessages_v0 Contains a Teams standard channel message details. Paid
BasicDataSet_v0.TeamsCallRecords_v1 Contains activity records from Teams calls and meetings. Paid
BasicDataSet_v0.TeamsChannelDetails_v0 Contains details about Channels in a team. Paid
BasicDataSet_v0.TeamsStandardChannelMessages_v0 Contains channel posts and messages from Standard Channels in Teams. Paid
BasicDataSet_v0.TeamsTranscript_v1 Contains transcripts from calls and meetings in Teams when the transcript is enabled for a meeting or a call. Paid

Microsoft Groups

Dataset name Description Sample and Schema Billing
BasicDataSet_v0.GroupDetails_v0 Contains a Teams Chat Group details. Free
BasicDataSet_v0.GroupMembers_v0 Contains a Teams Chat Group member details. Free
BasicDataSet_v0.GroupOwners_v0 Contains the Teams Chat Group owners' details. Free

OneDrive and SharePoint Online

Please consider that these datasets are available after 48 hours (about two days). For instance, you can query data for 07/01 starting on 07/03. 

The data available is from the last 21 days (about three weeks). For instance, if you would like to query data for 07/01, you can do so from 07/03 to 07/24. 

Dataset name Description Sample and Schema Billing
DocumentSharingDataset_v0_Preview Contains information about sharing permissions of documents. Free (under preview only). These datasets will be available to those in Preview. You must join the Graph TAP program using the signup form at https://aka.ms/GraphTAPForm and request to join the Preview.
SharePointSitesDataset_v0_Preview Contains information about SharePoint sites. Free (under preview only). These datasets will be available to those in Preview. You will need to join the Graph TAP program using the signup form at https://aka.ms/GraphTAPForm and request to join the Preview.
SharePointGroupsDataset_v0_Preview Contains SharePoint group information, including details about group members. Free (under preview only). These datasets will be available to those in Preview. You must join the Graph TAP program using the signup form at https://aka.ms/GraphTAPForm and request to join the Preview.

Viva Insights

Dataset name Description Sample and Schema Billing
VivaInsightsDataset_v0 Contains Viva Insights metrics. Requires Viva Insights license

Regions

Microsoft Graph Data Connect supports extracting data from a variety of Microsoft 365 regions. To successfully move data from the Microsoft 365 datacenter into your Microsoft Azure storage, the Azure Synapse or Azure Data Factory instance and the Azure storage location must both map to a supported region for the location of the Microsoft 365 data.

The following table indicates which Microsoft 365 regions are supported and the corresponding Azure regions required for data movement.

Office region Azure region
Asia-Pacific
  • East Asia
  • Southeast Asia
Australia
  • Australia East
  • Australia Southeast
Europe
  • North Europe
  • West Europe
North America
  • Central US
  • East US
  • East US 2
  • North Central US
  • South Central US
  • West Central US
  • West US
  • West US 2
United Kingdom
  • UK South
  • UK West

Sinks

Sinks are the output location that Azure Synapse or Azure Data Factory uses to place data in Azure storage. Microsoft Graph Data Connect supports the following sink storage types:

The following characteristics apply to sinks:

  • The output files are of format JSON lines. The output format is fixed, and there is no support for modifying the format of the output. However, you can use Azure Synapse or Azure Data Factory to copy the result of a Microsoft Graph Data Connect pipeline into another storage mechanism (such as Azure SQL Database).
  • Service Principal authentication is the only supported authentication mechanism for all sink types in a copy activity with Microsoft 365 as the source.
  • When using Azure Storage Blob as the sink, you must ensure that your application has Storage Blob Data Contributor access to the Azure Storage Blob location.

See also