Datasets, regions, and sinks supported by Microsoft Graph Data Connect
Microsoft Graph Data Connect supports a variety of datasets, data regions, and storage locations in Microsoft Azure. This article describes the supported datasets and how to access the dataset schemas, the Microsoft 365 and Microsoft Azure regions that are supported, and the storage locations that Microsoft Graph Data Connect utilizes through Azure Synapse or Azure Data Factory.
Datasets
Microsoft Graph Data Connect currently supports the following datasets. To view the schemas for each dataset, create a new dataset in Azure Synapse or Azure Data Factory and use the Schema tab to view it.
Azure Active Directory
Dataset name | Description | Sample and Schema | Billing |
---|---|---|---|
BasicDataSet_v0.User_v0 | Contains user information (DisplayName, UserPrincipalName, and other information). | Free | |
BasicDataSet_v0.User_v1 | Contains user information. | Free | |
BasicDataSet_v0.Manager_v0 | Contains user information for the manager of each user. | Free | |
BasicDataSet_v0.DirectReport_v0 | Contains user information about the employees that directly report to each user. | Free |
Outlook and Exchange Online
Dataset name | Description | Sample and Schema | Billing |
---|---|---|---|
BasicDataSet_v0.CalendarView_v0 | Contains the events from the Calendar view. | Paid | |
BasicDataSet_v0.Contact_v0 | Contains the available information from each user’s address book. | Paid | |
BasicDataSet_v0.Contact_v1 | Contains contact information from each user's address book. | Paid | |
BasicDataSet_v0.Event_v0 | Contains the information from a user’s calendar events. | Paid | |
BasicDataSet_v0.Event_v1 | Contains the events in each user's calendar. | Paid | |
BasicDataSet_v0.Inbox_v1 | Contains the mail folders from each user's inbox. | Paid | |
BasicDataSet_v0.MailboxSettings_v0 | Contains the mailbox settings of each user. | Free | |
BasicDataSet_v0.MailFolder_v0 | Contains the mail folders from each user's mailbox. | Paid | |
BasicDataSet_v0.Message_v0 | Contains the email messages from a user’s mailbox. | Paid | |
BasicDataSet_v0.Message_v1 | Contains the email message in each user's mailbox. | Paid | |
BasicDataSet_v0.SentItem_v0 | Contains the messages sent from each user's mailbox. | Paid | |
BasicDataSet_v0.SentItem_v1 | Contains the message sent from each user's mailbox. | Paid | |
BasicDataSet_v0.TodoTaskFolders_v0 | Contains all task folders in the user's mailbox. | Paid | |
BasicDataSet_v0.TodoTasks_v0 | Contains all the tasks in the signed-in user's mailbox. | Paid | |
BasicDataSet_v0.OutlookGroupConversations_v0 | Contains the message sent from each user's mailbox. | Paid |
Microsoft Teams
Dataset name | Description | Sample and Schema | Billing |
---|---|---|---|
BasicDataSet_v0.TeamChat_v1 | Contains Teams chat messages for one-on-one and group chat messages. This dataset excludes chat messages explicitly deleted by users. | Paid | |
BasicDataSet_v0.TeamsCallRecords_v1 | Contains activity records from Teams calls and meetings. | Paid | |
BasicDataSet_v0.TeamsChannelDetails_v0 | Contains details about Channels in a team. | Paid | |
BasicDataSet_v0.TeamsStandardChannelMessages_v0 | Contains channel posts and messages from Standard Channels in Teams. | Paid | |
BasicDataSet_v0.TeamsTranscript_v1 | Contains transcripts from calls and meetings in Teams when the transcript is enabled for a meeting or a call. | Paid |
Microsoft Groups
Dataset name | Description | Sample and Schema | Billing |
---|---|---|---|
BasicDataSet_v0.GroupDetails_v0 | Contains a Teams Chat Group details. | Free | |
BasicDataSet_v0.GroupMembers_v0 | Contains a Teams Chat Group member details. | Free | |
BasicDataSet_v0.GroupOwners_v0 | Contains the Teams Chat Group owners' details. | Free |
OneDrive and SharePoint Online
Please consider that these datasets are available after 48 hours (about two days). For example, you can query data for 07/01 starting on 07/03.
The data available is from the last 21 days (about three weeks). For example, if you would like to query data for 07/01, you can do so from 07/03 to 07/24.
Dataset name | Description | Sample and Schema | Billing |
---|---|---|---|
DocumentSharingDataset_v0_Preview | Contains information about sharing permissions of documents. | Free | |
SharePointSitesDataset_v0_Preview | Contains information about SharePoint sites. | Free | |
SharePointGroupsDataset_v0_Preview | Contains SharePoint group information, including details about group members. | Free |
Viva Insights
Dataset name | Description | Sample and Schema | Billing |
---|---|---|---|
VivaInsightsDataset_v0 | Contains Viva Insights metrics. | Requires Viva Insights license |
Regions
Microsoft Graph Data Connect supports extracting data from a variety of Microsoft 365 regions. To successfully move data from the Microsoft 365 datacenter into your Microsoft Azure storage, the Azure Synapse or Azure Data Factory instance and the Azure storage location must both map to a supported region for the location of the Microsoft 365 data.
The following table indicates which Microsoft 365 regions are supported and the corresponding Azure regions required for data movement.
Office region | Azure region |
---|---|
Asia-Pacific |
|
Australia |
|
Europe |
|
North America |
|
United Kingdom |
|
Canada (CAN) |
|
Japan (JPN) |
|
India (IND) |
|
Korea (KOR) |
|
Switzerland (CHE) |
|
Germany (DEU) |
|
Norway (NOR) |
|
France (FRA) |
|
UAE (UAE) |
|
Sinks
Sinks are the output location that Azure Synapse or Azure Data Factory uses to place data in Azure storage. Microsoft Graph Data Connect supports the following sink storage types:
- Azure Data Lake Storage Gen1
- Azure Data Lake Storage Gen2
- Azure Storage Blob
- Azure SQL DB (mapping data flows only)
The following characteristics apply to sinks:
Service Principal authentication is the only supported authentication mechanism for all sink types in a copy activity with Microsoft 365 as the source.
When using Azure Storage Blob as the sink, you must ensure that your application has Storage Blob Data Contributor access to the Azure Storage Blob location.
For copy activity, the output files are formatted as JSON. This format is fixed and modifying the format is not supported. However, you can use Azure Synapse or Azure Data Factory to copy the result of a Microsoft Graph Data Connect pipeline into another storage mechanism (such as Azure SQL Database).
Mapping data flows: Copy and transform data from Microsoft 365 (Office 365) - Azure Data Factory & Azure Synapse | Microsoft Learn |
Output can be in parquet format. For details about the supported data transformations, see Flatten transformation in mapping data flow.
Microsoft Graph Data Connect on mapping data flows supports direct output of the data into Azure SQL DB.
The following table indicates the areas that are supported for the corresponding copy activity and mapping data flows.
Area | Copy Activity | Mapping data flows |
---|---|---|
Output data formats supported | JSON | JSON, Parquet |
Data transformation (normalization/flattening/etc.) | Requires additional transformation step in the ADF/Synapse pipeline | Supports inline transformations |
Supported data sinks | ADLS gen2, Azure Blob | ADLS gen2, Azure Blob, Azure SQL DB |
Azure VNET IR | Not supported | Supported |
See also
Feedback
Submit and view feedback for