Document metadata fields in eDiscovery (Premium)
The following table lists the metadata fields for documents in a review set in a case in Microsoft Purview eDiscovery (Premium). For information about the properties you can search when collecting data from Microsoft 365 content locations for an eDiscovery (Premium) case, see Keyword queries and search conditions for Content Search.
This table provides the following information:
- Field name and Display field name: The name of the metadata field and the name of the field that's displayed when viewing the file metadata of a selected document in a review set. Some metadata fields aren't included when viewing the file metadata of a document. These fields are highlighted with an asterisk (*).
- Searchable field name: The name of the property that you can search for when running a review set query.
- Exported field name: The name of the metadata field that's included when documents are exported.
- Description: A description of the metadata field.
The Keywords field in review set search uses Keyword Query Language (KQL). The fields listed in the Searchable field name column can be used in the Keywords field in a review set search to form complex queries without you having to use the query builder. For more information about KQL, see Keyword Query Language syntax reference.
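As a minimal sketch of how such a query might be composed, the Python helpers below assemble KQL property restrictions and combine them with the AND and OR operators. The helper names are hypothetical, and the field names `Custodian` and `FileClass` and the custodian name are illustrative placeholders; substitute values from the Searchable field name column of the table.

```python
def kql_term(field: str, value: str) -> str:
    """Build a KQL property restriction of the form Field:"value"."""
    if '"' in value:
        # Quote handling is left out of this sketch on purpose.
        raise ValueError("value must not contain double quotes")
    return f'{field}:"{value}"'


def kql_all(*terms: str) -> str:
    """Combine terms so that every one must match (KQL AND)."""
    return "(" + " AND ".join(terms) + ")"


def kql_any(*terms: str) -> str:
    """Combine terms so that at least one must match (KQL OR)."""
    return "(" + " OR ".join(terms) + ")"


# Placeholder field names and values, for illustration only.
query = kql_all(
    kql_term("Custodian", "Megan Bowen"),
    kql_any(
        kql_term("FileClass", "Email"),
        kql_term("FileClass", "Attachment"),
    ),
)
print(query)
```

The resulting string can be pasted into the Keywords field. Keeping the grouping explicit with parentheses avoids relying on operator precedence.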
|Field name and Display field name
|Searchable field name
|Exported field name
|Attachment Content ID
|Attachment content ID of the item.
|Attorney client privilege score
|Attorney-client privilege model content score.
|Author from the document metadata.
|Bcc field for message types. The format is DisplayName <SMTPAddress>.
|Cc field for message types. The format is DisplayName <SMTPAddress>.
|This field is the Teams channel name. Only applies to Microsoft Teams content.
|Retention labels applied to content in Office 365.
|Human readable path that describes the source of the item.
|Extracted text of the item.
|Conversation body of the item.
|Conversation ID from the message. For Teams 1:1 and group chats, all transcript files and their family items within the same conversation share the same Conversation ID. For more information, see eDiscovery (Premium) workflow for content in Microsoft Teams.
|Conversation Family ID
|The ID that identifies individual elements of a conversation and the related items in the conversation.
|Conversation index from the message.
|This field depends on content type.
Teams 1:1 chat: first 40 characters of first message.
Teams 1:N chat: Name of group chat; if not available, the first 40 characters of the first message.
Teams Channel Post: Post title or announcement subhead; if not available, the first 40 characters of the first message.
|Conversation Pdf Time
|Date when the PDF version of the conversation was created.
|Conversation Redaction Burn Time
|Date when the PDF version of the conversation was created for Chat.
|Conversation topic of the item.
|The type of chat conversation. Values are:
Teams 1:1 and group chats and all Viva Engage conversations: Group
Teams channels and private channels: Channel
|Contains Deleted Message
|Indicates if the chat transcript includes a deleted message.
|Contains Edited Message
|Indicates if the chat transcript includes an edited message.
|Teams Announcement Title
|Title from a Teams announcement.
|The path of the converted export file. For internal Microsoft use only.
|Name of the custodian the item was associated with.
|Date is a computed field that depends on the file type.
Email: Sent date
|Comments from the document metadata.
|Company from the document metadata.
|Document date created
|Create date from document metadata.
|The index in the family. -1 or 0 means it's the root.
|Keywords from the document metadata.
|Document modified by
|The user who last modified the document from document metadata.
|Revision from the document metadata.
|Subject from the document metadata.
|Template from the document metadata.
|The name of the user who last saved the document.
|Dominant theme as calculated for analytics.
|Group ID for exact duplicates.
|Values are None, Reply, or Forward; based on the subject line of a message.
|Email Delivery Receipt Requested
|Email address supplied in Internet Headers for delivery receipt.
|Importance of the message: 0 - Low; 1 - Normal; 2 - High
|Ignored processing errors
|Error was ignored and not remediated.
|The full set of email headers from the email message.
|Indicates a message's level within the email thread it belongs to; attachments inherit their parent message's value.
|Email Message ID
|Internet message ID from the message.
|Email address supplied in Internet Headers for read receipt.
|Security setting of the message: 0 - None; 1 - Signed; 2 - Encrypted; 3 - Encrypted and signed.
|Sensitivity setting of the message: 0 - None; 1 - Personal; 2 - Private; 3 - CompanyConfidential.
|Group ID for all messages in the same email set.
|Position of the message within the email set; consists of node IDs from the root to the current message, separated by periods (.).
|The path of the exported file.
|Extracted content type
|Extracted content type, in the form of a MIME type; for example, image/jpeg.
|The path to the extracted text file in the export.
|Number of characters in the extracted text.
|Numeric identifier for families that are exact duplicates of each other (same content and all the same attachments).
|Groups together attachments and extracted items from email and chats with their parent item. This includes the chat or email and all attachments and extracted items.
|Number of documents in the family.
|For content from SharePoint and OneDrive: Document.
For content from Exchange: Email or Attachment.
For content from Teams or Viva Engage: Conversations.
|Document identifier unique within the case.
|File system date created
|Created date from file system (only applies to non-Office 365 data).
|File system date modified
|Modified date from file system (only applies to non-Office 365 data).
|File type of the item based on file extension.
|Groups together all items for email and documents. For email, this includes the message and all attachments and extracted items. For documents, this includes the document and any embedded items.
|Indicates whether or not the message has attachments.
|True when at least one of the participants is found in the attorney list; otherwise, the value is False.
|Indicates whether or not the item has text; possible values are True and False.
|This ID is used to uniquely identify a document within a review set. This field can't be used in a review set search and the ID can't be used to access a document in its native location.
|Inclusive type calculated for analytics: 0 - not inclusive; 1 - inclusive; 2 - inclusive minus; 3 - inclusive copy.
|In Reply To ID
|In reply to ID from the message.
|The original file extension of the file.
|The file ID of the top level item in the review set. For an attachment, this ID will be the ID of the parent. This can be used to group families together.
|Is modern attachment
|This file is a modern attachment or linked file.
|Is from document version
|Current document is from a different version of another document.
|Is email attachment
|This item is from an email attachment that shows up as an attached item to the message.
|Is inline attachment
|This was attached inline and shows up in the body of the message.
|One document in every set of exact duplicates is marked as representative.
|Item class supplied by Exchange server; for example, IPM.Note.
|Last modified date
|Last modified date from document metadata.
|The ID of the load set in which the item was added to a review set.
|String that indicates the type of location that documents were sourced from.
Imported Data - Non-Office 365 data
|String that identifies the source of the item. For Exchange, this is the SMTP address of the mailbox; for SharePoint and OneDrive, the URL for the site collection.
|This file is the pivot in a near duplicate set.
|Marked as representative
|One document from each set of exact duplicates is marked as representative.
|Meeting End Date
|Meeting end date for meetings.
|Meeting Start Date
|Meeting start date for meetings.
|The type of message to search for. Possible values:
|Modern Attachment Parent ID
|The Immutable ID of the document's parent.
|Native extension of the item.
|Native file name
|Native file name of the item.
|Native file size
|Number of bytes of the native item.
|MD5 hash (128-bit hash value) of the file stream.
|SHA256 hash (256-bit hash value) of the file stream.
|ND/ET Sort: Excluding attachments
|Concatenation of the email thread (ET) set and near-duplicate (ND) set. This field is used for efficient sorting at review time. A D is prefixed to ND sets and an E is prefixed to ET sets.
|ND/ET Sort: Including attachments
|Concatenation of an email thread (ET) set and near-duplicate (ND) set. This field is used for efficient sorting at review time. A D is prefixed to ND sets and an E is prefixed to ET sets. Each email item in an ET set is followed by its appropriate attachments.
|Near Duplicate Set
|Items that are similar to the pivot document share the same ND_set.
|Author from SharePoint.
|O365 created by
|Created by from SharePoint.
|O365 date created
|Created date from SharePoint.
|The date a document (or document version) collected from SharePoint or OneDrive for Business was modified. This is the same modified date as the one displayed in the version history in the SharePoint and OneDrive user experience.
|O365 modified by
|Modified by from SharePoint or OneDrive.
|List of custodians of documents that are exact duplicates (for email, based on content; for documents, based on hash).
|Other file IDs
|List of file IDs of documents that are exact duplicates (for email, based on content; for documents, based on hash).
|List of compound paths of documents that are exact duplicates (email: based on content, documents: based on hash).
|ID of the item's parent.
|The closest preceding email message in the email thread.
|List of all domains of participants of a message.
|List of all participants of a message; for example, Sender, To, Cc, Bcc.
|The ID of a pivot.
|True if the attorney-client privilege detection model considers the document potentially privileged.
|Processing status after the item was added to a review set.
|Read percentile for the document based on Relevance.
|The date and time the email was received in UTC.
|Number of recipients in the message.
|List of all domains of recipients of a message.
|List of all recipients of a message (To, Cc, Bcc).
|The path of the redacted replacement file in the export.
|The path of the redacted text file replacement in the export. For internal Microsoft use only.
|Relevance tag Case issue 1
|Relevance tag Case issue 1 from Relevance.
|Relevance score of a document based on Relevance.
|Relevance score of a document based on Relevance.
|Numeric identifier of each set of exact duplicates.
|The row number of the item in the load file.
|Sender (From) field for message types. The format is DisplayName <SmtpAddress>.
|Calculated field comprised of the sender or author of the item.
|Domain of the sender.
|Sent date of the message.
Chats: Beginning date from the transcript
|Documents of similar content (ND_set) or email within the same email thread (Email_set) share the same Set_ID.
|Set Order: Inclusive First
|Sorting field - email and attachments: counter-chronological; documents: pivot first then by descending similarity score.
|Indicates how similar a document is to the pivot of the near duplicate set.
|Subject of the message.
|Calculated field comprised of the subject or title of the item.
|Tags applied in a review set.
|Teams: Name of team
Viva Engage: Community name
|Themes list as calculated for analytics.
|The Thread ID from email messages, Teams conversations, and Viva Engage conversations. For email messages, all reply messages and attachments share the same Thread ID. For Teams 1:1 and group chats, all transcript files and their associated items within the same conversation share the same Thread ID. For more information, see View documents in a review set.
|Title from the document metadata. For Teams and Viva Engage content, this is the value from the ConversationName property.
|To field for message types. The format is DisplayName <SmtpAddress>.
|Unique in email set
|False if there's a duplicate of the attachment in its email set.
|Version Group ID
|Groups together the different versions of the same document.
|The version number of a document collected from SharePoint or OneDrive for Business. This is the same version number as the one displayed in the version history in the SharePoint and OneDrive user experience.
|True if the item was remediated, otherwise False.
|Number of words in the item.
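
Several descriptions in the table define coded values (importance, security, sensitivity, inclusive type) and fixed formats (DisplayName <SMTPAddress>, period-separated node IDs). The following Python sketch shows one way to decode those values after export. The function and constant names are hypothetical, the sample address is made up, and the sketch assumes you have already read the values out of the exported load file; only the code-to-label mappings come from the table.

```python
from email.utils import parseaddr

# Code-to-label mappings taken from the descriptions in the table.
IMPORTANCE = {0: "Low", 1: "Normal", 2: "High"}
SECURITY = {0: "None", 1: "Signed", 2: "Encrypted", 3: "Encrypted and signed"}
SENSITIVITY = {0: "None", 1: "Personal", 2: "Private", 3: "CompanyConfidential"}
INCLUSIVE_TYPE = {
    0: "not inclusive",
    1: "inclusive",
    2: "inclusive minus",
    3: "inclusive copy",
}


def decode(code_table, raw):
    """Translate a numeric code (int or numeric string) into its label.

    Values that are not numeric or not in the mapping are returned unchanged.
    """
    try:
        return code_table[int(raw)]
    except (KeyError, ValueError, TypeError):
        return raw


def split_address(value):
    """Split 'DisplayName <SMTPAddress>' into (display_name, smtp_address)."""
    return parseaddr(value)


def thread_position(value):
    """Split a period-separated email set position into node IDs, root first."""
    return value.split(".")


print(decode(IMPORTANCE, "2"))
print(split_address("Megan Bowen <meganb@contoso.com>"))
print(thread_position("1.3.2"))
```

Node IDs are kept as strings because the table doesn't specify their format.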