Limits for Content search and eDiscovery (Standard)

Various limits are applied to eDiscovery search tools in the Microsoft Purview compliance portal. This includes searches run on the Content search page and searches that are associated with an eDiscovery case on the eDiscovery (Standard) page. These limits help to maintain the health and quality of services provided to organizations. There are also limits related to the indexing of email messages in Exchange Online for search. You can't modify the limits for eDiscovery searches or email indexing, but you should be aware of them so that you can take these limits into consideration when planning, running, and troubleshooting eDiscovery searches.

For limits related to the Microsoft Purview eDiscovery (Premium) tool, see Limits in eDiscovery (Premium).

Tip

If you're not an E5 customer, use the 90-day Microsoft Purview solutions trial to explore how additional Purview capabilities can help your organization manage data security and compliance needs. Start now at the Microsoft Purview compliance portal trials hub. Learn details about signing up and trial terms.

Search limits

The following table lists the search limits when using the content search tool in the compliance portal and for searches that are associated with a Microsoft Purview eDiscovery (Standard) case.

Description of limit | Limit
The maximum number of mailboxes or sites that can be searched in a single search | No limit (see note 1)
The maximum number of items found in all user mailboxes that can possibly be displayed on the preview page when previewing search results. The newest items are displayed. | 1,000 (see note 2)
The maximum number of user mailboxes that can be previewed for search results. If there are more than 1,000 mailboxes that contain content that matches the search query, at most, only the top 1,000 mailboxes with the most search results will be available for preview. | 1,000
The maximum number of items found in SharePoint and OneDrive for Business sites that are displayed on the preview page when previewing search results. The newest items are displayed. | 200
The maximum number of sites (in SharePoint and OneDrive for Business) that can be previewed for search results. If there are more than 200 total sites that contain content that matches the search query, only the top 200 sites with the most search results will be available for preview. | 200
The maximum number of items per public folder mailbox that are displayed on the preview page when previewing content search results. | 100
The maximum number of items found in all public folder mailboxes that are displayed on the preview page when previewing content search results. | 200
The maximum number of public folder mailboxes that can be previewed for search results. If there are more than 500 public folder mailboxes that contain content that matches the search query, only the top 500 public folder mailboxes with the most search results will be available for preview. | 500
The maximum size of an item that can be viewed on the preview page. | 10,000,000 bytes (approximately 9.5 MB)
The maximum number of characters for the search query (including operators and conditions) for a search. | Mailboxes: 10,000. Sites: 4,000 when searching all sites or 2,000 when searching up to 20 sites. (see note 3)
The maximum number of variants returned when using a prefix wildcard to search for an exact phrase in a search query or when using a prefix wildcard and the NEAR Boolean operator. | 10,000 (see note 4)
The minimum number of alpha characters for prefix wildcards; for example, time*, one*, or set*. | 3
The maximum number of mailboxes in a search that you can delete items in by doing a "search and purge" action (by using the New-ComplianceSearchAction -Purge command). If the search that you're doing a purge action for has more source mailboxes than this limit, the purge action will fail. For more information about search and purge, see Search for and delete email messages in your organization. | 50,000
The maximum number of locations in a search that you can export items from. If the search that you're exporting has more locations than this limit, the export will fail. For more information, see Export content search results. | 100,000

Note

1 Although you can search an unlimited number of mailboxes in a single search, you can only download the exported search results from a maximum of 100,000 mailboxes using the eDiscovery Export Tool in the compliance portal.

2 The intent of the preview page is to show a limited sample of the results. Even for massive searches with thousands of results, the number of items shown on the preview page can, and often will, be much less than the maximum possible value of 1,000. To see the complete search results, you need to export the results.

3 When searching SharePoint and OneDrive for Business locations, the characters in the URLs of the sites being searched are counted against this limit. This limit takes effect after the query is expanded and includes characters from the keyword query, any search permissions filters applied to the user, and the URLs of all site locations. This means the query will get expanded against each of the keywords. For example, if a search query has 15 keywords and additional parameters and conditions, the query gets expanded 15 times, each with the other parameters and conditions in the query. So even though the number of characters in the search query may be below the limit, it's the expanded query that may contribute to exceeding this limit.

4 For non-phrase queries (a keyword value that doesn't use double quotation marks), we use a special prefix index. This tells us that a word occurs in a document, but not where it occurs in the document. To do a phrase query (a keyword value with double quotation marks), we need to compare the position within the document for the words in the phrase. This means that we can't use the prefix index for phrase queries. In this case, we internally expand the query with all possible words that the prefix expands to; for example, "time*" can expand to "time OR timer OR times OR timex OR timeboxed OR ...". 10,000 is the maximum number of variants the word can expand to, not the number of documents matching the query. There is no upper limit for non-phrase terms.
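The character and prefix wildcard limits described above can be checked roughly before you submit a query. The following Python sketch is illustrative only: the function names (`short_prefix_wildcards`, `site_query_char_estimate`, `precheck`) are hypothetical and aren't part of any Microsoft tool. It counts only the keyword query and the site URLs, so it gives a lower bound; the service also counts search permissions filters and the expanded form of the query, which this sketch can't see.

```python
import re

# Limits copied from the search limits table.
MAILBOX_QUERY_MAX_CHARS = 10_000
SITE_QUERY_MAX_CHARS_ALL_SITES = 4_000
SITE_QUERY_MAX_CHARS_UP_TO_20_SITES = 2_000
MIN_PREFIX_ALPHA_CHARS = 3


def short_prefix_wildcards(query):
    """Return prefix wildcard terms with fewer than 3 alpha characters before '*'."""
    prefixes = re.findall(r'([^\s"()*]*)\*', query)
    return [p + "*" for p in prefixes
            if sum(ch.isalpha() for ch in p) < MIN_PREFIX_ALPHA_CHARS]


def site_query_char_estimate(query, site_urls):
    """Lower-bound estimate: keyword query characters plus site URL characters."""
    return len(query) + sum(len(url) for url in site_urls)


def precheck(query, site_urls=(), searching_all_sites=False, searching_mailboxes=True):
    """Return a list of warnings (empty if this rough check flags nothing)."""
    warnings = []
    if searching_mailboxes and len(query) > MAILBOX_QUERY_MAX_CHARS:
        warnings.append("query exceeds the 10,000-character limit for mailboxes")

    # The table states a site limit only for "all sites" and "up to 20 sites".
    if searching_all_sites:
        site_limit = SITE_QUERY_MAX_CHARS_ALL_SITES
    elif 0 < len(site_urls) <= 20:
        site_limit = SITE_QUERY_MAX_CHARS_UP_TO_20_SITES
    else:
        site_limit = None  # not covered by the table; nothing to compare against

    if site_limit is not None:
        estimate = site_query_char_estimate(query, site_urls)
        if estimate > site_limit:
            warnings.append(
                f"query plus site URLs is at least {estimate} characters "
                f"(site limit: {site_limit})"
            )

    for term in short_prefix_wildcards(query):
        warnings.append(f"prefix wildcard '{term}' has fewer than 3 alpha characters")
    return warnings


if __name__ == "__main__":
    print(precheck('"ti* report" AND set*'))
```

Because the estimate ignores query expansion, a query that passes this check can still exceed the limit in the service. Treat a warning as a reason to shorten the query or reduce the number of sites, not as a guarantee in either direction.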

Search times

Microsoft collects performance information for searches run by all organizations. While the complexity of the search query can impact search times, the biggest factor that affects how long searches take is the number of mailboxes searched. Although Microsoft doesn't provide a Service Level Agreement for search times, the following table lists average search times for collection searches based on the number of mailboxes included in the search.

Number of mailboxes | Average search time
100 | 30 seconds
1,000 | 45 seconds
10,000 | 4 minutes
25,000 | 10 minutes
50,000 | 20 minutes
100,000 | 25 minutes
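If you want a rough planning figure for a mailbox count that falls between the rows in the table, one option is to interpolate linearly between the published averages. The following Python sketch assumes that linear interpolation is a reasonable approximation, which the table doesn't state. The function name `estimate_search_seconds` is hypothetical, and the result is an estimate of an average, not a guaranteed search time.

```python
from bisect import bisect_left

# (number of mailboxes, average search time in seconds), from the table above.
AVERAGE_SEARCH_SECONDS = [
    (100, 30),
    (1_000, 45),
    (10_000, 240),
    (25_000, 600),
    (50_000, 1_200),
    (100_000, 1_500),
]


def estimate_search_seconds(mailboxes):
    """Linearly interpolate the average search time for a mailbox count.

    Counts outside the table are clamped to the first or last row, because
    the table gives no data beyond that range (an assumption of this sketch).
    """
    counts = [count for count, _ in AVERAGE_SEARCH_SECONDS]
    if mailboxes <= counts[0]:
        return float(AVERAGE_SEARCH_SECONDS[0][1])
    if mailboxes >= counts[-1]:
        return float(AVERAGE_SEARCH_SECONDS[-1][1])
    i = bisect_left(counts, mailboxes)
    x0, y0 = AVERAGE_SEARCH_SECONDS[i - 1]
    x1, y1 = AVERAGE_SEARCH_SECONDS[i]
    return y0 + (y1 - y0) * (mailboxes - x0) / (x1 - x0)


if __name__ == "__main__":
    print(estimate_search_seconds(37_500))  # 900.0 (15 minutes)
```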

Export limits

The following table lists the limits when exporting the results of a content search. These limits also apply when you export content from an eDiscovery (Standard) case.

Description of limit | Limit
Maximum amount of exportable data from a single search (see note 1) | 2 TB
Maximum an organization can export in a single day (see note 2) | 2 TB
Maximum number of mailboxes for search results that can be downloaded using the eDiscovery Export Tool | 100,000
Maximum size of PST file that can be exported (see note 3) | 10 GB
Maximum number of exports or reports displayed in Content Search or eDiscovery cases | 1,000
Rate at which search results from mailboxes and sites are uploaded to a Microsoft-provided Azure Storage location. | Maximum of 2 GB per hour

Note

1 If the search results are larger than 2 TB, consider using date ranges, or other types of filters to decrease the total size of the search results.

2 This limit is reset daily at 12:00 AM UTC.

3 If the search results from a user's mailbox are larger than 10 GB, the search results for the mailbox will be exported in two (or more) separate PST files. If you choose to export all search results in a single PST file, the PST file will be split into additional PST files if the total size of the search results is larger than 10 GB.
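The export limits above lend themselves to some quick arithmetic when you plan an export. The following Python sketch is a rough planning aid with hypothetical function names. It assumes that 1 TB equals 1,024 GB and that PST files are filled up to the 10 GB maximum before a new file is started; the service might split files differently. Because 2 GB per hour is a maximum rate, the upload time it returns is a minimum.

```python
import math

# Limits copied from the export limits table.
MAX_PST_GB = 10
MAX_UPLOAD_GB_PER_HOUR = 2
MAX_EXPORT_GB = 2 * 1024  # 2 TB, assuming 1 TB = 1,024 GB


def pst_file_count(result_size_gb):
    """Minimum number of PST files needed for results of the given size."""
    return max(1, math.ceil(result_size_gb / MAX_PST_GB))


def min_upload_hours(result_size_gb):
    """Shortest possible upload time at the maximum rate of 2 GB per hour."""
    return result_size_gb / MAX_UPLOAD_GB_PER_HOUR


def fits_single_export(result_size_gb):
    """True if the results are within the 2 TB limit for a single search."""
    return result_size_gb <= MAX_EXPORT_GB


if __name__ == "__main__":
    size_gb = 25
    print(pst_file_count(size_gb))      # 3
    print(min_upload_hours(size_gb))    # 12.5
    print(fits_single_export(size_gb))  # True
```

For example, search results of 500 GB need at least 50 PST files and at least 250 hours of upload time at the maximum rate, which is one reason to narrow large searches with date ranges or other filters before exporting.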

Indexing limits for email messages

The following table describes the indexing limits that might result in an email message being returned as an unindexed item or a partially indexed item in the results of a content search.

Indexing limit | Maximum value | Description
Maximum attachment size (see note 1) | 150 MB | The maximum size of an email attachment that will be parsed for indexing. Any attachment that's larger than this limit won't be parsed for indexing, and the message with the attachment will be marked as partially indexed.
Maximum number of attachments | 250 | The maximum number of files attached to an email message that will be parsed for indexing. If a message has more than 250 attachments, the first 250 attachments are parsed and indexed, and the message is marked as partially indexed because it had additional attachments that weren't parsed.
Maximum attachment depth | 30 | The maximum number of nested attachments that are parsed. For example, if an email message has another message attached to it and the attached message has an attached Word document, the Word document and the attached message will be indexed. This behavior will continue for up to 30 nested attachments.
Maximum number of attached images | 0 | An image that's attached to an email message is skipped by the parser and isn't indexed.
Maximum time spent parsing an item | 30 seconds | A maximum of 30 seconds is spent parsing an item for indexing. If the parsing time exceeds 30 seconds, the item is marked as partially indexed.
Maximum parser output | 2 million characters | The maximum amount of text output from the parser that's indexed. For example, if the parser extracted 8 million characters from a document, only the first 2 million characters are indexed.
Maximum annotation tokens | 2 million | When an email message is indexed, each word is annotated with different processing instructions that specify how that word should be indexed. Each set of processing instructions is called an annotation token. To maintain the quality of service in Office 365, there's a limit of 2 million annotation tokens for an email message.
Maximum body size in index | 67 million characters | The total number of characters in the body of an email message and all its attachments. When an email message is indexed, all text in the body of the message and in all attachments is concatenated into a single string. The maximum size of this string that is indexed is 67 million characters.
Maximum unique tokens in body | 1 million | Tokens are the result of extracting text from content, removing punctuation and spaces, and then dividing it into words (called tokens) that are stored in the index. For example, the phrase "cat, mouse, bird, dog, dog" contains 5 tokens. But only 4 of these are unique tokens. There's a limit of 1 million unique tokens per email message, which helps prevent the index from getting too large with random tokens.

Note

1 Parsing is the process where the indexing service extracts text from the attachment, removes unnecessary characters like punctuation and spaces, and then divides the text into words (in a process called tokenization), that are then stored in the index.
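To make the difference between tokens and unique tokens concrete, the following Python sketch reproduces the "cat, mouse, bird, dog, dog" example from the table. It's a simplified illustration: the regular expression used here is an assumption, and the actual tokenization rules of the indexing service aren't described in this article.

```python
import re


def tokenize(text):
    """Split text into words, dropping punctuation and spaces (simplified)."""
    return re.findall(r"\w+", text)


tokens = tokenize("cat, mouse, bird, dog, dog")
print(len(tokens))       # 5 tokens
print(len(set(tokens)))  # 4 unique tokens
```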

Jobs limits

Description | Limit
Maximum number of concurrent jobs in your organization. | 50
Maximum number of concurrent jobs that a single user can start at the same time. | 25
Maximum number of concurrent tenant-wide jobs (for example, tenant-wide searches) in your organization. | 5
Maximum number of concurrent tenant-wide jobs (for example, tenant-wide searches) that a single user can start at one time. | 5
Maximum number of jobs per day in your organization. (see note 1) | 500

Note

1 This limit is reset daily at 12:00 AM UTC.
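If you track job counts in your own tooling, the limits above can be expressed as a simple check before a new job is started. The following Python sketch is hypothetical: the `JobCounts` class and `blocking_limits` function aren't part of any Microsoft API, and you would need to supply the counts yourself. It also assumes that a tenant-wide job counts toward the general concurrency limits as well as the tenant-wide limits, which this article doesn't confirm.

```python
from dataclasses import dataclass

# Limits copied from the jobs limits table.
ORG_CONCURRENT_MAX = 50
USER_CONCURRENT_MAX = 25
ORG_TENANT_WIDE_MAX = 5
USER_TENANT_WIDE_MAX = 5
ORG_DAILY_MAX = 500


@dataclass
class JobCounts:
    """Current job counts, gathered by your own tracking (hypothetical)."""
    org_running: int
    user_running: int
    org_tenant_wide_running: int
    user_tenant_wide_running: int
    org_started_today: int  # resets daily at 12:00 AM UTC


def blocking_limits(counts, tenant_wide=False):
    """Return the names of limits that would block starting one more job."""
    blocked = []
    if counts.org_running >= ORG_CONCURRENT_MAX:
        blocked.append("organization concurrent jobs")
    if counts.user_running >= USER_CONCURRENT_MAX:
        blocked.append("user concurrent jobs")
    if counts.org_started_today >= ORG_DAILY_MAX:
        blocked.append("organization jobs per day")
    if tenant_wide:
        if counts.org_tenant_wide_running >= ORG_TENANT_WIDE_MAX:
            blocked.append("organization concurrent tenant-wide jobs")
        if counts.user_tenant_wide_running >= USER_TENANT_WIDE_MAX:
            blocked.append("user concurrent tenant-wide jobs")
    return blocked


if __name__ == "__main__":
    counts = JobCounts(org_running=12, user_running=3,
                       org_tenant_wide_running=5, user_tenant_wide_running=1,
                       org_started_today=140)
    print(blocking_limits(counts, tenant_wide=True))
```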

More information

There are additional limits related to different aspects of searching for content, such as content indexing. For more information about these limits, see the following articles:

For information about content searches, see:

For case limits related to eDiscovery (Standard) and eDiscovery (Premium), see:

Need help with eDiscovery issues?

Check out the Resolve common eDiscovery issues article for basic troubleshooting steps that you can take to identify and resolve issues that you might encounter during an eDiscovery search or elsewhere in the eDiscovery process.