Is there any limitation from SharePoint online on indexing large PDF files

john john 946 Reputation points
2021-09-28T12:42:26.953+00:00

I want to start a new project, which basically include uploading large PDF files and be able to search them. when i say large PDF files i am referring to either in respect to number of pages which can goes beyond 500,000 pages in a single file OR in respect to the file size which can also exceed 1/2 GB for a single file OR both of them of-course.

So SharePoint Online is one of the options to implement this project. but my question is if there is a any limitation in SharePoint online on large PDF files? either in uploading them and/or searching them/

Thanks

SharePoint
SharePoint
A group of Microsoft Products and technologies used for sharing and managing content, knowledge, and applications.
9,756 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. Sharath Kumar Aluri 3,071 Reputation points
    2021-09-28T14:35:28.873+00:00

    As long as the file is under 15 gigs you are good, but yes there is a character limit, The maximum amount of text output from the parser that's indexed. For example, if the parser extracted 8 million characters from a document or pdf file, only the first 2 million characters are indexed.

    Ref: https://learn.microsoft.com/en-us/microsoft-365/compliance/limits-for-content-search?view=o365-worldwide

    Thanks & Regards,
    Sharath Aluri


  2. Sharath Kumar Aluri 3,071 Reputation points
    2021-09-28T17:53:18.97+00:00

    the only way I could think of is splitting the file into multiple files, other than that there is no work around since this is SharePoint Online.

    Thanks & Regards,
    Sharath Aluri

    0 comments No comments