Hello,
I am currently encountering an issue across multiple SharePoint Subscription Edition (SSE) farms (fully up to date with the latest patches) where legacy Microsoft Office file formats—specifically .doc, .xls, and .ppt—are no longer being successfully crawled by SharePoint Search.
Issue Description
During full and incremental crawls, any legacy Office documents are consistently failing. This behavior occurs regardless of where the files are stored:
- File shares (via content sources)
- SharePoint document libraries
Modern Office formats such as .docx, .xlsx, and .pptx are crawled and indexed without issue.
Error Details
The crawl log reports the following error:
The docfile has been corrupted. (Access is denied.; Error parsing document ssic://[docid]. Error initializing IFilter for extension '.doc' (Error code is 0x80030109). The function encountered an unknown error.)
- Error Code:
0x80030109
- ULS logs do not provide any additional useful diagnostic information beyond what is shown in the crawl log.
Observations
- The issue is consistent across multiple SSE farms.
- File permissions appear to be correct and do not explain the "Access is denied" portion of the error.
- The problem specifically affects legacy Office binary formats only.
- Restarting search components and performing full crawls has not resolved the issue.
Business Impact
We are currently in the process of migrating from SharePoint 2013 to SharePoint Subscription Edition, and a significant portion of our content still exists in legacy Office formats. Since these files are not being indexed, they are not appearing in search results, which is blocking validation and user acceptance of the new environment.
Request
Has anyone encountered a similar issue with legacy Office file crawling in SharePoint Subscription Edition?
Specifically:
- Are there known limitations or changes in SSE regarding support for legacy Office IFilters?
- Could this be related to missing or deprecated components (e.g., Office Filter Pack)?
- Are there recommended workarounds or configuration changes to restore indexing of these file types?
Any guidance or insight would be appreciated.
Thank you.