Hello Bryan Schwering
Thank you for your question on Microsoft Q&A!
What I have understood from your question is that you're trying to create a Microsoft Purview sensitivity information type (SIT) that can scan custom metadata tags stored in the /docProps/core.xml file inside Office documents, but you're unable to get Purview to recognize or scan that content. Let me know if my understanding is correct.
Currently, Microsoft Purview’s built-in classifiers and SITs are designed to scan the main document content (e.g., body text in Word, cells in Excel) and some standard document properties—but they do not parse or extract data from internal package files like /docProps/core.xml, which contains core metadata such as title, subject, or custom tags added via Office applications.
To help clarify your scenario further:
- Are these custom tags added via the Office UI (e.g., under File > Info > Properties > Advanced Properties > Custom tab), or are they embedded programmatically into the document package?
- Have you confirmed that the tags appear in the document’s visible metadata when viewed in Windows File Explorer or through PowerShell/Office APIs?
- Are you using Microsoft Purview Information Protection (for labeling/classification) or Microsoft Purview Data Map (for scanning data sources like SharePoint/OneDrive)?
As of now, Purview does not support custom parsing of internal Office Open XML structure (like core.xml) during automated scans or classification. If your tags are stored as standard Office document properties (e.g., “Keywords” or “Category”), those may be detectable—but custom XML-level fields typically are not.
For official context on what metadata Purview can classify, refer to: https://learn.microsoft.com/en-us/microsoft-365/compliance/sensitivity-labels-office-apps?view=o365-worldwide and https://learn.microsoft.com/en-us/purview/sensitivity-labels
Let me know the answers to the questions above so I can better assess whether a workaround or alternative approach might be possible.
Please "Accept as Answer" if the answer provided is useful, so that you can help others in the community looking for remediation for similar issues.
Thanks
Pratyush