Introduction
Finding sensitive data between all the data produced in an organization requires different search and recognition patterns, which are called sensitive information types. In this module, you'll learn how to use sensitive information types to support your information protection strategy.
A sensitive information type uses a pattern, such as a regular expression or a function, to detect specific data formats. Keywords and checksums provide corroborative evidence that strengthens detection, while confidence levels and proximity further refine how matches are evaluated. Together, these elements form the foundation of the policies you establish in Microsoft 365 to protect your information and support your data lifecycle management strategy.
Sensitive information types are one of three classification methods available in Microsoft Purview, alongside manual classification by users and automated machine learning through trainable classifiers. Understanding how sensitive information types work gives you the foundation to build effective pattern-based detection across your environment.
Learning objectives
Upon completion of this module, you should be able to:
Recognize the difference between built-in and custom sensitive information types
Configure sensitive information types with exact data match-based classification
Implement document fingerprinting
Create custom keyword dictionaries
Prerequisites
Basic understanding of the Microsoft 365 services
Basic understanding information protection and governance in Microsoft 365