Search the Microsoft Purview Data Catalog
After data is scanned and ingested into the Microsoft Purview data map, data consumers need to easily find the data needed for their analytics or governance workloads. Data discovery can be time consuming because you may not know where to find the data that you want. Even after finding the data, you may have doubts about whether you can trust the data and take a dependency on it.
The goal of search in Microsoft Purview is to speed up the process of quickly finding the data that matters. This article outlines how to search the Microsoft Purview data catalog to quickly find the data you're looking for.
Searching the catalog
The search bar can be quickly accessed from the top bar of the Microsoft Purview governance portal UX. In the data catalog home page, the search bar is in the center of the screen.
Once you click on the search bar, you'll be presented with your search history and the items recently accessed in the data catalog. This allows you to quickly pick up from previous data exploration that was already done.
Enter in keywords that help narrow down your search such as name, data type, classifications, and glossary terms. As you enter in search keywords, Microsoft Purview dynamically suggests results and searches that may fit your needs. To complete your search, click on "View search results" or press "Enter".
Once you enter in your search, Microsoft Purview returns a list of data assets and glossary terms a user is a data reader for to that matched to the keywords entered in.
The Microsoft Purview relevance engine sorts through all the matches and ranks them based on what it believes their usefulness is to a user. For example, a data consumer is likely more interested in a table curated by a data steward that matches on multiple keywords than an unannotated folder. Many factors determine an asset’s relevance score and the Microsoft Purview search team is constantly tuning the relevance engine to ensure the top search results have value to you.
If the top results don’t include the assets you're looking for, you can use the facets on the left-hand side to filter down by business metadata such glossary terms, classifications, and the containing collection. If you're interested in a particular data source type such as Azure Data Lake Storage Gen2 or Azure SQL Database, you can use a pill filter to narrow down your search.
Search will only return items in collections you're a data reader or curator for. For more information, see create and manage Collections.
For certain annotations, you can click on the ellipses to choose between an AND condition or an OR condition.
From the search results page, you can select an asset to view details such as schema, lineage, and classifications. To learn more about the asset details page, see Manage catalog assets.
Searching Microsoft Purview in connected services
Once you register your Microsoft Purview instance to an Azure Data Factory or an Azure Synapse Analytics workspace, you can search the Microsoft Purview data catalog directly from those services. To learn more, see Discover data in ADF using Microsoft Purview and Discover data in Synapse using Microsoft Purview.
Bulk edit search results
If you're looking to make changes to multiple assets returned by search, Microsoft Purview lets you modify glossary terms, classifications, and contacts in bulk. To learn more, see the bulk edit assets guide.
Browse the data catalog
While searching is great if you know what you're looking for, there are times where data consumers wish to explore the data available to them. The Microsoft Purview data catalog offers a browse experience that enables users to explore what data is available to them either by collection or through traversing the hierarchy of each data source in the catalog. For more information, see browse the data catalog.
Search query syntax
All search queries consist of keywords and operators. A keyword is a something that would be part of an asset's properties. Potential keywords can be a classification, glossary term, asset description, or an asset name. A keyword can be just a part of the property you're looking to match to. Use keywords and the operators to ensure Microsoft Purview returns the assets you're looking for.
Certain characters including spaces, dashes, and commas are interpreted as delimiters. Searching a string like
hive-database is the same as searching two keywords
The following table contains the operators that can be used to compose a search query. Operators can be combined as many times as need in a single query.
|OR||Specifies that an asset must have at least one of the two keywords. Must be in all caps. A white space is also an OR operator.||The query
|AND||Specifies that an asset must have both keywords. Must be in all caps||The query
|NOT||Specifies that an asset can't contain the keyword to the right of the NOT clause. Must be in all caps||The query
|()||Groups a set of keywords and operators together. When combining multiple operators, parentheses specify the order of operations.||The query
|""||Specifies exact content in a phrase that the query must match to.||The query
|field:keyword||Searches the keyword in a specific attribute of an asset. Field search is case insensitive and is limited to the following fields at this time:
Searching "*" will return all the assets and glossary terms in the catalog.
- Grouping isn't supported within a field search. Customers should use operators to connect field searches. For example,
name:(alice AND bob)is invalid search syntax, but
name:alice AND name:bobis supported.
Submit and view feedback for