Get insights into data stewardship from Microsoft Purview

As described in the insights concepts, the data stewardship report is part of the "Health" section of the Data Estate Insights App. This report offers a one-stop shop experience for data, governance, and quality focused users like chief data officers and data stewards to get actionable insights into key areas of gap in their data estate.

In this guide, you'll learn how to:

  • Navigate and view data stewardship report from your Microsoft Purview account.
  • Drill down for more asset count details.

Prerequisites

Before getting started with Microsoft Purview Data Estate Insights, make sure that you've completed the following steps:

Understand your data estate and catalog health in Data Estate Insights

In Microsoft Purview Data Estate Insights, you can get an overview of all assets inventoried in the Data Map, and any key gaps that can be closed by governance stakeholders, for better governance of the data estate.

  1. Open the Microsoft Purview portal.

  2. If you're using the classic Microsoft Purview portal, select the Data estate insights menu item. If you're using the new Microsoft Purview portal open the Data catalog, select the Data estate insights drop down, and then select Reports.

  3. Select Data Stewardship.

View data stewardship dashboard

The dashboard is purpose-built for the governance and quality focused users, like data stewards and chief data officers, to understand the data estate health of their organization. The dashboard shows high level KPIs that need to reduce governance risks:

  • Asset curation: All data assets are categorized into three buckets - "Fully curated", "Partially curated" and "Not curated", based on certain attributes of assets being present. An asset is "Fully curated" if it has at least one classification tag, an assigned Data Owner and a description. If any of these attributes is missing, but not all, then the asset is categorized as "Partially curated" and if all of them are missing, then it's "Not curated".

  • Asset data ownership: Assets that have the owner attribute within "Contacts" tab as blank are categorized as "No owner", else it's categorized as "Owner assigned".

    Screenshot of the data stewardship insights summary graphs, showing the three main KPI charts.

Data estate health

Data estate health is a scorecard view that helps management and governance focused users, like chief data officers, understand critical governance metrics that can be looked at by collection hierarchy.

Screenshot of the data stewardship data estate health table in the middle of the dashboard.

You can view the following metrics:

  • Assets: Count of assets by collection drill-down
  • With sensitive classifications: Count of assets with any system classification applied
  • Fully curated assets: Count of assets that have a data owner, at least one classification and a description.
  • Owner assigned: Count of assets with data owner assigned on them
  • No classifications: Count of assets with no classification tag
  • Out of date: Percentage of assets that have not been updated in over 365 days.
  • New: Count of new assets pushed in the Data Map in the last 30 days
  • Updated: Count of assets updated in the Data Map in the last 30 days
  • Deleted: Count of deleted assets from the Data Map in the last 30 days

You can also drill down by collection paths. As you hover on each column name, it provides description of the column, provides recommended percentage ranges, and takes you to the detailed graph for further drill-down.

Screenshot of the data stewardship data estate health table, with the fully curated column hovered over. A summary is show, and the view more in Stewardship insights option is selected.

Screenshot of the asset curation detailed view, as shown after selecting the view more in stewardship insights option is selected.

Asset curation

All data assets are categorized into three buckets - Fully curated, Partially curated and Not curated, based on whether assets have been given certain attributes.

Screenshot of the data stewardship insights health dashboard, with the asset curation bar chart highlighted.

An asset is Fully curated if it has at least one classification tag, an assigned data owner, and a description.

If any of these attributes is missing, but not all, then the asset is categorized as Partially curated. If all of them are missing, then it's listed as Not curated.

You can drill down by collection hierarchy.

Screenshot of the data stewardship asset curation chart, with the collection filter opened to show all available collections.

For further information about which assets aren't fully curated, you can select View details link that will take you into the deeper view.

Screenshot of the data stewardship asset curation chart, with the view details button highlighted below the chart.

In the View details page, if you select a specific collection, it will list all assets with attribute values or blanks that make up the fully curated assets.

Screenshot of the asset curation detailed view, shown after selecting View Details beneath the asset curation chart.

The detail page shows two key pieces of information:

First, it tells you what was the classification source, if the asset is classified. It's Manual if a curator/data owner applied the classification manually. It's Automatic if it was classified during scanning. This page only provides the last applied classification state.

Second, if an asset is unclassified, it tells us why it's not classified, in the column Reasons for unclassified. Currently, Data estate insights can tell one of the following reasons:

  • No match found
  • Low confidence score
  • Not applicable

Screenshot of the collection detail view, with the classification source and reasons for unclassified columns highlighted.

You can select any asset and add missing attributes, without leaving the Data estate insights app.

Screenshot of the asset list page, with an asset selected and the edit menu open.

This graph shows how the assets and key metrics have been trending over:

  • Last 30 days: The graph takes last run of the day or recording of the last run across days as a data point.
  • Last six weeks: The graph takes last run of the week where week ends on Sunday. If there was no run on Sunday, then it takes the last recorded run.
  • Last 12 months: The graph takes last run of the month.
  • Last four quarters: The graph takes last run of the calendar quarter.

Screenshot of the data stewardship insights summary graphs, with data estate selected, showing the trends and gap analysis graph at the bottom of the page.

Next steps

Learn more about Microsoft Purview Data Estate Insights through: