Build an end to end data governance and master data management stack with Microsoft Purview and CluedIn

Intermediate
Developer
Microsoft Purview

Deploy a data technology stack that provides data governance, data quality, data lineage, data enrichment, and data standardization layers to your Azure ecosystem. Combine the power of Microsoft Purview, Azure Data Factory, and CluedIn into a powerhouse of data governance and data quality. This data pipeline takes raw data and surfaces high quality and ready-for-insight data to downstream systems and users.

Note

This is a Guided Project module where you complete an end-to-end project by following step-by-step instructions. Steps to deploy an environment are included.

Learning objectives

In this module, you'll:

  • Deploy a complete data governance and master data management environment.
  • Set up automated scans on Azure Data Lake Store Gen2 (ADLS Gen2) into Microsoft Purview.
  • Set up automated data pipelines from Azure Data Factory to CluedIn through assets registered in Microsoft Purview.
  • Stream data from CluedIn to ADLS Gen2.
  • Standardize, clean, deduplicate, and enrich your data in CluedIn.

Prerequisites

  • An Azure account with an active subscription. If you don't have one, you can follow this link to create a free subscription.
  • Ability to navigate the Azure portal
  • Familiarity with Azure Data Factory
  • Familiarity with high-level concepts of Microsoft Purview