Powerbi report creation from Azure purview data model lineage

Anto Reni Huxley Antokinsley 81 Reputation points
2021-10-10T10:59:25.027+00:00

I need to create a power bi report with the lineage of assets in Azure Purview. Below are the steps to be done for report creation from the scanned data model Purview studio.

  1. Record the Lineage of the Scanned data model.
  2. Use Rest APIs to extract the DataModel, Classifications, Glossary, Lineage in json/parquet format.
  3. Build Power BI Report using json/parquet format.

I was able to extract the guid of all Azure SQL DB entities by using the below mentioned link.
https://learn.microsoft.com/en-us/azure/sql-database/scripts/sql-database-import-purview-labels

Now I am stuck extracting the lineage for all the guids taken in the previous step dynamically in postman client/powershell. Kindly let me know how to solve this.

Microsoft Purview
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,045 questions
0 comments No comments
{count} votes

Accepted answer
  1. KranthiPakala-MSFT 46,437 Reputation points Microsoft Employee
    2021-10-11T18:28:38.383+00:00

    Hi @Anto Reni Huxley Antokinsley ,

    Thanks for using Microsoft Q&A forum and posting your query here.

    As per conversation with internal team, there is no good way to get lineage for multiple entities at a time.

    You might be able to identify all of the process entities and GET /entity/bulk and extract the inputs/outputs from there.

    Here is the endpoint that you need to use to get lineage info of the entity specified by GUID : Ref doc - Azure Purview Lineage - Get Lineage Graph

    Get lineage info of the entity specified by GUID.

    GET {Endpoint}/catalog/api/atlas/v2/lineage/{guid}?direction={direction}  
    

    With optional parameters:

    GET {Endpoint}/catalog/api/atlas/v2/lineage/{guid}?depth={depth}&width={width}&direction={direction}&includeParent={includeParent}&getDerivedLineage={getDerivedLineage}  
    

    PyAPacheAtlas makes this easier : pyapacheatlas.core package — PyApacheAtlas 0.9.0 documentation (wjohnson.github.io)

    Hope this info helps. Do let us know if you have further query

    ----------

    • Please don't forget to click on 130616-image.png and upvote 130671-image.png button whenever the information provided helps you. Original posters help the community find answers faster by identifying the correct answer. Here is how
    • Want a reminder to come back and check responses? Here is how to subscribe to a notification
    • If you are interested in joining the VM program and help shape the future of Q&A: Here is how you can be part of Q&A Volunteer Moderators

0 additional answers

Sort by: Most helpful