Purview - Extracting Metadata

Rob Sharpe 20 Reputation points
2024-05-24T14:26:06.05+00:00

Hi there,

I am looking to extract Metadata from a data source in Purview - its source is an AWS S3 bucket.

I have seen that I can extract the metadata into JSON format using the REST APIs.

My question is - what language should I use to do this? I have basic Python knowledge, so that might be an option.

Is there an existing option that I can use to do this, or do I need to write something to extract the metadata?

Thanks

Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.

Accepted answer
  1. PRADEEPCHEEKATLA 90,226 Reputation points
    2024-06-03T06:58:13.44+00:00

    @Rob Sharpe - You can extract metadata from the Microsoft Purview Data Map using its REST APIs, from any programming language that can make HTTP requests. Python is a good option, as it has libraries that help you make REST API calls.

    To extract metadata from Purview into a file or set of files, call the REST APIs and save the JSON responses. The GET /catalog/dataAssets/{dataAssetId}/metadata API returns all the metadata for a specific data asset, and the GET /catalog/dataAssets/{dataAssetId}/metadata/{nameSpace} API returns the metadata for a specific namespace only.
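
As a rough illustration of the two calls described above, here is a minimal Python sketch using only the standard library. The base URL form (for example `https://<account>.purview.azure.com`), the bearer-token header, and the helper names `metadata_url` and `fetch_metadata` are assumptions made for illustration. The paths are the ones quoted in this answer, so it is worth checking them against the current Purview REST API reference before relying on them.

```python
import json
import urllib.request
from urllib.parse import quote


def metadata_url(base_url, data_asset_id, namespace=None):
    """Build the metadata URL for one data asset, optionally for one namespace.

    The path layout follows the endpoints quoted in the answer (assumption:
    they are served under the account's base URL).
    """
    url = f"{base_url.rstrip('/')}/catalog/dataAssets/{quote(data_asset_id, safe='')}/metadata"
    if namespace:
        url += f"/{quote(namespace, safe='')}"
    return url


def fetch_metadata(base_url, data_asset_id, token, namespace=None):
    """Send the GET request and return the decoded JSON body.

    `token` is assumed to be an OAuth bearer token obtained separately.
    """
    request = urllib.request.Request(
        metadata_url(base_url, data_asset_id, namespace),
        headers={"Authorization": f"Bearer {token}", "Accept": "application/json"},
        method="GET",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Under these assumptions, `fetch_metadata(base_url, asset_id, token)` would return the parsed JSON for one asset, which can then be written to disk with `json.dump`. Acquiring the token is outside the scope of this sketch.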

    To limit the metadata extraction to just S3 programmatically, call the GET /catalog/dataAssets/{dataAssetId}/metadata/{nameSpace} API and specify the S3 namespace. For example, if the S3 namespace is awsS3, the following API call returns metadata for just the S3 data asset:

    GET /catalog/dataAssets/{dataAssetId}/metadata/awsS3
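
To tie the namespace-scoped call to the goal of producing JSON files, the following Python sketch loops over several assets and writes one file per asset. The function name `export_s3_metadata`, the `fetch` callable (anything that takes a request path and returns parsed JSON), and the assumption that asset IDs are safe to use as file names (such as GUIDs) are all hypothetical. The `awsS3` namespace and the path come from the example in this answer and are not verified against the live API.

```python
import json
from pathlib import Path


def export_s3_metadata(asset_ids, fetch, out_dir, namespace="awsS3"):
    """Fetch namespace-scoped metadata for each asset and write one JSON file per asset.

    `fetch` is a hypothetical callable: it receives a request path and
    returns the parsed JSON response for that path.
    """
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    written = []
    for asset_id in asset_ids:
        # Path as quoted in the answer; assumed, not verified against the service.
        path = f"/catalog/dataAssets/{asset_id}/metadata/{namespace}"
        metadata = fetch(path)
        # Assumes asset IDs contain no characters that are invalid in file names.
        target = out_path / f"{asset_id}.json"
        target.write_text(json.dumps(metadata, indent=2), encoding="utf-8")
        written.append(target)
    return written
```

Passing the HTTP call in as `fetch` keeps the export logic independent of how authentication and the base URL are handled, and makes it easy to try the function with a stub before pointing it at a real account.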
    

    I hope this helps! Let me know if you have any other questions.

