@Rob Sharpe - To extract metadata from Microsoft Purview Data Map, you can use its REST APIs. Any programming language that can make HTTP requests will work. Python is a good option because it has libraries that make REST API calls straightforward.
To extract metadata from Purview into a file or set of files, call GET /catalog/dataAssets/{dataAssetId}/metadata to get all the metadata for a specific data asset, or GET /catalog/dataAssets/{dataAssetId}/metadata/{nameSpace} to get the metadata for a single namespace.
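As a minimal sketch of that flow in Python, assuming the endpoint paths quoted above are correct for your account (please check them against the official Purview REST API reference), the example below builds the request URL, sends an authenticated GET, and writes the JSON response to a file. The base URL, the bearer token, and the helper names build_metadata_url and save_metadata are hypothetical and used only for illustration.

```python
import json
import urllib.request
from urllib.parse import quote


def build_metadata_url(base_url, data_asset_id, namespace=None):
    """Build the metadata URL for one data asset, optionally limited to one namespace.

    The path layout is an assumption taken from the endpoints quoted in this answer.
    """
    url = f"{base_url.rstrip('/')}/catalog/dataAssets/{quote(data_asset_id, safe='')}/metadata"
    if namespace:
        url += f"/{quote(namespace, safe='')}"
    return url


def save_metadata(base_url, token, data_asset_id, out_path, namespace=None):
    """Send the GET request and write the JSON response to out_path."""
    request = urllib.request.Request(
        build_metadata_url(base_url, data_asset_id, namespace),
        headers={"Authorization": f"Bearer {token}", "Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        metadata = json.load(response)
    with open(out_path, "w", encoding="utf-8") as fh:
        json.dump(metadata, fh, indent=2)
    return metadata
```

You would call save_metadata once per data asset, passing your own account endpoint and a valid access token.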
To programmatically limit the metadata extraction to just S3, use the GET /catalog/dataAssets/{dataAssetId}/metadata/{nameSpace} API and specify the S3 namespace. For example, if the S3 namespace is awsS3, you can use the following API call to get metadata for just the S3 data asset:
GET /catalog/dataAssets/{dataAssetId}/metadata/awsS3
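If you have several S3 data assets, the same call can be looped. This is only a rough sketch: the awsS3 namespace name and the endpoint path are taken from the example above and should be confirmed for your environment, and fetch_json and export_s3_metadata are hypothetical helper names. It writes one JSON file per data asset.

```python
import json
import urllib.request
from pathlib import Path

S3_NAMESPACE = "awsS3"  # assumption: namespace name from the example above


def fetch_json(url, token):
    """GET a URL with a bearer token and return the parsed JSON body."""
    request = urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def export_s3_metadata(base_url, token, data_asset_ids, out_dir, fetch=fetch_json):
    """Write the S3-namespace metadata of each data asset to <out_dir>/<id>.json."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for asset_id in data_asset_ids:
        # Endpoint path is an assumption based on the call shown above.
        url = f"{base_url.rstrip('/')}/catalog/dataAssets/{asset_id}/metadata/{S3_NAMESPACE}"
        out_file = out_dir / f"{asset_id}.json"
        out_file.write_text(json.dumps(fetch(url, token), indent=2), encoding="utf-8")
        written.append(out_file)
    return written
```

The fetch parameter lets you swap in a different HTTP client, or a stub when trying the function without network access.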
I hope this helps! Let me know if you have any other questions.