Share via

How to scan a single data ADLS source for multiple collectioons

Oluyemisi 15 Reputation points
2024-04-12T16:14:33.4766667+00:00

We have recently implemented Purview as apart of a data strategy project that involves ADLS Gen 2. We have multiple data pipelines into a single ADSL container.

Now in implementing Purview, and looking to create separate Collections for each business function/domain, we understand that we cannot use the same data source for multiple collections.

What is the best way to achieve separating the collections for the different functions so that Data Steward for each function can manage their respective data assets?

Azure Data Factory
Azure Data Factory

An Azure service for ingesting, preparing, and transforming data at scale.

Microsoft Security | Microsoft Purview

1 answer

Sort by: Most helpful
  1. Harishga 6,005 Reputation points Microsoft External Staff
    2024-04-15T05:01:55.4333333+00:00

    Hi @Oluyemisi

    Welcome to Microsoft Q&A platform and thanks for posting your question here.

    Why you cannot use the same data source for multiple collections in Microsoft Purview

    The reason why you cannot use the same data source for multiple collections in Microsoft Purview is that each collection is designed to represent a specific business function or domain, and it needs to have its own set of metadata and data policies. If you use the same data source for multiple collections, it can lead to conflicts in metadata and data policies, which can cause confusion and errors in data governance. 

    The best way to separate collections for different functions.

    The best way to achieve separating the collections for different functions is to create separate subfolders in your ADLS container for each business function/domain and register each subfolder as a separate data source in Microsoft Purview. This will allow you to create separate collections for each business function/domain while still using the same data source.

    Example of how to separate collections for different functions.

    For example, let's say you have an ADLS container named "my container" and you want to create separate collections for two business functions: Finance and Sales. You can create two subfolders in "my container" named "finance" and "sales" and register each subfolder as a separate data source in Microsoft Purview. Then, you can create two collections in Microsoft Purview named "Finance" and "Sales" and assign the Data Steward for each function to the corresponding collection. Finally, you can register the assets for each function in the corresponding collection.

    Reference
    https://learn.microsoft.com/en-us/purview/how-to-create-and-manage-collections

    Hope this helps. Do let us know if you any further queries.


    If this answers your query, do click Accept Answer and Yes for was this answer helpful. And, if you have any further query do let us know.

    Was this answer helpful?

    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.