How to see the file size of all type of data assets in Purview

Dhanoordaran V 170 Reputation points
2025-04-17T14:59:57.3133333+00:00

I can able to see the file size for csv files inside the "properties" tab of data asset.

But I'm unable to see the size for most of the file types of "parquet" format and files from Synapse analytics.

Is there a way to get that detail directly or via api ?, please explain

Microsoft Purview
Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.
1,575 questions
0 comments No comments
{count} votes

Accepted answer
  1. Venkat Reddy Navari 1,780 Reputation points Microsoft External Staff Moderator
    2025-04-17T15:56:49.57+00:00

    Hi @Dhanoordaran V
    You're correct that Microsoft Purview shows file sizes for some file types (like CSV) under the Properties tab, but support for other formats—such as Parquet and certain Synapse Analytics assets—can be inconsistent depending on the source system and integration.

    Here are some key points to consider

    1. Parquet Files: For Parquet files stored in Azure Data Lake Storage (ADLS), file size metadata may not always be extracted by default. Ensure that the scan rule set for your data source includes the "Extract file-level metadata" option (enabled in advanced settings during scan configuration).
    2. Synapse Analytics: For Synapse SQL or dedicated pools, the Purview integration might not extract individual file sizes unless those are part of linked datasets (e.g., external tables over ADLS). Currently, file size metadata is more reliably extracted from storage-backed data sources rather than query-based systems.
    3. Using the Purview REST API: You can use the Purview Search API to query for assets and their metadata. However, file size is returned only if it was captured during the scan. Look for attributes like qualifiedName, name, and fileSize.

    Example API filter:

    {
      "keywords": "parquet",
      "filter": {
        "and": [
          {
            "attributeName": "fileSize",
            "operator": "isNotNull"
          }
        ]
      }
    }
    
    
    

    I hope this information helps. Please do let us know if you have any further queries.

    Kindly consider upvoting the comment if the information provided is helpful. This can assist other community members in resolving similar issues.

    1 person found this answer helpful.

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.