If "Metadata discovery" has the status "completed with exception", will it affect classification results? Please explain

Dhanoordaran V 150 Reputation points
2025-04-04T07:00:27.28+00:00

Question 1:

In "Run detail" under scan status, if "Metadata discovery" has the status "completed with exception", will it affect classification results? Please explain the logic behind it.

Question 2:

The scan error log shows mostly two errors: "parquet java invocation error" and "not a valid parquet file". How can these be addressed?

Kindly help to address them.

Thanks

Microsoft Purview
A Microsoft data governance service that helps manage and govern on-premises, multicloud, and software-as-a-service data. Previously known as Azure Purview.

Accepted answer
  1. J N S S Kasyap 1,625 Reputation points Microsoft External Staff
    2025-04-04T08:43:20.82+00:00

    Hi @Dhanoordaran V

    In "Run detail" under scan status, if "Metadata discovery" has status as "completed with exception" will it affect classification results?

    It could affect classification results, depending on the nature of the exception and how critical the metadata is to the classification process.
    The "completed with exception" status indicates that the metadata discovery process finished but encountered issues (exceptions) that prevented it from fully or correctly completing its task. For example, some files might not have been processed, or certain metadata might be missing or corrupted.

    Exceptions during metadata discovery mean that some assets were not fully prepared for classification. For instance:

    • If a file's schema could not be read, its columns might not be classified at all.
    • Files that could not be processed might be missing from the classification results.
    • Partial metadata might lead to incomplete classifications for the affected assets.

    The severity depends on the amount of data affected; if only a small subset of files triggered exceptions, the impact might be negligible.
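
To gauge how much of the scan was affected, one option is to tally the error messages in an exported copy of the scan error log. The following is a minimal Python sketch, assuming the log is available as plain text lines. The sample lines, the total file count, and the `tally_scan_errors` and `affected_share` helpers are hypothetical; only the two error strings come from the question.

```python
from collections import Counter

# The two error messages quoted in the question; matched case-insensitively.
KNOWN_ERRORS = (
    "parquet java invocation error",
    "not a valid parquet file",
)


def tally_scan_errors(log_lines):
    """Count how many log lines mention each known error message."""
    counts = Counter()
    for line in log_lines:
        lowered = line.lower()
        for message in KNOWN_ERRORS:
            if message in lowered:
                counts[message] += 1
    return counts


def affected_share(error_count, total_files):
    """Fraction of scanned files that raised an error (0.0 if nothing was scanned)."""
    return error_count / total_files if total_files else 0.0


if __name__ == "__main__":
    # Hypothetical log lines; the real log layout may differ.
    sample = [
        "file=a.parquet error=Parquet Java invocation error",
        "file=b.parquet error=Not a valid Parquet file",
        "file=c.parquet error=Not a valid Parquet file",
    ]
    counts = tally_scan_errors(sample)
    print(counts)
    print(affected_share(sum(counts.values()), total_files=200))
```

A small share suggests the impact on classification is limited to those files, while a large share suggests the source data itself should be checked before rescanning.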

    The scan error log shows mostly two errors: "parquet java invocation error" and "not a valid parquet file". How can these be addressed?

    The following Microsoft documentation covers troubleshooting for Parquet errors:

    https://learn.microsoft.com/en-us/azure/data-factory/connector-troubleshoot-parquet
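
For the "not a valid parquet file" error, a quick local check on a downloaded copy of a failing file can help separate truly malformed files from other causes. A Parquet file begins and ends with the 4-byte marker `PAR1`. The sketch below tests only for that marker, and the `looks_like_parquet` helper name is hypothetical. It does not validate the footer or the column data, so a file that passes can still fail to scan for other reasons.

```python
import os

PARQUET_MAGIC = b"PAR1"  # 4-byte marker at both the start and the end of a Parquet file


def looks_like_parquet(path):
    """Return True if the file starts and ends with the Parquet magic bytes."""
    # Smallest possible layout: magic + 4-byte footer length + magic = 12 bytes.
    if os.path.getsize(path) < 12:
        return False
    with open(path, "rb") as f:
        head = f.read(4)
        f.seek(-4, os.SEEK_END)
        tail = f.read(4)
    return head == PARQUET_MAGIC and tail == PARQUET_MAGIC
```

Files that fail this check (for example, empty files, partially written files, or files in another format carrying a `.parquet` extension) are likely candidates for the error.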

    I hope this information helps. Please do let us know if you have any further queries.

    Kindly consider upvoting the comment if the information provided is helpful. This can assist other community members in resolving similar issues.

    Thank you.
