Hi @Dhanoordaran V
In "Run detail" under scan status, if "Metadata discovery" has status as "completed with exception" will it affect classification results?
It could affect classification results, depending on the nature of the exception and how critical the metadata is to the classification process.
The "completed with exception" status indicates that the metadata discovery process finished but encountered issues (exceptions) that prevented it from fully or correctly completing its task. For example, some files might not have been processed, or certain metadata might be missing or corrupted.
Exceptions during metadata discovery suggest that the system couldn’t fully prepare the data for classification. For instance:
- A missing schema might lead to incorrect feature mapping.
- Unprocessed files might reduce the dataset size, skewing results.
- Partial metadata might confuse the classifier about data context.
The severity depends on the amount of data affected; if only a small subset of files triggered exceptions, the impact might be negligible.
When examining the scan error log, it is mostly because of two errors :"parquet java invocation error" and "not a valid parquet file", how to address that.
The below Microsoft Documentation will help for the above question
https://learn.microsoft.com/en-us/azure/data-factory/connector-troubleshoot-parquet
I hope this information helps. Please do let us know if you have any further queries.
Kindly consider upvoting the comment if the information provided is helpful. This can assist other community members in resolving similar issues.
Thank you.