Get max value from a parquet file by using ADF

Zhu, Yueli YZ [NC] 280 Reputation points
2024-01-24T18:43:16.22+00:00

Hi, I am able to use an ADF data flow to read a parquet file and get the row numbers of this parquet file. The data flow is shown below.

[Image: screenshot of the data flow]

But when I followed https://stackoverflow.com/questions/74813503/how-to-take-max-value-and-replace-it-in-drived-colomn-in-data-factory and tried to get the max value of one column, it failed. As soon as I added the window transformation, I could not even run Data Preview on the source. Here is the error message:

Spark job failed: { "text/plain": "{"runId":"f128afcc-c3af-4bb7-a0a9-0544303c577b","sessionId":"0a1d08df-99ce-496f-b437-f7e098422c65","status":"Failed","payload":{"statusCode":400,"shortMessage":"DF-EXPR-010 at Window 'window1'(Line 16/Col 35): Column 'Hello' used in expression is unavailable or invalid.","detailedMessage":"Failure 2024-01-24 18:34:29.554 failed DebugManager.processJob, run=f128afcc-c3af-4bb7-a0a9-0544303c577b, errorMessage=DF-EXPR-010 at Window 'window1'(Line 16/Col 35): Column 'Hello' used in expression is unavailable or invalid."}}\n" } - RunId: f128afcc-c3af-4bb7-a0a9-0544303c577b

Do you have any suggestions on how to get the max value from a parquet file? Thanks

Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

Answer accepted by question author
  1. Subashri Vasudevan 11,306 Reputation points Volunteer Moderator
    2024-01-25T08:01:38.1166667+00:00

    Hello,

    It looks like you haven't imported the schema for the source parquet file. Once you import the schema, you will see the number of columns in the source transformation. Currently it doesn't show any column info, which means the schema has not been imported, so the window expression cannot resolve the column 'Hello'.

    Go to the source dataset and import the schema from the file. Then add a window transformation to find the max key.
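
To make the intent of the window step concrete, here is a minimal plain-Python sketch of the logic. It is not ADF data flow script: the function names (`column_max`, `window_max`), the output column `max_value`, and the sample rows are all hypothetical, and only the column name 'Hello' comes from the error message above. It shows two things: an expression can only use a column that exists in the known schema, and a max over a window with no partitioning attaches the same column-wide maximum to every row.

```python
from typing import Any, Optional


def column_max(rows: list[dict[str, Any]], column: str) -> Optional[Any]:
    """Return the largest non-null value of `column`, or None if there is none."""
    # Mimic the schema check: a column missing from the rows cannot be used.
    if any(column not in row for row in rows):
        raise KeyError(f"Column '{column}' used in expression is unavailable or invalid.")
    values = [row[column] for row in rows if row[column] is not None]
    return max(values) if values else None


def window_max(
    rows: list[dict[str, Any]], column: str, out: str = "max_value"
) -> list[dict[str, Any]]:
    """Copy every row and add the column-wide max under the key `out`."""
    top = column_max(rows, column)
    return [{**row, out: top} for row in rows]


if __name__ == "__main__":
    # Hypothetical sample data standing in for the parquet rows.
    sample = [
        {"Hello": 3, "name": "a"},
        {"Hello": 7, "name": "b"},
        {"Hello": 5, "name": "c"},
    ]
    for row in window_max(sample, "Hello"):
        print(row)
```

If only the single max value is needed rather than one value per row, the `column_max` half of the sketch is enough; in a data flow, an aggregate step without grouping plays a similar role.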

    Please try this and let us know if you have any further questions. Thank you.

    1 person found this answer helpful.
