Hi Jaganathan, NK. (Naveen Kumar),
Thank you for posting query in Microsoft Q&A Platform.
Hope you are trying to use Azure data factory for this case. Kindly try below to make it work.
Use
GetMetaData Activity
withChildItems
filed selected in it, to list down all parquet file names from folder. And then useForEach Activity
to loop over each file name. InsideForEach Activity
use anotherGetMetaData Activity
withStructure
field selected. This gives you schema of parquet files. Here use copy activity, in Copy activity, use source as some dummy file and useadditional column
option to send schema of parquet file to sink. Use SQL table as sink incopy activity
.Once loop completes, in SQL table we will have each file name with schema of it. Now, you can use another
copy activity
outside ofForEach Activity
to copy that table data to csv file.
Please consider check below videos to understand few of components of above implementation.
Add additional columns during copy in Azure Data Factory
Get Metadata Activity in Azure Data Factory
For Each activity in Azure Data Factory
Hope this helps. Please let me know if any further queries.
Please consider hitting Accept Answer
button. Accepted answers help community as well. Thank you.