An Azure service that provides an enterprise-grade analytics engine.
Hi Jaganathan, NK. (Naveen Kumar),
Thank you for posting query in Microsoft Q&A Platform.
Hope you are trying to use Azure data factory for this case. Kindly try below to make it work.
Use
GetMetaData ActivitywithChildItemsfiled selected in it, to list down all parquet file names from folder. And then useForEach Activityto loop over each file name. InsideForEach Activityuse anotherGetMetaData ActivitywithStructurefield selected. This gives you schema of parquet files. Here use copy activity, in Copy activity, use source as some dummy file and useadditional columnoption to send schema of parquet file to sink. Use SQL table as sink incopy activity.Once loop completes, in SQL table we will have each file name with schema of it. Now, you can use another
copy activityoutside ofForEach Activityto copy that table data to csv file.
Please consider check below videos to understand few of components of above implementation.
Add additional columns during copy in Azure Data Factory
Get Metadata Activity in Azure Data Factory
For Each activity in Azure Data Factory
Hope this helps. Please let me know if any further queries.
Please consider hitting Accept Answer button. Accepted answers help community as well. Thank you.