Ingest multiple xml files through ADF which has different schemas

Raymond Mitchell 15 Reputation points
2023-05-08T15:25:06.67+00:00

Hi all,

I have an Azure blob storage of xml files with at least two different schemas. I would like to ingest all of these files through ADF (probably using Dataflow) and only process one type of schema to sink to a SQL database. How is this done correctly as I'm unable to query any xml data (File contents in the data flow). As a backup, I can upload all data into a staging table and use sql and adf pipeline to deal with this but I feel that XML handling in ADF should be better. Thanks

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
{count} votes

1 answer

Sort by: Most helpful
  1. ShaikMaheer-MSFT 38,631 Reputation points Microsoft Employee Moderator
    2023-05-09T11:43:55.7166667+00:00

    Hi Ray Mitchell,

    Thank you for posting query in Microsoft Q&A Platform.

    You need to consider creating dataset of XML format pointing to you XML file and use that XML dataset in dataflows. You can consider using derived column transformation or filter transformation or select transformation as per need and to filter out not requirement data or columns.

    Also, you rightly mentioned you can consider using staging table approach too.

    Hope this helps. Please let me know how it goes or if you struck at any specific place then kindly share your implementation details along with sample input and output.


    Please consider hitting Accept Answer button. Accepted answers help community as well.


Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.