XML to JSON conversion in ADF missing out 'lang' tag details

Alefiya Tikiwala 0 Reputation points
2025-02-20T08:47:03.38+00:00
I am converting an xml from html source to a json file using ADF. However, for one of the attributes, the 'lang' element is not getting translated.

The xml data in file is as follows:

'''

<displayname xml:lang="deAT"> Weiß</displayname>

<displayname xml:lang="deCH"> Weiß</displayname>

<displayname xml:lang="deDE"> Weiß</displayname>

<displayname xml:lang="xdefault">Colour Portable Photo Printer  White</displayname>

<displayname xml:lang="enAE">Colour Portable Photo Printer  White</displayname>

<displayname xml:lang="enGB">Colour Portable Photo Printer  White</displayname>

<displayname xml:lang="enIE">Colour Portable Photo Printer  White</displayname>

<displayname xml:lang="esES">blanco</displayname>

<displayname xml:lang="frBE"> Blanc</displayname>

<displayname xml:lang="frCH"> Blanc</displayname>

<displayname xml:lang="frFR"> Blanc</displayname>

<displayname xml:lang="itIT"> Bianco</displayname>

<displayname xml:lang="nlBE"> wit</displayname>

<displayname xml:lang="nlNL"> wit</displayname>

<displayname xml:lang="plPL"> biała</displayname>

<displayname xml:lang="ptPT"> Branco</displayname>

'''

However, in my json file in sink it is translated as follows:'

'''

 "displayname": [

                    "Weiß",

                    "Weiß",

                    "Weiß",

                    "Colour Portable Photo Printer  White",

                    "Colour Portable Photo Printer  White",

                    "Colour Portable Photo Printer  White ",

                    "Colour Portable Photo Printer  White",

                    "blanco",

                    " Blanc",

                    " Blanc",

                    " Blanc",

                    " Bianco",

                    " wit",

                    " wit",

                    "biała",

                    " Branco"

                ],

'''

I want the 'lang' tag value to filter out data based on different languages. The array in json doesn't let me identify the language name associated i.e. nlNL, frFR etc.

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,345 questions
{count} votes

1 answer

Sort by: Most helpful
  1. Alefiya Tikiwala 0 Reputation points
    2025-02-24T13:10:55.37+00:00

    User's image

    What expression should be put in derived column transformation in the dataflow for extracting lang attribute?
    Also, as you mentioned regarding 'Preserve Attributes' option in XML dataset in ADF, can you tell me where is it found? Maybe is it because my XML is derived from HTTP connection & not from ADLS? @Smaran Thoomu


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.