Excel or CSV file to XML - ADF

Question

Santhi Dhanuskodi 325

Hi, I want to convert an Excel/CSV file into XML format. This XML is used as the body text for a REST API.

Basically, I need to send Excel data to a REST API using ADF. The REST API accepts XML format. The Excel file is present in a storage container.

What are the possible ways to achieve this? A Python script, or does data flow have built-in support for Excel-to-XML conversion? Any other ways? If using a Python script, what are the dependencies for executing it? What are the infra requirements? Where will this script run from, and how?

AnnuKumari-MSFT 34,556 Reputation points Microsoft Employee Moderator

2024-03-08T08:33:19.0033333+00:00

Hi Santhi Dhanuskodi,

I have updated the answer below. Kindly check whether it helped. Please consider clicking Accept Answer, as accepted answers help the community as well. Also, please click Yes for the survey 'Was the answer helpful'.

AnnuKumari-MSFT 34,556 Reputation points Microsoft Employee Moderator

2024-03-12T09:08:44.3366667+00:00

Hi Santhi Dhanuskodi,

Just following up to see if the answer below helped. Please consider clicking Accept Answer, as accepted answers help the community as well. Also, please click Yes for the survey 'Was the answer helpful'.

Answer 1

@Santhi Dhanuskodi

Thanks for using the Microsoft Q&A platform and for posting your query here.

I understand that you want to convert Excel/CSV to XML using ADF.

You can write Python code in an Azure Function to convert the Excel/CSV file to XML format, and run that function from an ADF pipeline.

Kindly refer to the resources below:

https://www.geeksforgeeks.org/how-to-convert-excel-to-xml-format-in-python/
https://blog.groupdocs.cloud/conversion/convert-xml-to-excel-and-excel-to-xml-in-python/

Alternatively, you can write PySpark code in an Azure Synapse notebook.

To read data from an Excel file:

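    import pandas as pd

    account_key_value = "your_storage_acc_key"
    # pandas reads the workbook directly from ADLS Gen2 via the abfs:// path
    df = pd.read_excel('abfs://******@jadls2.dfs.core.windows.net/xl_files/sample.xlsx', storage_options={'account_key': account_key_value})
    # `spark` and `display` are provided by the notebook session
    s_df = spark.createDataFrame(df)
    display(s_df)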

Instead of the account key, you can also use one of these options:

    storage_options = {'sas_token' : 'sas_token_value'}
    storage_options = {'connection_string' : 'connection_string_value'}
    storage_options = {'tenant_id': 'tenant_id_value', 'client_id' : 'client_id_value', 'client_secret': 'client_secret_value'}

To write data to an XML file:

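    # Example pattern: read books.xml, then write two of its columns back out as XML.
    # For the Excel case, write s_df from the previous snippet instead of df.
    # Note: .xml() relies on XML support in the Spark runtime (see the Databricks link below);
    # where that is missing, the spark-xml library is likely needed.
    df = spark.read.format('xml').options(rowTag='book').load('books.xml')
    (df.select("author", "_id").write
      .options(rowTag='book', rootTag='books')
      .xml('newbooks.xml')
    )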

Here are resources you can refer to:

https://learn.microsoft.com/en-us/azure/databricks/query/formats/xml

https://stackoverflow.com/questions/76812820/how-to-read-excel-file-in-synapse-notebook-which-is-in-azure-datalake-gen-2-stor

Hope it helps. Kindly accept the answer by clicking the Accept answer button. Thank you.

Santhi Dhanuskodi 325 Reputation points

2024-02-29T11:22:53.6866667+00:00

The sink doesn't support XML format.

AnnuKumari-MSFT 34,556 Reputation points Microsoft Employee Moderator

2024-03-06T09:19:04.6166667+00:00

Hi Santhi Dhanuskodi,

Apologies, my bad. I have updated my answer. Kindly check if it helps resolve your query. If it does, please consider accepting the answer by clicking the Accept answer button. Thank you.

Answer 2

XML format is supported for the following connectors: Amazon S3, Amazon S3 Compatible Storage, Azure Blob, Azure Data Lake Storage Gen1, Azure Data Lake Storage Gen2, Azure Files, File System, FTP, Google Cloud Storage, HDFS, HTTP, Oracle Cloud Storage and SFTP. It is supported as a source but not as a sink. Consider exploring the "Transform XML" feature in Azure Logic Apps to see if it meets your needs:

A guide on using Azure Logic Apps for importing CSV files to SQL Server: http://blogs.recneps.net/post/Using-Azure-Logic-Apps-to-Import-CSV-to-SQL-Server
A discussion on Stack Overflow about converting CSV to XML in Logic Apps: https://stackoverflow.com/questions/61417713/how-to-convert-a-csv-to-xml-in-logic-apps
A forum post on MSDN about converting CSV/Excel to XML in Logic Apps: https://social.msdn.microsoft.com/Forums/en-US/dfce78db-ebd6-41d9-ac7b-423a1f83f186/convert-csvexcell-to-xml-in-logic-app

Additionally, you are strongly encouraged to support existing feature requests on the ADF User Voice forum by upvoting or commenting. This can help prioritize the development of features such as:

XML support at the sink side
Support for XML files for ADLS as a sink

If you are able to use Python:

    import pandas as pd
    import xml.etree.ElementTree as ET

    # Load the CSV file (for an .xlsx workbook, use pd.read_excel instead)
    df = pd.read_csv('path_to_your_file.csv')

    # Convert the DataFrame to XML: one <Record> per row, one child element per column
    root = ET.Element("Root")
    for _, row in df.iterrows():
        record = ET.SubElement(root, "Record")
        for col in df.columns:
            # The column name is used as the element name, so it must be a valid XML name
            child = ET.SubElement(record, col)
            child.text = str(row[col])

    # Serialize the XML tree; with an explicit encoding, tostring returns bytes
    xml_str = ET.tostring(root, encoding='utf8', method='xml')

    # Save or use the XML string as needed
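
The snippet above uses each column name directly as an element name and writes missing cells as the text "nan". A minimal sketch of one way to handle both, using only the standard library; the helper names `to_xml_tag` and `records_to_xml` are hypothetical, and the replacement rule (anything outside letters, digits, `_`, `-`, `.` becomes `_`) is an assumption rather than a requirement of any API:

```python
import math
import re
import xml.etree.ElementTree as ET


def to_xml_tag(name):
    """Turn an arbitrary column header into a usable XML element name (hypothetical helper)."""
    # Replace anything other than ASCII letters, digits, '_', '-' or '.' with '_'.
    tag = re.sub(r"[^A-Za-z0-9_.-]", "_", str(name).strip())
    # Element names may not start with a digit, '-' or '.', so prefix those with '_'.
    if not tag or not (tag[0].isalpha() or tag[0] == "_"):
        tag = "_" + tag
    return tag


def records_to_xml(records, root_tag="Root", row_tag="Record"):
    """Build XML bytes from an iterable of dicts, one dict per row."""
    root = ET.Element(root_tag)
    for row in records:
        record = ET.SubElement(root, row_tag)
        for col, value in row.items():
            child = ET.SubElement(record, to_xml_tag(col))
            # Treat None and float NaN (pandas' missing-value marker) as empty text.
            missing = value is None or (isinstance(value, float) and math.isnan(value))
            child.text = "" if missing else str(value)
    # xml_declaration requires Python 3.8 or later.
    return ET.tostring(root, encoding="utf-8", xml_declaration=True)
```

With a pandas DataFrame, the rows could be passed as `records_to_xml(df.to_dict(orient="records"))`.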

Santhi Dhanuskodi 325 Reputation points

2024-02-27T05:46:52.5666667+00:00

I want to use a Python script. How can I read the file present in the Azure storage container, process it, and place the XML file back in the storage container? Could you please give a sample script?
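
A minimal sketch of such a script, assuming the `azure-storage-blob` package is installed and the storage account is reachable through a connection string. The function names and the container and blob names in the usage line are hypothetical placeholders. The conversion uses only the standard library and assumes the CSV has a header row whose names are already valid XML element names:

```python
import csv
import io
import xml.etree.ElementTree as ET


def csv_bytes_to_xml(csv_bytes, root_tag="Root", row_tag="Record"):
    """Convert CSV bytes (with a header row) to XML bytes."""
    # utf-8-sig drops a leading byte-order mark, which Excel often adds to CSV exports.
    text = csv_bytes.decode("utf-8-sig")
    reader = csv.DictReader(io.StringIO(text, newline=""))
    root = ET.Element(root_tag)
    for row in reader:
        record = ET.SubElement(root, row_tag)
        for col, value in row.items():
            # Assumption: each header is already a valid XML element name.
            ET.SubElement(record, col).text = value
    return ET.tostring(root, encoding="utf-8", xml_declaration=True)


def convert_blob(connection_string, container, source_name, target_name):
    """Download a CSV blob, convert it, and upload the XML to the same container."""
    # Imported here so the converter above stays usable without the SDK;
    # requires `pip install azure-storage-blob`.
    from azure.storage.blob import BlobServiceClient

    service = BlobServiceClient.from_connection_string(connection_string)
    container_client = service.get_container_client(container)

    csv_bytes = container_client.download_blob(source_name).readall()
    xml_bytes = csv_bytes_to_xml(csv_bytes)
    container_client.upload_blob(name=target_name, data=xml_bytes, overwrite=True)
```

A call might look like `convert_blob(conn_str, "input-container", "sample.csv", "sample.xml")`. For an .xlsx workbook, the downloaded bytes could instead be loaded with `pandas.read_excel(io.BytesIO(data))`, which needs the `openpyxl` package. A script like this can run anywhere Python and the SDK are available, for example in an Azure Function called from the ADF pipeline, as suggested in Answer 1.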
