I found this project : https://github.com/blackrock/xml_to_parquet
Convert XML files into Apache Parquet format using just an XSD schema and XML file. This process uses the XSD to transform all content from the XML into a corresponding Parquet file, maintaining nested data structures that replicate the XML paths.
Convert a small XML file to a Parquet file
python xml_to_parquet.py -x PurchaseOrder.xsd PurchaseOrder.xml
INFO - 2021-01-21 12:32:38 - Parsing XML Files..
INFO - 2021-01-21 12:32:38 - Processing 1 files
DEBUG - 2021-01-21 12:32:38 - Generating schema from PurchaseOrder.xsd
DEBUG - 2021-01-21 12:32:38 - Parsing PurchaseOrder.xml
DEBUG - 2021-01-21 12:32:38 - Saving to file PurchaseOrder.xml.parquet
DEBUG - 2021-01-21 12:32:38 - Completed PurchaseOrder.xml
More links :
https://medium.com/@sonradata/convert-xml-with-spark-to-parquet-1c5ba561b193