Using ADF to load Parquet files into Azure SQL Database

Kman 41 Reputation points
2021-10-14T07:39:14.487+00:00

We are ingesting data from Oracle (On-premises) using Self Hosted Integration Runtime using Azure Data Factory into Azure SQL Database.

I wanted to know if we can load Parquet files into Azure SQL Database using Azure Data Factory. We are not using Azure Synapse or Databricks or any form of Spark.

Azure SQL Database
Azure Data Factory

1 answer

  1. KranthiPakala-MSFT 46,737 Reputation points Microsoft Employee Moderator
    2021-10-15T07:24:24.987+00:00

    Hi @Kman ,

    Thanks for using Microsoft Q&A forum and posting your query.

    Yes, you can load Parquet files into Azure SQL Database with Azure Data Factory's Copy activity, and you can also copy data from Oracle to Parquet format using Azure Data Factory.

    Parquet format is supported for the following ADF connectors: Amazon S3, Amazon S3 Compatible Storage, Azure Blob, Azure Data Lake Storage Gen1, Azure Data Lake Storage Gen2, Azure Files, File System, FTP, Google Cloud Storage, HDFS, HTTP, Oracle Cloud Storage and SFTP.
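
    A Copy activity that reads Parquet and writes to Azure SQL Database is authored as JSON. Below is a minimal sketch of that activity fragment, built as a Python dict so it can be inspected or posted via the management API. The dataset reference names (`ParquetInput`, `AzureSqlOutput`) are placeholders, not names from this thread:

    ```python
    import json

    # Sketch of an ADF Copy activity: Parquet source -> Azure SQL sink.
    # "ParquetSource" and "AzureSqlSink" are the standard ADF type names;
    # the dataset names are illustrative placeholders.
    copy_activity = {
        "name": "CopyParquetToAzureSql",
        "type": "Copy",
        "inputs": [{"referenceName": "ParquetInput", "type": "DatasetReference"}],
        "outputs": [{"referenceName": "AzureSqlOutput", "type": "DatasetReference"}],
        "typeProperties": {
            "source": {"type": "ParquetSource"},
            "sink": {"type": "AzureSqlSink", "writeBehavior": "insert"},
        },
    }

    print(json.dumps(copy_activity, indent=2))
    ```

    The same skeleton works for the Oracle-to-Parquet direction by swapping the source type and datasets accordingly.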

    Azure Data Factory supports the following file formats: Avro, Binary, Delimited text, Excel, JSON, ORC, Parquet, and XML.

    Please note that for copies empowered by the Self-hosted Integration Runtime (e.g. between on-premises and cloud data stores), if you are not copying Parquet files as-is, you need to install the 64-bit JRE 8 (Java Runtime Environment) or OpenJDK, and the Microsoft Visual C++ 2010 Redistributable Package, on your IR machine.

    For copies running on a Self-hosted IR with Parquet file serialization/deserialization, the service locates the Java runtime by first checking the registry key (SOFTWARE\JavaSoft\Java Runtime Environment\{Current Version}\JavaHome) for a JRE and, if that is not found, then checking the JAVA_HOME system variable for OpenJDK.
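
    That lookup order can be sketched as follows. This is an illustrative re-implementation of the documented behavior, not the IR's actual code; `locate_java_runtime` is a hypothetical helper name:

    ```python
    import os

    def locate_java_runtime():
        """Sketch of the documented lookup order on a Self-hosted IR:
        1) Windows registry key for an installed JRE,
        2) the JAVA_HOME system variable (OpenJDK case)."""
        try:
            import winreg  # only available on Windows
            key_path = r"SOFTWARE\JavaSoft\Java Runtime Environment"
            with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, key_path) as key:
                version, _ = winreg.QueryValueEx(key, "CurrentVersion")
            with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE,
                                key_path + "\\" + version) as key:
                java_home, _ = winreg.QueryValueEx(key, "JavaHome")
            return java_home
        except (ImportError, OSError):
            # Registry unavailable or key missing: fall back to JAVA_HOME.
            return os.environ.get("JAVA_HOME")
    ```

    In practice this means installing either a JRE (registered in the registry) or OpenJDK (with JAVA_HOME set) satisfies the requirement.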

    Limitation note: Parquet complex data types (e.g. MAP, LIST, STRUCT) are currently supported only in Data Flows, not in Copy Activity. To use complex types in Data Flows, do not import the file schema in the dataset (leave the dataset schema blank); then, in the Source transformation, import the projection.

    For more info about the Parquet format in ADF please refer to this doc - Parquet format in Azure Data Factory and Azure Synapse Analytics

    I'm not sure why you need to copy data from Oracle to Parquet format and then from Parquet to Azure SQL; note that you can also copy directly from Oracle to Azure SQL.

