You'd typically need to add a Spark activity after the Copy Data activity to read the copied files and write them back out in Delta format.
After the copy completes, you can use an Azure Synapse Spark pool to perform the conversion. Here's a rough guide on how you might do this:
- Add a new Spark notebook or Spark job in your Synapse workspace.
- Read the parquet files into a Spark DataFrame.
- Write out the DataFrame in Delta format using the `.format("delta")` option on the DataFrame writer.
You can automate this process by creating a pipeline that includes both the Copy Data activity followed by the Spark job or notebook activity.
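As a rough sketch, the chained pipeline might look like the following Synapse pipeline JSON. The activity names, notebook name, and Spark pool name are placeholders, and the Copy activity's source/sink dataset settings are omitted:

```json
{
  "name": "ParquetToDeltaPipeline",
  "properties": {
    "activities": [
      {
        "name": "CopyToStaging",
        "type": "Copy",
        "typeProperties": {}
      },
      {
        "name": "ConvertToDelta",
        "type": "SynapseNotebook",
        "dependsOn": [
          {
            "activity": "CopyToStaging",
            "dependencyConditions": [ "Succeeded" ]
          }
        ],
        "typeProperties": {
          "notebook": {
            "referenceName": "ConvertParquetToDelta",
            "type": "NotebookReference"
          },
          "sparkPool": {
            "referenceName": "YourSparkPool",
            "type": "BigDataPoolReference"
          }
        }
      }
    ]
  }
}
```

The `dependsOn` entry with the `Succeeded` condition is what ensures the notebook only runs after the copy finishes successfully.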
```python
# Read the Parquet files into a DataFrame
df = spark.read.format("parquet").load("path/to/your/staging/parquet/files")

# Write out the DataFrame in Delta format
df.write.format("delta").save("path/to/your/final/delta/output")
```