An Azure service for ingesting, preparing, and transforming data at scale.
Hi @arkiboys ,
I think one solution would be to specify the SINK filename(s). Thanks!
This browser is no longer supported.
Upgrade to Microsoft Edge to take advantage of the latest features, security updates, and technical support.
in dataflow there is a sink which writes to storage in .parquet
in a foreach loop the dataflow is called for each item to be loaded.
on first load, the storage has data for each item.
if I run the adf the second time, all the items get reloaded into .parquet files in storage and so duplication occurs in storage.
How can I make sure that the data is overwritten if it exists in .parquet in storage already rather than duplicating?
I tried the clear folder option in sink setting but that will clear on each iteration which is not the solution.
Thank you
An Azure service for ingesting, preparing, and transforming data at scale.
Answer accepted by question author
Hi @arkiboys ,
I think one solution would be to specify the SINK filename(s). Thanks!