Hi,
I am using ADF's copy activity to un-zip file in Azure blob storage. I have uploaded a valid zip file into blob storage and using copy activity to copy and unzip this file into new container. I think I have used all the right settings:
- Activity: Copy
- Source data set: Binary, Compression type: ZipDeflate(.zip), Compression level: Optimal
- Sink dataset: Binary, Compression type: None, Copy behavior: Preserve hierarchy
When i run my pipeline with above settings, I get following error:
Error message: ErrorCode=UserErrorUnzipInvalidFile,'Type=Microsoft.DataTransfer.Common.Shared.HybridDeliveryException,Message=The file 'Prod_20230730230042.zip' is not a valid Zip file with Deflate compression method.,Source=Microsoft.DataTransfer.ClientLibrary,''Type=System.IO.InvalidDataException,Message=A local file header is corrupt.,Source=System.IO.Compression,'
Activity Id: 1937352d-b6aa-42e0-85d3-c46d45342847
Pipeline run Id: 1937352d-b6aa-42e0-85d3-c46d45342847
Integration runtime: AutoResolveIntegrationRuntime
Region: West US 2
I have validated that the zip file is correct and I am able to unzip the file on my Windows machine and also with My Python code.
I have read somewhere that ADF only supports deflate compression algorithm. By default zip uses deflate compression algorithm but there are others as well. Can you let me know if there is a way to identify the compression algorithm used for my file and reason why my activity could be failing?
Thanks
Amey