Hi nishant garg,
It sounds like you're running into an NPOI zip-parsing error because ADF's Excel dataset over HTTP fetches your SharePoint download URL and gets back something other than a clean .xlsx stream. Here are a few things you can try:
Make sure the file isn't protected: Open it in Excel, click Enable Editing, and then Save. This clears any workbook/sheet protection that the NPOI library (which ADF uses under the covers) can't handle.
Validate your download URL: In Postman or your browser, hit the URL and confirm you get back a raw .xlsx file (content-type application/vnd.openxmlformats-officedocument.spreadsheetml.sheet) and not an HTML page or redirect. If you see HTML or "Access Denied," ADF will download invalid content and NPOI will choke.
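If you'd rather script that check than eyeball it in Postman, here is a minimal Python sketch. It relies on the fact that an .xlsx file is a ZIP archive, so a valid download starts with the bytes `PK\x03\x04`. The function names `looks_like_xlsx` and `check_url` are made up for illustration, and `check_url` assumes the URL can be fetched with a plain unauthenticated GET; a SharePoint link that needs a token or cookie will need those headers added.

```python
import urllib.request

XLSX_MIME = "application/vnd.openxmlformats-officedocument.spreadsheetml.sheet"
ZIP_MAGIC = b"PK\x03\x04"  # .xlsx files are ZIP archives and start with these bytes


def looks_like_xlsx(content_type, first_bytes):
    """Return (ok, reason) for a response's Content-Type header and leading bytes."""
    if not first_bytes.startswith(ZIP_MAGIC):
        # HTML login pages and "Access Denied" bodies fail here.
        return False, "body does not start with the ZIP signature"
    mime = (content_type or "").split(";")[0].strip().lower()
    if mime != XLSX_MIME:
        # Some servers label a valid file as application/octet-stream,
        # so this is reported as a warning rather than a failure.
        return True, f"ZIP body, but unexpected content type: {mime or 'none'}"
    return True, "looks like a raw .xlsx stream"


def check_url(url, timeout=30):
    """Hypothetical helper: GET the URL and classify the first four bytes.

    urlopen follows redirects silently and raises urllib.error.HTTPError
    on 4xx/5xx responses, so a 403 shows up as an exception.
    """
    request = urllib.request.Request(url, method="GET")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        content_type = response.headers.get("Content-Type")
        return looks_like_xlsx(content_type, response.read(4))
```

Calling `check_url("<your download URL>")` should return `(True, ...)` for a real workbook. A `False` result means the server handed back something that isn't a ZIP archive, which is the kind of content that would trip the zip parser.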
Explicitly set sheetName or sheetIndex: In your Excel dataset's properties, specify either sheetName (e.g. "Sheet1") or sheetIndex (0-based). Without one of those, ADF can't determine which worksheet to read.
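For reference, here is roughly what that looks like in the dataset JSON, built as a Python dict so it can be printed and pasted into the dataset's code view. Treat it as a sketch: the dataset name, linked service name, and relative URL are placeholders, the helper `excel_http_dataset` is hypothetical, and the property names follow my understanding of the Excel format documentation linked in the reference list, so check them against your own dataset.

```python
import json


def excel_http_dataset(name, linked_service, relative_url,
                       sheet_name=None, sheet_index=None):
    """Build an Excel-over-HTTP dataset definition with exactly one sheet selector."""
    if (sheet_name is None) == (sheet_index is None):
        raise ValueError("set exactly one of sheet_name or sheet_index")
    type_properties = {
        "location": {"type": "HttpServerLocation", "relativeUrl": relative_url},
        "firstRowAsHeader": True,
    }
    if sheet_name is not None:
        type_properties["sheetName"] = sheet_name
    else:
        type_properties["sheetIndex"] = sheet_index  # 0-based
    return {
        "name": name,
        "properties": {
            "type": "Excel",
            "linkedServiceName": {
                "referenceName": linked_service,
                "type": "LinkedServiceReference",
            },
            "typeProperties": type_properties,
        },
    }


# Placeholder names; substitute your own linked service and download path.
dataset = excel_http_dataset(
    "SharePointExcel", "HttpSharePoint", "<relative download path>",
    sheet_name="Sheet1",
)
print(json.dumps(dataset, indent=2))
```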
Watch out for timeouts on large files: HTTP datasets have a ~100 second timeout to load the entire workbook into memory. If your file is large (>100 MB), you might hit a timeout or a partial download. Workarounds:
- Use a Data Flow source instead of Copy/Preview (it supports streaming).
- Spin up a Self-Hosted IR and do a Copy Activity from HTTP → Blob Storage, then point your Excel dataset at the blob.
- Break the workbook into smaller files or convert to CSV if feasible.
Stage via Blob Storage as a fallback: If HTTP still refuses to deliver a clean stream, create a Copy Activity that downloads the file from HTTP into Azure Blob. Then your Excel dataset can reliably read from Blob Storage without HTTP timeouts or redirects.
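A staging Copy Activity along those lines could look something like the sketch below, again built as a Python dict that prints the activity JSON. It assumes both datasets are Binary datasets (one on an HTTP linked service, one on Blob Storage) so the file is copied byte-for-byte without being parsed. The activity and dataset names and the helper `http_to_blob_copy_activity` are hypothetical, and the source/sink type names reflect my understanding of ADF's pipeline JSON, so verify them in the pipeline's code view.

```python
import json


def http_to_blob_copy_activity(source_dataset, sink_dataset):
    """Build a binary HTTP-to-Blob Copy Activity definition (names are placeholders)."""
    return {
        "name": "StageExcelToBlob",
        "type": "Copy",
        "inputs": [{"referenceName": source_dataset, "type": "DatasetReference"}],
        "outputs": [{"referenceName": sink_dataset, "type": "DatasetReference"}],
        "typeProperties": {
            # Binary source/sink: the .xlsx is moved as-is, not parsed.
            "source": {
                "type": "BinarySource",
                "storeSettings": {"type": "HttpReadSettings", "requestMethod": "GET"},
            },
            "sink": {
                "type": "BinarySink",
                "storeSettings": {"type": "AzureBlobStorageWriteSettings"},
            },
        },
    }


activity = http_to_blob_copy_activity("HttpBinaryFile", "BlobBinaryFile")
print(json.dumps(activity, indent=2))
```

Once the file lands in the container, an Excel dataset on the Blob linked service (with sheetName or sheetIndex set) reads the staged copy, and the HTTP fetch is no longer part of the Excel parsing step.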
Give those a try and let me know which one works for you!
Reference list
- Disable security protection on Excel file https://support.microsoft.com/azure-data-factory-enable-editing-remove-protection
- Excel file format in Azure Data Factory & Synapse (HTTP connector notes) https://learn.microsoft.com/azure/data-factory/format-excel#dataset-properties