Hi everyone,
I’m closing the thread with the solution that worked for us, in case it helps someone else. After a deep dive we confirmed the problem wasn’t the Storage firewall or our ADF settings, but our internal DNS resolution used by the Self‑Hosted Integration Runtime (SHIR).
- Using
nslookup
we saw the SHIR queries three on‑prem DNS servers. The first one ( 10.27.x.x ) replied but returned an empty answer for<storage>.blob.core.windows.net,
the other two servers ( 10.25.x.x and 10.26.x.x ) returned the correct private IP for the Private Endpoint. Windows treats the empty answer as “valid” and caches it, which explains the intermittent failures.
Immediate workaround: in the VNet’s DNS server list we placed the servers that resolve correctly first and left the faulty one last. From that moment the pipelines stopped failing.
Permanent fix: on the malfunctioning DNS server we added a conditional forwarder for blob.core.windows.net
and privatelink.blob.core.windows.net
, pointing to 168.63.129.16 (Azure DNS). That server now also returns the correct IP, so the order no longer matters.
(Optional) To speed things up you can clear the cache on the SHIR VM with ipconfig /release
, ipconfig /renew
and ipconfig /flushdns
, but it will work as soon as the negative TTL expires anyway.
We’ve been running for more than 48 hours without a single “remote name could not be resolved” error.
Thanks to @Ganesh Gurram for the suggestions—hope this helps anyone facing the same issue. Best regards!