Share via

Windows 10 loses some internet and browser connectivity about 24 hours after reboot

Anonymous
2017-07-18T18:05:52+00:00

Sorry for the lengthy post but I have done extensive troubleshooting on this and I'm at a loss. My Windows 10 computer is driving me crazy. It seems to be losing some form of internet connectivity every day approximately 24 hours after rebooting. Then because I can’t connect completely to the internet and I can’t find a troubleshooting solution, I have to reboot again which fixes it for approximately 24 hours. Rinse, wash, repeat…

To provide some background, I have a CS degree and have been a software developer for 20+ years. I have tons of experience configuring desktops, servers, virtual environments, networks, etc. While I would be considered an expert in many areas, I know there is far more that I don’t know than I do know. But that’s just to say that I am by no means a novice with the issues I’m having.

My machine:

  • Windows 10 Pro (was upgrade from previous versions of Windows)
  • Home network (logging on locally via domain credentials from when this was in a corporate environment but no longer connected to domain). But this has been working fine for 3+ years
  • Windows Update is at the latest updates (but I turned off the stupid auto-update and reboot to avoid shutdowns when critical apps are open)
  • 2 x Xeon E5-2630 2.6GHz
  • 64 GB Ram
  • 2 drives (primary is SSD, secondary is SATA)

My environment

  • Sonicwall TZ210 router to 100Mb FIOS connection
  • Machine connected directly to HP ProCurve 1810G switch
  • No wireless involved with this machine
  • I have 4 static IP addresses on my machine to run various internal web development sites through IIS, etc.
  • My static IPs are not in the DHCP range that other devices on my network use
  • I use 8.8.8.8 for DNS (and have tried 8.8.4.4 and some ISP DNS also)
  • I have several VPNs that I may connect to in order to work throughout the day.
  • I use static routes to redirect certain traffic through the VPN
  • I do not use any other Proxies (other than VPN traffic) to connect to the internet

This began about 2-3 weeks ago when I installed a big group of Windows Updates which included the cumulative updates, etc. It was also around the time I switched from McAffee VirusScan Enterprise to AntiVirus Plus for renewal purposes. I probably did some other things around that time to but lots of things were changing.

At that point, I would be working along just fine, and then all of the sudden, all browsers would lose connectivity: “This site can’t be displayed” in Chrome. Same equivalent in Firefox, IE, and Edge. However, some apps like Slack and Skype work for the most part but occasionally (and then eventually) stop working altogether. However, the AAC+ music streaming seems to run indefinitely. But nothing I change will affect the browsers loading any webpages (which makes it much harder to write software). I don't lose connection to VPN's etc.

There are no indications from McAfee or Malwarebytes when the connection stops. But I have no problem with any connectivity the entire day until it just freaks out.

While the failure is happening

  • I can ping any public IP by IP address and generally by domain name
  • I can do nslookup using 8.8.8.8
  • I can ping my Sonicwall router but cannot connect to it via the browser
  • None of the other machines on my network have ANY problems while my machine is unable to connect
  • None of my browsers will load web pages (including local websites hosted on my machine via IIS)
  • If I change my static IP addresses to DHCP, I can restore lost Slack (and I think Skype) connectivity but websites are still down. Restoring static IP’s takes away Slack again.
  • Turning off VPNs and/or static routes does not restore connectivity
  • I can log in to my Sonicwall router from another machines on the same network and can see the ARC lease for my static IP. Killing the ARC connection in the router results in it reappearing immediately.
  • Nothing is pegging Processor or RAM at the time
  • Memory usage is only about 40% or so
  • I don't believe email comes through unless I change static/dhcp address status
  • Sometimes when this first starts, I can pull up certain websites but the CSS / included content isn't loaded. That's how I usually know it's about to hard fail the connection.

I was originally thinking DNS, virus, malware, etc. But it’s super strange that I can connect with some protocols and not with others. I’m guessing that some app / program is deciding I’ve had enough internet for the day and is trying to cut me off for my own good. But this is a free country and it’s my right to earn a living as needed… :)

Related Software I normally run

  • McAfee AntiVirus Plus (which takes over for Windows Firewall)
  • Malwarebytes

What I have tried

  • I know about flushing DNS, etc and have tried that
  • I have resetting winsock, etc (EDIT: includes running the following as Administrator in command prompt)
    • netsh winsock reset catalog
    • netsh int ip reset reset.log
  • I have tried clearing Chrome chrome://net-internals sockets and dns (but this affects all browsers)
  • I have confirmed that IE does not have any Proxies set up that are affecting general internet traffic
  • I uninstalled McAfee completely for an entire day and it still happened
  • I reinstalled McAfee and uninstalled Malwarebytes (still uninstalled)
  • I turned off all onboard firewalls (McAfee / Windows)
  • I switched Network cards (I have tried an onboard card, a 2nd PCIx card, and then purchased a 3rd USB Gigabit ethernet adapter). All cards were the only active card while testing (any others that were installed like the onboard were disabled)
  • I have tried updating drivers for each of the 3 network cards including removing them from the machine in Device Manager and reinstalling
  • I changed network ports on my ProCurve switch (and also hard rebooted my ProCurve switch)
  • I have rebooted my Sonicwall router (but other devices have no problems)
  • I did notice after a few days of daily freezes, that my C drive was full due to 30+ GB hyperfil.sys from a previous failure and my regular 64GB pagefile.sys. So I removed the hyperfil.sys file (using command line to disable Hybernation in Windows) and then relocated the pagefile to D drive instead. Now I have 70+GB free space on C and 270+GB on D drive.
  • sfc /scannow (no issues)
  • DISM /Online /Cleanup-Image /RestoreHealth (nothing found)
  • Resetting DHCP in services (but again I’m using static IP)

After migrating my pagefile to D, I lost my Start menu. I had to do this and other stuff to restore them:

“Get-AppXPackage -AllUsers | Foreach {Add-AppxPackage -DisableDevelopmentMode -Register "$($_.InstallLocation)\AppXManifest.xml"}”

However, I don’t have a complete set of panels (whatever the goofy cards are called on the right hand of the Start menu that pops up). So the start menu isn’t completely back, but my programs are there now so I can get to them.

Scans I have tried

  • Full McAfee AntiVirus Plus scan
  • Full Malwarebytes
  • Emsisoft Emergency Kit
  • TrendMicro HouseCall
  • Windows Defender

Other than some non-intrusive PUPs, nothing is ever found.

I usually try an hour or two of more troubleshooting and Googling every time it goes down, but I can’t seem to figure this out yet. It’s super annoying because I don’t like to reboot regularly as I have 5 monitors and routinely have 130+ Chrome tabs along with 10-20 other apps / windows running, and many background processes (web servers, etc). I haven’t had any problems with this until a few weeks ago.

I’m sure there is more than I can offer but people will probably ask questions and I’m worried I might lose connectivity again before I get to post this.

Feel free to ask questions or offer advice but please take the time to read all the stuff I tried to save us both some time! And I don’t mind trying some other things again to confirm various settings / statuses. :)

Windows for home | Windows 10 | Internet and connectivity

Locked Question. This question was migrated from the Microsoft Support Community. You can vote on whether it's helpful, but you can't add comments or replies or follow the question.

0 comments No comments

7 answers

Sort by: Most helpful
  1. Anonymous
    2017-10-17T15:28:37+00:00

    @alex,

    It does appear to be solved. I believe it was actually an issue where TCP ports were being exhausted and not related to DNS at all. I did some workarounds as referenced below and it was much better. Then there was a Windows Update about a month ago that seemed to do away with the issue entirely. So I think it may have started from an earlier Update somehow as I suspected.

    Similar to this:

    https://msdn.microsoft.com/en-us/library/aa560610%28v=bts.20%29.aspx?f=255&MSPPError=-2147217396

    But I'm running Windows 10 and I don't think this directly applies. I have tried to watch netstat before and after it breaks and it doesn't appears to have an unruly amount of ports used. The symptoms are consistent with port issues though:

    • Existing connections typically remain open for a period of time (music streaming, existing SQL Server query connections, skype, slack, etc)
    • New connections are blocked (browsing, new SQL connections, etc)
    • Shutting down services that are using a lot of TCP connections (like Coldfusion server) will temporarily restore service for some types of connections (some browser requests, etc)

    I have watched my vshub.exe error logs from Visual Studio and see that it's cycling through a bunch of ports trying to open a connection and it's constantly blocked (49152, 49153, 49154, ...). However, those ports do not show up under netstat as blocked or in use at all. They're simply not listed. But vshub still cannot bind to the port.

    But this is what I actually did to make it much better:

    https://deploymentresearch.com/Research/Post/532/Fix-for-Windows-10-exhausted-pool-of-TCP-IP-ports

    I settled on these options (listed in the article):

    TcpTimedWaitDelayREG_DWORD: 0000001e (30) (hex) 

    MaxUserPortREG_DWORD: 0000fffe (65534) (hex)

    TcpMaxDataRetransmissionsREG_DWORD: 00000005 (5) (hex)

    I didn't end up setting TcpNumConnections at all.

    Hopefully that helps! If it doesn't and you haven't done the latest comprehensive Update, I would definitely try that as well.

    Was this answer helpful?

    7 people found this answer helpful.
    0 comments No comments
  2. Anonymous
    2017-07-18T19:26:45+00:00

    That makes some sense on the networking side. Like it's possibly hitting some "buffer" or something that fills up and only gets reset on reboot.

    I know that the static IP addresses remain connected to the machine. And I'm 99.9% sure that I ran a test to ping my machine from other machines on the network after the connection is lost. I can also access other machines on my local network (including mapped network drives on my NAS) when it happens. But the browser based connections are not accessible.

    So it's like 80/443 stop working correctly but most other connections remain. Which screams either virus, malware, or firewall to me but I can't seem to find any apps or processes that are doing it.

    I do see some events in the Event Viewer of the following (every 30 minutes or so):

    The processing of Group Policy failed because of lack of network connectivity to a domain controller. This may be a transient condition. A success message would be generated once the machine gets connected to the domain controller and Group Policy has successfully processed. If you do not see a success message for several hours, then contact your administrator.

    I also see this every 90 minutes or so this morning:

    The system failed to register host (A or AAAA) resource records (RRs) for network adapter with settings:

               Adapter Name : {22501B19-[MASKED]- 404B0A9BA624}

               Host Name : [MASKED]

               Primary Domain Suffix : domain.[MASKED]

               DNS server list :

                     8.8.4.4, 8.8.8.8

               Sent update to server : <?>

               IP Address(es) :

                 [My 4 Static IP's listed here; MASKED]

    The reason the system could not register these RRs was because the DNS server contacted refused the update request. The reasons for this might be (a) you are not allowed to update the specified DNS domain name, or (b) because the DNS server authoritative for this name does not support the DNS dynamic update protocol.

    To register the DNS host (A or AAAA) resource records using the specific DNS domain name and IP addresses for this adapter, contact your DNS server or network systems administrator.

    And one last weird one:

    Initialization failed because the driver device could not be created. Use the string "000000000100320000000000D71000[MASKED]1000000000000000000000000000000" to identify the interface for which initialization failed. It represents the MAC address of the failed interface or the  Globally Unique Interface Identifier (GUID) if NetBT was unable to  map from GUID to MAC address. If neither the MAC address nor the GUID were  available, the string represents a cluster device name.

    I could try doing a BIOS update from Dell (Precision T5610) but again, it's just weird that it was working fine for 3 years and then some recent change causes it to freak out every day. And it's not the same time every day. It's literally about 24 hours each time. I did have it once over the weekend take somewhere between 24-48 hours but otherwise it's very consistent.

    Was this answer helpful?

    0 comments No comments
  3. @CmdrKeene 90,626 Reputation points Volunteer Moderator
    2017-07-18T19:12:45+00:00

    Badly made or misconfigured drivers can cause issues like this, such as after so many packets or hours running they can hit the issue and be unable to recover. I had a similar frustration with my Dell Venue 8 Pro but it hasn't happened since I updated the BIOS, although I can't say for absolute certainty if that was the fix.

    I'd still encourage you to get the drivers from the system builder/configurator like Dell/HP/Acer/etc, since they will often modify the driver from Intel and ship their own configured version.  But considering an entire different network card connected by USB has the same issue, I'd be inclined to agree that it isn't the fault.

    When you lose connectivity, does your IP address become invalid?  Or perhaps, does the Device Manager entry for the network card still report that the device is working normally?

    Was this answer helpful?

    0 comments No comments
  4. Anonymous
    2017-07-18T19:05:07+00:00

    @Shawn, Thanks for the reply. I should have been more clear in my "reset winsock" line item to say that I did run those commands as Admin in command line. I went ahead and updated the initial post to reflect that. The end step of those commands is always to reboot. Which of course solves the problem temporarily (as does any reboot) but then it reoccurs approximately 24 +/- hours from that point.

    As far as the drivers go, I had an Intel card onboard that I tried both the Device Manager as well as going directly to the Intel site to get the latest drivers. Neither worked. So I tried the "generic" card I had installed years ago and got the same result. Then I went and bought a brand new USB network card from BestBuy in hopes that a different method altogether might provide a positive result or at least some different troubleshooting data. Unfortunately it worked exactly the same as the other two cards I had tried.

    And I'm wondering why the connection would reset with every reboot and then break after an entire day if the drivers were in fact bad (not questioning you but the machine issues themselves). It's obviously a possible solution that needs explored, but it will work without fail after a reboot. So I'm not convinced that drivers would necessarily fix it even though I have tried several in order to correct this with 3 different cards.

    If you have any other thoughts please let me know.

    Was this answer helpful?

    0 comments No comments
  5. @CmdrKeene 90,626 Reputation points Volunteer Moderator
    2017-07-18T18:53:35+00:00

    If you only updated drivers from Device Manager, that update check isn't very good. Companies have to voluntarily submit drivers to get found by that tool, and even if there are they are usually outdated.

    I'd at minimum re-apply whatever driver is on the OEM's website for that model. Even if it's the version you already have I'd still do it.

    I know you've tried a ton, forgive me if you did this, but I found success with it before:

    1. First, open an admin-mode command line interface:

    On the Start screen, type cmd and press Ctrl+Shift+Enter. You'll be prompted for this admin action.

    Make sure the top of the command prompt window says "Administrator" in the title. 2. Type these two commands to reset the networking system:

    Type each of these lines, press enter after each one

    netsh winsock reset catalog

    netsh int ip reset reset.log hit

    After running both of these, hopefully everything can connect again. Let me know how it goes.
    

    Was this answer helpful?

    0 comments No comments