RDSH freezes spontaneously

ThomasK 11 Reputation points
2022-01-04T11:45:34.943+00:00

Hi everybody,
For a while I have an issue at one of my customers. Just een small business with running 3 HPE servers with Hyper-v host running in a failover cluster on-premise all SAS connected to a HPE MSA SAN. They have several VM's running on the cluster for their domain. Some AD servers, Fileserver, Database server and a RDSH deployment (4 RDSH + 1 Sessionbroker and gateway). The RDSH server are using FSLogix for profile containers. The problem I have is with these RDSH servers. These RDSH servers freeze spontaneously. All servers (hosts and VM's) running Windows Server 2016.
When I connect to the Hyper-V host and looking to the console, I see the old time of freezing and can't CTRL+ALT+DEL to the machine.
I am able to ping te machine, can access the c$ share but it is slow loading (normally not). Only thing we can do is reset the RDSH, after that all is running back normal. When looking at the eventlogs from before the reset I see a lot of vhdmp eventid 129 events. After some minutes I also see disk evenid 153 events. It seems there are problems from the VM getting to the vhdx disks. On the hyper-v host runnen the RDSH VM's I can't find any related errors in the event logs.
I read about some issues with older FSLogix versions so updated it, without success. All Hyper-v hosts are up to date with firmware and drivers. Also the Hyper-V hosts and VM's are patched with latest Windows updates.
I made the AV Exclusions mentioned bij MS for FSLogix, the customer is running TrendMicro Business Security.
Can someone help me out? I am searching for about 2 months now, without a solution. The annoying thing is that it also occurs sporadically and is not reproducible. Sometimes 1x a week, sometimes 1x in 2 weeks and then alternately over the hosts.

Best regards, Thomas

Windows Server
Windows Server
A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.
12,205 questions
Hyper-V
Hyper-V
A Windows technology providing a hypervisor-based virtualization solution enabling customers to consolidate workloads onto a single server.
2,560 questions
Remote Desktop
Remote Desktop
A Microsoft app that connects remotely to computers and to virtual apps and desktops.
4,260 questions
FSLogix
FSLogix
A set of solutions that enhance, enable, and simplify non-persistent Windows computing environments and may also be used to create more portable computing sessions when using physical devices.
463 questions
0 comments No comments
{count} votes

5 answers

Sort by: Most helpful
  1. Limitless Technology 39,391 Reputation points
    2022-01-06T11:31:28.183+00:00

    Hello @ThomasK ,

    Thank you for your question.

    1. Are you able to check CPU usage by task manager when disconnect/crash issue occurred?
    2. Carefully check the logs in the path below when the problem occurs to see if there are any clues.
      Event log check:

    TerminalServices-RemoteConnectionManager and TerminalServices-LocalSessionManager logs to view information about connections.

    Step 1: Press Windows + R to open the Run dialog, enter eventvwr (or eventvwr.msc) and click OK.

    Step 2: Navigate to Event Viewer \ Application and Services Logs \ Microsoft \ Windows \ TerminalServices- *

    1. Are there any changes made to the server before the issue occurred? Do you like to install updates? Try reverting the changes back to the previous one as a test.

    You can also change the High CPU Usage by the WMIPRVSE.EXE process at regular intervals on Windows:

    https://learn.microsoft.com/en-US/troubleshoot/windows-server/system-management-components/high-cpu-usage-wmiprvse-process-regular-intervals

    -----------------------------------------------------------------------------------------------------------------------------

    If the answer is helpful, please vote positively and accept the answer.

    0 comments No comments

  2. ThomasK 11 Reputation points
    2022-01-11T08:26:18.393+00:00

    Hi,
    Thanks for your comment.

    1. I can't check CPU usage by task manager, because we can't login anymore. Not by RDP and not by console. But CPU usage monitored by PRTG shows normal CPU behaviour.
    2. This logs don't give any strange events.

    We didn't make any changes to the server. Just normal Windows patching. So for this we need to go back with patches for about 4 months I think on all RDSH servers, and then just wait. I think that's quite a security issue with users logged in and working on it.

    A new Microsoft article we came across last week is quite similar to the issues we experience, so we installed this patch and are waiting now if it is happening again..
    https://learn.microsoft.com/en-us/windows/release-health/status-windows-10-1809-and-windows-server-2019#2770msgdesc

    0 comments No comments

  3. ThomasK 11 Reputation points
    2022-01-27T14:53:44.69+00:00

    After installing the hotfix KB5010196 we still had 3 RDSH freezes.. So it does not fixed our issue.
    Any help would be appreciated for further troubleshooting.

    0 comments No comments

  4. ThomasK 11 Reputation points
    2023-11-22T08:57:44.1066667+00:00

    It's a bit late, but we finally "solved" it by disabling dynamic memory on the hyper-v guests.

    0 comments No comments

  5. Deleted

    This answer has been deleted due to a violation of our Code of Conduct. The answer was manually reported or identified through automated detection before action was taken. Please refer to our Code of Conduct for more information.


    Comments have been turned off. Learn more