Unplanned failover 2 node cluster fails

Dominique Vorbrodt 21 Reputation points
2022-03-24T15:30:58.753+00:00

Hello out there

I have the most simple 2 node cluster (Windows Server 2022 Standard) with the most simple iSCSI shared storage.

I have 2 networks:

  • iSCSI (10.0.26.0/24)
  • Cluster & Client (192.168.26.0/24)

I have 1 CSV (C:\ClusterStorage on both nodes)

I have only 1 Hyper-V VM as clustered role. All files for the VM reside on the CSV.

Planned live migration of the VM initiated in Cluadmin works without problems.

However, an unplanned failover of the VM (triggered by switching off the cluster node that currently hosts it) does not work.
The VM goes offline.
In Cluadmin the VM remains on the failed node in the state of "Unmonitored".

Can anyone help please?

Thank you.
Sincerely
D.
Zurich, Switzerland.

System Center Virtual Machine Manager
Windows Server Clustering

Accepted answer
  1. Alex Bykovskyi 1,681 Reputation points
    2022-03-25T13:32:34.883+00:00

    Hey,

    When one of the nodes fails (that is, communication between the cluster nodes is lost), the cluster puts the node it can no longer reach into the Isolated state, and the VMs on that node are shown as Unmonitored. By default, the node stays isolated for 240 seconds (ResiliencyPeriod) before its VMs fail over to another node. You can change this value. The following article should help:
    https://techcommunity.microsoft.com/t5/failover-clustering/virtual-machine-compute-resiliency-in-windows-server-2016/ba-p/372027
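
To make the timing concrete, here is a minimal Python sketch of the behaviour described above. It is a toy model, not cluster code: the function name, the node names `node1`/`node2`, and the simplified "Down"/"Online" end state are assumptions made for illustration. Only the 240-second default and the Isolated/Unmonitored states come from the answer.

```python
from dataclasses import dataclass

# Default from the answer above: an unreachable node is tolerated for 240 s.
DEFAULT_RESILIENCY_PERIOD_S = 240


@dataclass
class VmStatus:
    node_state: str  # state of the node that lost cluster communication
    vm_state: str    # state of the VM role as shown in the cluster console
    owner: str       # node that currently owns the VM role


def vm_status_after_node_loss(
    elapsed_s,
    resiliency_period_s=DEFAULT_RESILIENCY_PERIOD_S,
    failed_node="node1",      # hypothetical node names
    surviving_node="node2",
):
    """Toy model: where is the VM `elapsed_s` seconds after its node went silent?"""
    if elapsed_s < 0:
        raise ValueError("elapsed_s must be non-negative")
    if elapsed_s < resiliency_period_s:
        # Inside the resiliency period the cluster waits for the node to come
        # back, so the VM stays assigned to it and is not started elsewhere.
        return VmStatus("Isolated", "Unmonitored", failed_node)
    # After the period expires the node is treated as down and the VM fails
    # over to the surviving node (simplified end state, an assumption).
    return VmStatus("Down", "Online", surviving_node)


if __name__ == "__main__":
    for seconds in (60, 239, 240):
        print(seconds, vm_status_after_node_loss(seconds))
```

With the default period, checking the console one minute after power-off still shows the VM as Unmonitored on the failed node, which matches the symptom in the question. On a real cluster the period is read and changed through the failover clustering PowerShell cmdlets covered in the linked article, not through code like this.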

    The following article might be helpful as well:
    https://www.starwindsoftware.com/blog/the-main-features-of-2016-failover-cluster

    Cheers,

    Alex Bykovskyi

    StarWind Software

    Note: Posts are provided “AS IS” without warranty of any kind, either expressed or implied, including but not limited to the implied warranties of merchantability and/or fitness for a particular purpose.


2 additional answers

  1. Limitless Technology 39,351 Reputation points
    2022-03-28T16:06:55.813+00:00

    Hello @Dominique Vorbrodt

    Live migration issues can have many causes, so a more thorough analysis of the details of this issue is recommended.

    I suggest starting with the following troubleshooting guide for live migration:

    https://learn.microsoft.com/en-us/troubleshoot/windows-server/virtualization/troubleshoot-live-migration-issues

    Hope this helps with your query,

    -----------

    --If the reply is helpful, please Upvote and Accept as answer--


  2. Dominique Vorbrodt 21 Reputation points
    2022-03-29T13:00:03.373+00:00

    Good Afternoon Alex

    I owe you many thanks for your hints and links, which led to the solution of this nasty problem!

    Indeed, I was unaware of the ResiliencyLevel setting. Unfortunately I never waited 240 secs to see the VM live migrate ;-)

    Thank you for your help.
    Greetings from Zürich Switzerland.
