Failover Cluster question

DNSMan 0 Reputation points
2023-08-08T19:33:05.17+00:00

Hello everybody, I was curious if anyone has ran into this issue with Failovercluters.. We have a 2 node and a file share witness set up with a cybernetics SAN through ISCSI.

The power went out so both nodes went offline for a while and after the reboot, are unable to see the cluster by name or IP. The Nodes host virtual machines that need access to the cluster storage to power on.

All nodes and witnesses can ping each other by name and IPS. In the failover cluster manager, both nodes are running and the status of the cluster is up. However when i try to connect to cluster it fails with "Cluster [cluster name] not found" error.

We have DNS and DHCP setup on the backup domain controller but the cluster seemed to have disappeared completely. My theory is that the backup DC doesn't have an a entry for the cluster and the node cant resolve it?

Any suggestions or recommendations on the next steps would be greatly appreciated.

Windows Server
Windows Server
A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.
13,726 questions
Windows Server Clustering
Windows Server Clustering
Windows Server: A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.Clustering: The grouping of multiple servers in a way that allows them to appear to be a single unit to client computers on a network. Clustering is a means of increasing network capacity, providing live backup in case one of the servers fails, and improving data security.
1,027 questions
0 comments No comments
{count} votes

2 answers

Sort by: Most helpful
  1. Michael Durkan 12,221 Reputation points MVP
    2023-08-08T22:22:43.96+00:00

    Hi

    this sounds like either Domain Controller replication issues and/or DNS resolution issues.

    As you reference your backup DC, it suggests that the Primary is a VM on the cluster. Use Repadmin and DCDiag tools to check last replication times and if there are any failures.

    You say you can ping using both IP (makes sense) and name - are the names cached entries? Try running nslookup and specify the backup DC as your DNS for resolution. See if it can resolve the Cluster Nodes and Cluster FQDN.

    Not sure what DHCP has to do with this - I assume you have static addresses on both your Cluster nodes and your Cluster VIP?

    Hope this helps,

    Thanks

    Michael Durkan

    • If the reply was helpful please upvote and/or accept as answer as this helps others in the community with similar questions. Thanks!
    0 comments No comments

  2. Alex Bykovskyi 2,241 Reputation points
    2023-08-17T17:15:13.2433333+00:00

    Hey,

    In the failover cluster manager, both nodes are running and the status of the cluster is up.

    You see your nodes in failover cluster manager, so you can connect to the cluster. You can see status of the nodes in the cluster only when you connect to the cluster.

    In any case, you should verify that your DC has accounts and computer objects needed. https://learn.microsoft.com/en-us/windows-server/failover-clustering/configure-ad-accounts

    As noted, you should check replication health of your backup DC>

    In addition, you can consider using StarWind VSAN to have redundant iSCSI storage for your cluster. It will create replicated shared storage, which can be used in the cluster.
    Cheers,    
    Alex Bykovskyi    
    StarWind Software    
    Note: Posts are provided “AS IS” without warranty of any kind, either expressed or implied, including but not limited to the implied warranties of merchantability and/or fitness for a particular purpose.

    0 comments No comments

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.