Friends,
After a few years away from the Windows World (Unix/MacOS) ive recently spent 3 weeks in hell trying to get a simple Failover Cluster up and running. I really just need to confirm some things before deciding wether to continue.
Note I finally got it up and running a couple of days ago with 2 Virtual Machines on Hyper-V in Cluster Roles and then today we had a power outage while I was away (small farm business) resulting in the SAN box (Synology), switches LAN & SAN and both IDENTICAL nodes (Intel NUCs with identical configs and hardware) going down.
When I brought it back up it was completely borked. I cannot recover it. Cluster Services just bouncing on each node up the down, repeat.
Ive tried the usual Clear-ClusterNode, Remove-Cluster and Remove-ClusterNode etc - nothing works.
I've restored both nodes from Backups going further back in time a few times (I do hourly backups)
I decided in frustration to wipe it from the disk, removed the Feature (and reboot), removed the iSCSI disks and even clean formatted them. I searched the Registry and there is no Cluster settings under HKEY_LOCAL. I also looked at the registry Ole hive and it's full of gibberish entries I'm too scared to touch. :-)
When I try to create a new cluster I get an AD access denied error despite doing this as a Domain Administrator.
I'm at my wits end.
I'm 99% about to walk away from Failover Cluster as frankly it can't handle a failure and seems to make recovery worse/longer/pointless.
Here are my questions that I'd appreciate answer to:
- Is there anyway to completely remove ALL traces of the old Cluster from the nodes, files, registry anything else.
- is my experience that a total loss of both nodes and shared storage renders a cluster unrecoverable?
- am I cursed?
I truly appreciate any info or feedback.
Cheers,
Paul.