ClusterPerformanceHistory volume wants to be repaired - but how?

Stefan Falk 166 Reputation points
2021-05-05T14:46:54.957+00:00

Hello everybody,

A customer has a 4-node Windows Server 2019 Hyper-V cluster. One node had to be put in maintenance because one of its boot drives (RAID 1) was replaced. After returning the node to the cluster, the cluster rans just fine. However, we cannot put pause a node (for maintenance). Failover cluster management says error 0x80071763, some cluster storage would be in a degraded state.

Windows Admin Center shows us that it is the ClusterPerformanceHistory volume. It says, a repair is needed, but it does not do that on its own (we waited several days). The volume is ReFS and has 3-way-mirrorring on the Cluster-S2D storage, but it is not a CSV, says WAC. WAC also says that all data on the volume is safe and available but must be synchronized with other servers in the cluster.

How can we repair this volume please?

Best Regards,
Stefan Falk

Windows Server Clustering
Windows Server Clustering
Windows Server: A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.Clustering: The grouping of multiple servers in a way that allows them to appear to be a single unit to client computers on a network. Clustering is a means of increasing network capacity, providing live backup in case one of the servers fails, and improving data security.
962 questions
0 comments No comments
{count} votes

7 answers

Sort by: Most helpful
  1. JiayaoZhu 3,911 Reputation points
    2021-05-12T01:45:00.847+00:00

    Hi,

    Thanks for your reply!

    It seems like a new issue because as I mentioned before, “IsBackgroundTask” = “False” means that you did not run your command under a background environment. So this result should not necessarily lead the cluster to not allow you to take a node offline for a reboot. In this way, we can first try to add -asjob command to run repair command again, to see if you still cannot reboot your cluster. The guidance for using -asjob command is in the article that I sent you before.

    If you still have problem in taking a node offline, we may need more information about this situation, like wasthere only one node that cannot be turned off? if so, which one? In addition, you are encouraged to give us any error messages or codes that popped up when you tried to take a node offline but failed.

    Thanks for your support and understanding!

    BR,
    Joan

    --------------------------------------------------------------------------------------------------------------------

    If the Answer is helpful, please click "Accept Answer" and upvote it.

    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.

    0 comments No comments

  2. Stefan Falk 266 Reputation points
    2021-06-02T15:02:09.637+00:00

    Hello Joan,

    meanwhile, the Cluster has decided to do its job an repair that volume. It is now showes as healthy again. As we had done nothing special, it seems it just took days to weeks. I can't imagine this is normal, but in the end, I am happy again.

    Thanks for you support!

    Best Regards,
    Stefan

    0 comments No comments