VSS\NetBackup\Failover Cluster issue

Sebastian Jesior 166 Reputation points
2021-06-02T16:25:21.86+00:00

Hello All,

Last time we implement NetBackup solution. Now from time to time we have cluster failover. In logs I can see some logs regarding VSS. Please check below:

Volume Shadow Copy Service warning: Writer received a Freeze event more than two minutes ago. The writer is still waiting for either an Abort or a Thaw event.

And after this warning we have cluster failover(Event ID 1135):

Event ID 1135
Cluster node ' NODE A ' was removed from the active failover cluster membership. The Cluster service on this node may have stopped.
This could also be due to the node having lost communication with other active nodes in the failover cluster.
Run the Validate a Configuration wizard to check your network configuration.
If the condition persists, check for hardware or software errors related to the network adapters on this node.
Also check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

We checked everything and we was not able to find anything. When we disable backup we don't have this issue.

May I kindly ask you to help me please?

Regards,
Sebastian

Windows Server Clustering
Windows Server Clustering
Windows Server: A family of Microsoft server operating systems that support enterprise-level management, data storage, applications, and communications.Clustering: The grouping of multiple servers in a way that allows them to appear to be a single unit to client computers on a network. Clustering is a means of increasing network capacity, providing live backup in case one of the servers fails, and improving data security.
1,027 questions
{count} votes

Accepted answer
  1. JiayaoZhu 3,921 Reputation points
    2021-06-03T07:44:30.513+00:00

    Hi,

    Thanks for posting on our forum!

    Based on your descriptions, it seems that when enabling your third-party backup tool, your network occasionally lose packets. When you disable the tool, this issue disappears, so I guess this third-party tool may take up a lot of bandwidth when it is in the run. In this case, you can:

    1) Add an extra network cable
    2) Increase cluster heart rate threshold. See:

    https://techcommunity.microsoft.com/t5/failover-clustering/tuning-failover-cluster-network-thresholds/ba-p/371834

    https://www.virtual-dba.com/blog/always-on-changing-cluster-configuration/

    Please note: Information posted in the given link is hosted by a third party. Microsoft does not guarantee the accuracy and effectiveness of information.

    In addition, you can also check other causes and solutions for this issue through this article:

    https://learn.microsoft.com/en-us/windows-server/troubleshoot/troubleshooting-cluster-event-id-1135

    Thanks for your support!

    BR,
    Joan


    If the Answer is helpful, please click "Accept Answer" and upvote it.

    Note: Please follow the steps in our documentation to enable e-mail notifications if you want to receive the related email notification for this thread.

    0 comments No comments

0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.