AAG on Linux : Pacemaker and corosync

sakuraime 2,326 Reputation points
2020-11-05T16:42:23.62+00:00

https://learn.microsoft.com/en-us/sql/linux/sql-server-linux-availability-group-ha?view=sql-server-ver15

Not quite under stand why in the following situation (Primary failed), it still can do auto failover
37804-pacemaker.jpg

And what;s the difference between pacemaker and corosync? what are there responsibility ?

how pacemaker detect the sql server is healthy or not ?

SQL Server
SQL Server
A family of Microsoft relational database management and analysis systems for e-commerce, line-of-business, and data warehousing solutions.
13,663 questions
{count} votes

1 answer

Sort by: Most helpful
  1. CathyJi-MSFT 21,861 Reputation points Microsoft Vendor
    2020-11-06T07:33:37.993+00:00

    Hi @sakuraime ,

    > what;s the difference between pacemaker and corosync? what are there responsibility ?

    Pacemaker - The core clustering component, that does things like coordinate across the clustered machines.
    Corosync - A framework and set of APIs that provides things like quorum, the ability to restart failed processes, and so on.

    Please refer to Pacemaker for Always On Availability Groups and failover cluster instances on Linux.

    > how pacemaker detect the sql server is healthy or not ?

    When AGs or FCIs are configured on a Windows-based configuration, they are cluster-aware. Cluster awareness means that SQL Server has specific resource DLLs that a WSFC knows about (sqagtres.dll and sqsrvres.dll for FCIs, hadrres.dll for AGs) and are used by the WSFC to ensure that the SQL Server clustered functionality is up, running, and functioning properly. Because clustering is external not only to SQL Server but Linux itself, Microsoft had to code the equivalent of a resource DLL for Linux-based AG and FCI deployments. This is the mssql-server-ha package, also known as the SQL Server resource agent for Pacemaker.

    Please refer to Install SQL Server packages for availability.

    Please refer to
    Best regards,
    Cathy


    If the response is helpful, please click "Accept Answer" and upvote it.
    Hot issues October--Users always get connection timeout problem when using multi subnet AG via listener. Especially after failover to another subnet


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.