Cassandra Monitoring in SCOM

Ravi Bhushan 61 Reputation points
2020-11-04T09:52:14.583+00:00

Hello All,

Cassandra is running on a Linux server. I have already set up process monitoring and log file monitoring, but I am a bit confused about the Cassandra-specific requirements. I have received the request below:

  1. Schedule monitoring of the Cassandra service on both Cassandra servers (Linux servers). Trigger a P1 incident if the Cassandra service is stopped on either server.

Command:

service cassandra status

Expected Output:

Service should be in running state
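A minimal sketch of how this check could be scripted so that a monitor (for example a UNIX/Linux shell command monitor) only has to match one fixed token. It assumes that `service cassandra status` follows the usual init-script convention of exit code 0 for "running" and non-zero otherwise. The function names and the `CASSANDRA_RUNNING` / `CASSANDRA_STOPPED` tokens are made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: map the exit code of "service cassandra status"
# to a single token that a Healthy/Error expression can match.
# Assumption: exit code 0 means running, anything else means not running.
cassandra_state_from_rc() {
    if [ "$1" -eq 0 ]; then
        echo "CASSANDRA_RUNNING"
    else
        echo "CASSANDRA_STOPPED"
    fi
}

# Run the status command from the request and print the token.
check_cassandra_service() {
    service cassandra status >/dev/null 2>&1
    cassandra_state_from_rc "$?"
}

# Usage on a Cassandra server:
#   check_cassandra_service
```

With this shape, the healthy condition would match `CASSANDRA_RUNNING` and the error condition `CASSANDRA_STOPPED`, instead of depending on the exact wording of the status text.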

  2. Schedule monitoring of the Cassandra nodes. Trigger a P1 incident if any of the available Cassandra nodes is down.

Command:

nodetool status

Expected Output:

Last login: Thu Oct 22 13:32:14 BST 2020 on pts/0
Datacenter: xx_xxx
====================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
-- Address Load Tokens Owns (effective) Host ID Rack
xx xx.xx.x.xx 1.43 MiB 256 100.0% e13db14d-574e-4acb-9323-dd9e4d7a419b rack01
xx xx.xx.x.xx 1.32 MiB 256 100.0% 6c95f648-02ed-4b5d-9461-7da7df9d1b9f rack01
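A sketch of how the `nodetool status` output could be reduced to one token for a monitor. It assumes the first column of each node line is the two-letter status/state code that is masked as `xx` above (for example `UN` for Up/Normal, `DN` for Down/Normal). The function names and the `NODES_OK` / `NODES_DOWN=` tokens are hypothetical.

```shell
#!/bin/sh
# Count node lines whose first column starts with D (Down) followed by a
# state letter (Normal/Leaving/Joining/Moving). Reads nodetool output on stdin.
count_down_nodes() {
    awk '$1 ~ /^D[NLJM]$/ { n++ } END { print n + 0 }'
}

# Print NODES_OK when no node is down, otherwise NODES_DOWN=<count>.
nodes_state() {
    down=$(count_down_nodes)
    if [ "$down" -eq 0 ]; then
        echo "NODES_OK"
    else
        echo "NODES_DOWN=$down"
    fi
}

# Usage on a Cassandra server:
#   nodetool status | nodes_state
```

Matching on the status column rather than the full line keeps the check independent of addresses, load figures, and host IDs, which change from run to run.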

  3. Set up monitoring for the Cassandra error log. On multiple occurrences of a specific error, trigger a P2 incident.

Log File Details:

Log File Path : /var/log/cassandra
Log File Name: Cassandra.log
Log lines starting with : ERROR
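As a rough sketch of the "multiple occurrences" rule, the following counts log lines that start with `ERROR` and contain a given text, then compares the count with a threshold. The error text and the threshold are not specified in the request, so both are placeholders here, and the function names and output tokens are hypothetical. Note that this rescans the whole file on every run, whereas a dedicated log file monitor would normally look only at new lines.

```shell
#!/bin/sh
# $1 = log file path, $2 = text identifying the specific error (must be non-empty).
# Counts lines that begin with ERROR and contain the given text.
count_error_lines() {
    awk -v pat="$2" 'index($0, "ERROR") == 1 && index($0, pat) > 0 { n++ }
                     END { print n + 0 }' "$1"
}

# $3 = number of occurrences that should count as "multiple" (assumption).
# Prints LOG_ERRORS=<count> at or above the threshold, otherwise LOG_OK.
error_log_state() {
    count=$(count_error_lines "$1" "$2")
    if [ "$count" -ge "$3" ]; then
        echo "LOG_ERRORS=$count"
    else
        echo "LOG_OK"
    fi
}

# Usage (error text and threshold are placeholders):
#   error_log_state /var/log/cassandra/Cassandra.log "some error text" 3
```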

Operations Manager

1 answer

  1. SChalakov 10,266 Reputation points MVP
    2020-11-04T10:11:47.437+00:00

    Hi @Ravi Bhushan ,

    I think you can cover each of those requirements using a "UNIX/Linux Shell Command Two State Monitor" (except for the log monitoring part, where you can use the Linux log file monitoring template or the NiCE log management pack). You simply need to know the command that returns the service status and its exact output, so that you can properly configure the Healthy and Error expressions.
    You can find an example of what the configuration procedure looks like here:

    SCOM 2012 – Linux Two-State Monitor With “Script In Script”
    https://www.stefanroth.net/2012/10/21/scom-2012-linux-two-state-monitor-with-script-in-script/

    In any case, you need the proper command to configure within the monitor, and you also need to know its output so that you can properly configure the conditions.

    I hope I could help you out!

    ----------

    (If the reply was helpful please don't forget to upvote or accept as answer, thank you)
    Regards,
    Stoyan
