Hi Anu,
you need to analyses and identify how long it takes for the management server to get greyed out and until the event is being logged.
This behavior would in the most cases indicate that there is an issue with the sizing of the environment or that the servers are in the need of additional resources. What I would do:
- Increase the server resources (hardware requirements) as per:
System requirements for System Center Operations Manager
https://learn.microsoft.com/en-us/system-center/scom/system-requirements?view=sc-om-2019
- Do some performance optimizations.
I have summarized many of the possible configurations here:
System Center Operations Manager (SCOM) Management Group Performance Optimizations
https://social.technet.microsoft.com/wiki/contents/articles/53582.system-center-operations-manager-scom-management-group-performance-optimizations.aspx
Pay special attention to the application level performance optimizations (09, 10, 11 and 12).
09 is actually a suggestion, coming from Kevin Holman in form of registry based performance optimizations, which you should not forget.
- Last, but not least, in the article Microsoft says:
"If the HealthService is running lots of workflows, this registry value must be set even larger than the recommended size."
This means exactly this: you can set the value even higher if you have a large environment with many MPs (hence many workflows).
----------
(If the reply was helpful please don't forget to upvote and/or accept as answer, thank you)
Regards
Stoyan Chalakov