Hi,
We have two separate SCOM environments (one for test/dev servers and one for production) that experienced a strange issue over the past weekend. All the management servers for both environments had the healthservice stop at 2:00 AM. There are 4 management servers in Production and 2 in Test/Dev. The environments only share a domain, network, and virtualization infrastructure. Everything else is separate (different databases, management group, etc).
Does anyone know of any processes (clean-up, etc) that runs at 2:00 AM in the morning that may have caused the health service to completely stop? This event in the operations manager log is recorded just before the service stops.
Provider
[ Name] Health Service ESE Store
- EventID 327 [ Qualifiers] 0 Level 4 Task 1 Keywords 0x80000000000000
- TimeCreated [ SystemTime] 2022-12-04T07:00:03.911475400Z EventRecordID 185075 Channel Operations Manager
HealthService (2604,D,51) Health Service Store: The database engine detached a database (1, C:\Program Files\Microsoft System Center\Operations Manager\Server\Health Service State\Health Service Store\HealthServiceStore.edb). (Time=0 seconds)
Revived Cache: 0 0
Additional Data: lgposDetach = 00006B03:000F:0000
Internal Timing Sequence:
[1] 0.000012 +J(0)
[2] 0.000001 +J(0)
[3] 0.000107 +J(0)
[4] 0.000001 +J(0)
[5] 0.0 +J(0)
[6] 0.009554 -0.005641 (3) WT +J(0) +M(C:-84K, Fs:10, WS:-68K # 0K, PF:-64K # 0K, P:-64K)
[7] 0.001785 +J(0)
[8] 0.002098 -0.000857 (1) WT +J(CM:0, PgRf:0, Rd:0/0, Dy:0/0, Lg:4096/2) +M(C:0K, Fs:1, WS:4K # 0K, PF:0K # 0K, P:0K)
[9] 0.010725 -0.005817 (6) WT +J(0) +M(C:0K, Fs:3, WS:-60K # 0K, PF:-68K # 0K, P:-68K)
[10] 0.001253 +J(0)
[11] 0.000387 +J(0) +M(C:0K, Fs:2, WS:0K # 0K, PF:52K # 0K, P:52K).