Troubleshooting UM Health Set
Applies to: Exchange Server 2013
The Unified Messaging (UM) health set monitors the overall health of the UM service in your organization.
If you receive an alert that specifies that UM is unhealthy, this indicates an issue that may prevent users from using the UM service in your organization. The UM health set is closely related to the following health sets:
Troubleshooting UM.CallRouter Health Set
Troubleshooting UM.Protocol Health Set
The UM service is monitored by using the following probes and monitors.
Probe | Health Set | Dependencies | Associated Monitors |
---|---|---|---|
UMSelfTestProbe | UM.Protocol | Active Directory Domain Services (AD DS) | UMSelfTestMonitor |
UMCallRouterTestProbe | UM.CallRouter | Active Directory Domain Services (AD DS) | UMCallRouterTestMonitor |
For more information about probes and monitors, see Server health and performance.
It's possible that the service recovered after it issued the alert. Therefore, when you receive an alert that specifies that the health set is unhealthy, first verify that the issue still exists. If the issue does exist, perform the appropriate recovery actions outlined in the following sections.
Identify the health set name and the server name in the alert.
The message details provide information about the exact cause of the alert. In most cases, the message details provide sufficient troubleshooting information to identify the root cause. If the message details are not clear, do the following:
Open the Exchange Management Shell, and then run the following command to retrieve the details of the health set that issued the alert:
Get-ServerHealth <server name> | ?{$_.HealthSetName -eq "<health set name>"}
For example, to retrieve the UM.Protocol health set details about server1.contoso.com, run the following command:
Get-ServerHealth server1.contoso.com | ?{$_.HealthSetName -eq "UM.Protocol"}
Review the command output to determine which monitor reported the error. The AlertValue value for the monitor that issued the alert will be
Unhealthy
.Rerun the associated probe for the monitor that is in an unhealthy state. Refer to the table in the Explanation section to find the associated probe. To do this, run the following command:
Invoke-MonitoringProbe <health set name>\<probe name> -Server <server name> | Format-List
For example, assume that the failing monitor is UMSelfTestMonitor. The probe associated with that monitor is UMSelfTestProbe. To run that probe on server1.contoso.com, run the following command:
Invoke-MonitoringProbe UM.Protocol\UMSelfTestMonitor -Server server1.contoso.com | Format-List
In the command output, review the Result value of the probe. If the value is Succeeded, the issue was a transient error, and it no longer exists. Otherwise, refer to the recovery steps outlined in the following sections.
When you receive an alert from a health set, the email message contains the following information:
Name of the server that sent the alert
Time and date when the alert occurred
Authentication mechanism used, and credential information
Full exception trace of the last error, including diagnostic data and specific HTTP header information
Note: You can use the information in the full exception trace to help troubleshoot the issue. The exception generated by the probe contains a Failure Reason that describes why the probe failed.
Determine whether the UM service is disabled. If the UM service is not started or disabled, restart the UM service.
Review the event logs on the Client Access server (CAS) to determine whether the UM objects, such as umipgateway and umhuntgroup, are configured correctly.
If the event logs do not contain enough information, you may have to enable UM event logs at the Expert level, and then review the UM trace log files.
Review the event logs on the CAS to determine whether the UM objects, such as the umipgateway and umhuntgroup objects, are configured correctly.
If the event logs do not contain enough information, you may have to enable UM event logs at the Expert level, and then review the UM trace log files.
Review the event logs on the CAS to determine whether the UM objects, such as the umipgateway and umhuntgroup objects, are configured correctly.
If the event logs do not contain enough information, you may have to enable UM event logs at the Expert level, and then review the UM trace log files.
Review the event logs on the CAS to determine whether the UM objects, such as the umipgateway and umhuntgroup objects, are configured correctly.
If the event logs do not contain enough information, you may have to enable UM event logs at the Expert level, and then review the UM trace log files.
Review the event logs on Mailbox server to try to determine why calls from the Lync server are failing. Then, do the following:
Make sure that the Lync pool that is selected by the UM service is operational.
To use a specific Lync server, run the following command:
Set-UMServer ExchangeUMServer -SIPAccessService <ServerName>
The UM server was unable to acquire credentials successfully with the Communications Server A/V Edge service
Review the event logs to investigate which Lync pool is selected, and to verify that the selected Lync pool is operational.
The Communications Server Audio/Video Edge was unable to open a port or allocate resources while attempting to establish a session
Review the event logs to investigate which Lync pool is selected, and to verify that the selected Lync pool is operational.
Renew the UM service certificate on the Mailbox server.
Start IIS Manager, and connect to the server that's reporting the issue to determine whether the MSExchangeServicesAppPool application pool is running.
In IIS Manager, click Application Pools, and then recycle the MSExchangeServicesAppPool application pool. To do this, run the following command:
%SystemRoot%\System32\inetsrv\Appcmd recycle MSExchangeServicesAppPool
Rerun the associated probe as shown in step 2c in the Verifying the issue still exists section.
If the issue still exists, recycle the IIS service by using the IISReset utility or by running the following command:
Iisreset /noforce
Rerun the associated probe as shown in step 2c in the Verifying the issue still exists section.
If the issue still exists, restart the server.
After the server restarts, rerun the associated probe as shown in step 2c in the Verifying the issue still exists section.
If the probe continues to fail, you may need assistance to resolve this issue. Contact a Microsoft Support professional to resolve this issue. To contact a Microsoft Support professional, visit Support for business and then select Servers > Exchange Server. Because your organization may have a specific procedure for directly contacting Microsoft Product Support Services, be sure to review your organization's guidelines first.