Server Health Monitor

You can monitor and troubleshoot performance issues using a number of different tools.

In Domino®, traditional performance troubleshooting involves:

  • Using event generators, notifications, and Domino® server monitoring to perform real-time data analysis
  • Using information from the server log (LOG.NSF), the Monitoring Results database (STATREP.NSF), and the Administration Requests database (ADMIN4.NSF) to perform historical data analysis
  • Using Domino® Directory documents and NOTES.INI settings to customize the server configuration

The Server Health Monitor extends traditional performance troubleshooting by automatically calculating health statistics, comparing those statistics to predefined thresholds, and reporting on overall server health. If the server health rating is Warning or Critical, a health report stored in the Health Monitoring database (DOMMON.NSF) offers short-term and long-term recommendations for tuning the server and returning its performance status to Healthy.

The Server Health Monitor is incorporated into the Domino® server monitor, which is part of the Domino® Administration client.

For each server being monitored, the Server Health Monitor reports a health rating for the server and for all enabled individual server components -- namely, CPU, disk, memory, and network utilization; NRPC name lookup; mail delivery latency; and server, HTTP, LDAP, and IMAP response.

The health rating of each server and server component is based on a collection of indices. Each index has a calculated value between 0 and 100, derived from the server health monitoring assessment algorithms and rules. A health rating of Healthy, Warning, or Critical is assigned based on these index values. Each index has two related thresholds: a warning threshold and a critical threshold. When the index value is less than both thresholds, the server or server component is rated Healthy. When the index value is greater than the warning threshold but not greater than the critical threshold, the server or server component is rated Warning. When the index value is greater than the critical threshold, the server or server component is rated Critical and requires immediate attention.
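The two-threshold rule described above can be sketched in a few lines of Python. This is only an illustration, not the Server Health Monitor's actual algorithm: the names HealthRating and rate_index are hypothetical, the threshold values in the usage lines are made up, and the text does not say how a value exactly equal to a threshold is rated, so the sketch assumes such a value counts as having reached the threshold.

```python
from enum import Enum


class HealthRating(Enum):
    HEALTHY = "Healthy"
    WARNING = "Warning"
    CRITICAL = "Critical"


def rate_index(index_value: float, warning: float, critical: float) -> HealthRating:
    """Map an index value (0-100) to a rating using two thresholds."""
    if not 0 <= index_value <= 100:
        raise ValueError("index value must be between 0 and 100")
    if warning > critical:
        raise ValueError("warning threshold must not exceed critical threshold")
    # Assumption: a value equal to a threshold counts as having reached it.
    if index_value >= critical:
        return HealthRating.CRITICAL
    if index_value >= warning:
        return HealthRating.WARNING
    return HealthRating.HEALTHY


# Hypothetical thresholds: warning at 60, critical at 85.
for value in (10, 72, 93):
    print(value, rate_index(value, warning=60, critical=85).value)
```

Checking the critical threshold first keeps the three ratings mutually exclusive, so an index above both thresholds is reported only as Critical.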

The Server Health Monitor includes threshold values for each index on these platforms: IBM® AIX®, IBM® eServer, IBM® i, IBM® eServer zSeries®, Linux Intel, and Microsoft Windows. You can modify the thresholds to customize server assessment for each platform. Lowering a threshold makes the algorithms more sensitive, because a rating is triggered at a lower index value; raising a threshold makes them less sensitive.
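To see why lowering a threshold increases sensitivity, consider the following minimal sketch. Everything in it is assumed for illustration: the platform keys, the threshold numbers, and the helper names thresholds_for and is_flagged are hypothetical and do not reflect Domino defaults or how thresholds are stored in the product.

```python
# Hypothetical (warning, critical) thresholds for one index, per platform.
# These numbers are illustrative only and are not Domino defaults.
DEFAULT_THRESHOLDS = {
    "AIX": (70, 90),
    "Linux": (70, 90),
    "Windows": (70, 90),
}


def thresholds_for(platform, overrides=None):
    """Return the (warning, critical) pair, applying any per-platform override."""
    merged = {**DEFAULT_THRESHOLDS, **(overrides or {})}
    return merged[platform]


def is_flagged(index_value, platform, overrides=None):
    """True when the index value reaches the platform's warning threshold."""
    warning, _critical = thresholds_for(platform, overrides)
    return index_value >= warning


# The same index value of 60 is ignored with the defaults,
# but flagged once the Windows thresholds are lowered.
print(is_flagged(60, "Windows"))
print(is_flagged(60, "Windows", overrides={"Windows": (50, 80)}))
```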

Health Monitoring reports on each server area for which data can be retrieved. If no data is available, nothing is reported for that component. You can customize this behavior by specifying which servers you want to monitor. You can exclude any component from the health report, which is useful for filtering out known situations about which you don't want to be constantly reminded.
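The reporting behavior described above, where components without data are skipped and excluded components are filtered out, could be modeled roughly as follows. The function build_report, the component names used as keys, and the index values are hypothetical; in the product, exclusions are configured through the Server Health Monitor rather than in code.

```python
from typing import Dict, Optional, Set


def build_report(
    readings: Dict[str, Optional[float]],
    excluded: Set[str],
) -> Dict[str, float]:
    """Keep only components that have data and are not excluded."""
    return {
        name: value
        for name, value in readings.items()
        if value is not None and name not in excluded
    }


# None marks a component for which no data could be retrieved.
readings = {"CPU": 42.0, "Disk": None, "Memory": 88.0, "Network": 15.0}

# Memory is a known situation the administrator does not want reported.
report = build_report(readings, excluded={"Memory"})
print(report)
```

In this sketch, Disk is omitted because no data is available and Memory is omitted because it was explicitly excluded, leaving only CPU and Network in the report.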

If you use the Server Health Monitor, the Current Reports view of the Health Monitoring database (DOMMON.NSF) displays a health rating for each monitored server and server component.