Server Health Monitor statistics

The Server Health Monitor reports a statistic for the overall server and for individual components. Each statistic corresponds to a rating.

Occasionally, the Server Health Monitor assigns the rating of Unknown. This happens when the Domino® Administration client workstation performs at 100 percent of its CPU capacity for an extended period of time. If this happens you may need to make some adjustments to improve the performance of the Server Health Monitor.

Server Health reports are stored in the Health Monitoring database (DOMMON.NSF).

Table 1. Overall server health ratings

Statistic

Rating

Explanation

0 = Health.Overall.Value

Never Seen

The server has never been seen running during the current server monitor session.

0 < Health.Overall.Value

and

Health.Overall.Value < Health.Overall.Threshold.Warning

Healthy

The server is performing within acceptable levels of tolerance.

Health.Overall.Threshold.Warning < = Health.Overall.Value

and

Health.Overall.Value < Health.Overall.Threshold.Critical

Warning

One or more server components are approaching unacceptable levels of poor performance.

Health.Overall.Threshold.Critical <= Health.Overall.Value

and

Health.Overall.Value <= 97

Critical

One or more server components are failing to perform acceptably.

98 = Health.Overall.Value

Critical

One or more server tasks issued a fatal error message.

99 = Health.Overall.Value

Critical

One or more tasks are not responding.

100 = Health.Overall.Value

Server Down

The server is not responding.

Overall health ratings are based, in part, on component health statistics values.

Table 2. Component health statistics

Statistic

Rating

Explanation

0 = Health.*.Value

Never Seen

The component is not being monitored.

0< Health.*.Value

and

Health.*.Value < Health.*.Threshold.Warning

Healthy

The component is performing within acceptable levels of tolerance.

Health.*.Threshold.Warning <= Health.*.Value

and

Health.*.Value< Health.*.Threshold.Critical

Warning

The component is approaching unacceptable levels of poor performance.

Health.*.Threshold.Critical <= Health.*.Value and

Health.*.Value <= 97

Critical

The component is failing to perform acceptably.

98 = Health.*.Value

Fatal

The task associated with the component issued a fatal error message.

99 = Health.*.Value

Not Responding

The task associated with the component is not responding.