Detection Of Server Fault; Prevention Of Server Fault; Management Of Server Operation Status - NEC N8800-096F User Manual

Express5800 series

5-14 Installing and Using Utilities

Detection of Server Fault

NEC ESMPRO Manager and NEC ESMPRO Agent detect errors that can cause faults at an early
stage and notify administrators of fault information in real time.
Early detection of errors
If a fault occurs, NEC ESMPRO Agent detects the fault and reports it to NEC ESMPRO Manager
(alert report). NEC ESMPRO Manager displays the received alert in the alert viewer and also
changes the status colors of the server and of the server component in which the fault
occurred. This allows you to identify the fault at a glance. Further, by checking the content
of the fault and the countermeasures, you can take appropriate action as soon as possible.
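The reporting flow described above can be pictured with a minimal sketch. This is not the NEC ESMPRO interface: the class names (`Agent`, `Manager`, `Alert`, `Severity`), the method names, and the green/yellow/red color scheme are all hypothetical and serve only to illustrate an agent sending an alert report and a manager logging it and updating status colors.

```python
from dataclasses import dataclass, field
from enum import IntEnum


class Severity(IntEnum):
    """Hypothetical severity levels; the ordering lets the worst one win."""
    NORMAL = 0
    WARNING = 1
    ERROR = 2


# Assumed color scheme, for illustration only.
STATUS_COLORS = {
    Severity.NORMAL: "green",
    Severity.WARNING: "yellow",
    Severity.ERROR: "red",
}


@dataclass
class Alert:
    server: str
    component: str
    message: str
    severity: Severity


@dataclass
class Manager:
    """Stand-in for the receiving side: an alert log plus per-component status."""
    alert_viewer: list = field(default_factory=list)
    component_status: dict = field(default_factory=dict)

    def receive(self, alert: Alert) -> None:
        # Record the alert, then raise the component status if the alert is worse.
        self.alert_viewer.append(alert)
        key = (alert.server, alert.component)
        current = self.component_status.get(key, Severity.NORMAL)
        self.component_status[key] = max(current, alert.severity)

    def component_color(self, server: str, component: str) -> str:
        status = self.component_status.get((server, component), Severity.NORMAL)
        return STATUS_COLORS[status]

    def server_color(self, server: str) -> str:
        # A server takes the color of its worst component.
        worst = max(
            (sev for (srv, _), sev in self.component_status.items() if srv == server),
            default=Severity.NORMAL,
        )
        return STATUS_COLORS[worst]


@dataclass
class Agent:
    """Stand-in for the monitored side: forwards detected faults as alerts."""
    server: str
    manager: Manager

    def report_fault(self, component: str, message: str, severity: Severity) -> None:
        self.manager.receive(Alert(self.server, component, message, severity))
```

In this sketch, calling `Agent("server01", manager).report_fault("Fan", "Fan failure", Severity.ERROR)` adds one entry to `manager.alert_viewer` and turns both the fan and the server "red", which mirrors the at-a-glance identification described above.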
Types of reported faults
The table below lists the typical faults reported by NEC ESMPRO Agent.

Component      Reported information
CPU            CPU load is over the threshold
               CPU degrading
Memory         ECC 1-bit error detection, etc.
Power supply   Voltage lowering
               Power failure
Temperature    Temperature increase in chassis
Fan            Fan failure (decrease in the number of revolutions)
Storage        File system usage rate
LAN            Line fault threshold over
               Send retry or send abort threshold over

Prevention of Server Fault

NEC ESMPRO Agent includes a preventive maintenance function that forecasts the occurrence of
faults so that they can be prevented before they occur.
NEC ESMPRO Manager and NEC ESMPRO Agent can set a threshold for each source in the
server. If the value of a source exceeds its threshold, NEC ESMPRO Agent reports an alert to NEC
ESMPRO Manager.
The preventive maintenance function can be set for a variety of monitoring items, including chassis
temperature and CPU usage rate.

Management of Server Operation Status

NEC ESMPRO Agent manages and monitors a variety of components installed in the server. You
can view the information managed and monitored by NEC ESMPRO Agent in the data viewer of
NEC ESMPRO Manager.
NEC ESMPRO Agent also manages and monitors all the components and conditions required to
keep server reliability at a high level, such as hard disks, CPU, memory, fans, power supply, and
temperature.
The following table indicates the functions available on each item of the data viewer.
