Proactive Monitoring - Fujitsu PRIMEQUEST 2400E3 General Description Manual

CHAPTER 4 Functions provided by the PRIMEQUEST 2000 series

4.7 Proactive monitoring

This section describes proactive monitoring in the PRIMEQUEST 2000 series.
Proactive monitoring and linkage with the operations management server are performed for any system
account.
The section describes the following.
- Two types of errors detected by hardware
- Overview of proactive monitoring
- Proactive monitoring operations
Two types of errors detected by hardware
The PRIMEQUEST 2000 series detects the following two types of errors, depending on the hardware.
- Uncorrectable Error (UE)
- Correctable Error (CE)
If an uncorrectable error occurs, the hardware stops all the partitions affected by the error, disconnects the
component on which the error occurred, and tries a restart. (Alternatively, it keeps the partitions stopped and
waits for maintenance.)
A correctable error is corrected by a hardware function. Therefore, the partition need not be stopped and
the faulty component need not be disconnected immediately. However, if correctable errors occur
frequently, the component may be degrading, which makes a fatal error likely in the future.
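The difference in handling between the two error types can be sketched as follows. This is an illustrative Python model only, not PRIMEQUEST firmware code. The names `ErrorType`, `Action` and `plan_actions` and the `restart_after_ue` flag are hypothetical, and the sketch assumes that waiting for maintenance replaces both the disconnection and the restart.

```python
from enum import Enum


class ErrorType(Enum):
    UE = "uncorrectable"
    CE = "correctable"


class Action(Enum):
    STOP_PARTITIONS = "stop affected partitions"
    DISCONNECT_COMPONENT = "disconnect faulty component"
    RESTART = "try restart"
    WAIT_FOR_MAINTENANCE = "keep partitions stopped and wait for maintenance"
    CORRECT_IN_HARDWARE = "correct in hardware"
    RECORD_FOR_MONITORING = "record occurrence for proactive monitoring"


def plan_actions(error_type, restart_after_ue=True):
    """Return the ordered actions for one error (hypothetical model)."""
    if error_type is ErrorType.UE:
        # A UE always stops every partition affected by the error.
        if restart_after_ue:
            return [
                Action.STOP_PARTITIONS,
                Action.DISCONNECT_COMPONENT,
                Action.RESTART,
            ]
        # Assumed alternative: stay stopped until maintenance is performed.
        return [Action.STOP_PARTITIONS, Action.WAIT_FOR_MAINTENANCE]
    # A CE is corrected by hardware; the partition keeps running.
    return [Action.CORRECT_IN_HARDWARE, Action.RECORD_FOR_MONITORING]
```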
Overview of proactive monitoring
Proactive monitoring in the PRIMEQUEST 2000 series monitors the occurrence of correctable errors.
If the number of correctable errors in a given period exceeds the threshold, proactive monitoring identifies
the component causing the errors and reports it to the MMB. When an event reporting an exceeded threshold
value is generated, promptly plan to stop and disconnect the component.
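The threshold check for a given period can be pictured as a sliding-window counter kept per component. The following Python sketch is an assumption for illustration only; this manual does not specify the actual algorithm, threshold values, or period used by the firmware. The class name `CorrectableErrorMonitor` and its parameters are hypothetical, and timestamps are assumed to arrive in non-decreasing order.

```python
from collections import defaultdict, deque


class CorrectableErrorMonitor:
    """Count correctable errors per component over a sliding period."""

    def __init__(self, threshold, period_seconds):
        self.threshold = threshold            # hypothetical value
        self.period_seconds = period_seconds  # hypothetical value
        self._events = defaultdict(deque)     # component -> timestamps

    def record(self, component, timestamp):
        """Record one CE; return True if the threshold is exceeded."""
        window = self._events[component]
        window.append(timestamp)
        # Discard occurrences that fall outside the monitoring period.
        while window and window[0] <= timestamp - self.period_seconds:
            window.popleft()
        # "More errors than the threshold" is a strict comparison.
        return len(window) > self.threshold
```

With `threshold=3` and `period_seconds=60`, three errors within one minute on the same component return `False`, and the fourth returns `True`, at which point the component would be reported.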
Proactive monitoring is performed by SVS, BIOS and MMB firmware. SVS is server management software
that provides integrated management of systems built from several PRIMEQUEST 2000 series servers. For
details on SVS, see '1.5.3 Server management software'.
The MMB firmware and BMC firmware manage the error analysis and the statistical information of each
defective component. If the statistical information crosses the threshold value, a Warning is output to the
System Event Log.
SVS provides a function that reports fault prediction information by using the S.M.A.R.T. function of
the disk drives.
- Monitoring target
  Disk drives mounted on the DU.
- Monitoring items
  S.M.A.R.T. supports proactive monitoring of the following items.
  - Temperature
  - Read error rate
  - Write error rate
  - Seek error rate
  - Spin-up time
  - Number of replaceable sectors remaining
- Monitoring method
  ServerView Suite (SVS) periodically polls the S.M.A.R.T. function of each disk to check for proactive
  detection of any events.
- Action taken with proactive detection
  The following event notification actions are taken.
  - E-mail notification (If e-mail notification is specified, the MMB sends e-mail.)
  - REMCS notification (When REMCS connection is specified, the MMB sends a notification.)
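The polling and notification steps above can be sketched in Python as follows. This is a minimal illustration, not the ServerView Suite or MMB interface: `poll_disks`, `notify`, `PredictionEvent` and the item names are hypothetical, and `read_smart_status` stands for a caller-supplied function that returns, for one disk, a dict mapping each monitored item to `True` when S.M.A.R.T. flags it.

```python
from dataclasses import dataclass

# Monitored items listed in this section (identifier names are assumed).
MONITORED_ITEMS = (
    "temperature",
    "read_error_rate",
    "write_error_rate",
    "seek_error_rate",
    "spin_up_time",
    "replaceable_sectors_remaining",
)


@dataclass
class PredictionEvent:
    disk: str
    item: str


def poll_disks(disks, read_smart_status):
    """Poll each disk once and collect the items flagged by S.M.A.R.T."""
    events = []
    for disk in disks:
        status = read_smart_status(disk)  # hypothetical reader function
        for item in MONITORED_ITEMS:
            if status.get(item, False):
                events.append(PredictionEvent(disk, item))
    return events


def notify(events, email_enabled, remcs_enabled):
    """Return (channel, event) pairs for each enabled notification."""
    notifications = []
    for event in events:
        if email_enabled:
            notifications.append(("e-mail", event))
        if remcs_enabled:
            notifications.append(("REMCS", event))
    return notifications
```

In practice `poll_disks` would be called at a fixed interval. The sketch only builds the list of notifications; sending them is left out because it depends on the MMB settings.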
The following figure shows the proactive monitoring flow.
CA92344-0534-07

This manual is also suitable for:

PRIMEQUEST 2800B3, PRIMEQUEST 2800E3