System Firmware Surveillance - IBM TotalStorage NAS Gateway 500 Service Manual

Hide thumbs Also See for TotalStorage NAS Gateway 500:
Table of Contents

Advertisement

System firmware surveillance

System firmware surveillance is automatically enabled during system power-on. It cannot be disabled by
the user, and the surveillance interval and surveillance delay cannot be changed by the user.
If the service processor detects no heartbeats during system IPL (for a set period of time), it cycles the
system power to attempt a reboot. The maximum number of retries is set from the service processor
menus. If the fail condition persists, the service processor leaves the machine powered on, logs an error,
and displays menus to the user. If Call-out is enabled, the service processor calls to report the failure and
displays the operating-system surveillance failure code on the operator panel.
Operating system surveillance
Operating system surveillance provides the service processor with a means to detect hang conditions, as
well as hardware or software failures, while the operating system is running. It also provides the operating
system with a means to detect a service processor failure caused by the lack of a return heartbeat.
Operating system surveillance is not enabled by default, allowing you to run operating systems that do not
support this service processor option.
For operating system surveillance to work correctly, you must set these parameters:
v Surveillance enable/disable
v Surveillance interval
The maximum time the service processor should wait for a heartbeat from the operating system before
timeout.
v Surveillance delay
The length of time to wait from the time the operating system is started to when the first heartbeat is
expected.
Surveillance does not take effect until the next time the operating system is started after the parameters
have been set.
You can initiate surveillance mode immediately from service aids. In addition to the previously discussed
options, another option allows you to select immediate surveillance, and rebooting of the system is not
necessarily required.
If operating system surveillance is enabled (and system firmware has passed control to the operating
system), and the service processor does not detect any heartbeats from the operating system, the service
processor assumes that the system is hung and takes action according to the reboot/restart policy
settings. See "Service processor reboot/restart recovery" on page 282.
If surveillance is selected from the service processor menus that are only available at system boot,
surveillance is enabled by default as soon as the system boots. From service aids, the selection is
optional.
Call-out (call-home)
The service processor can call out (call-home) when it detects one of the following conditions:
v System firmware surveillance failure
v Operating system surveillance failure (if supported by operating system)
v Restarts
v Critical hardware failure
v Abnormal operating system termination
288
NAS Gateway 500 Service Guide

Advertisement

Table of Contents
loading

Table of Contents