Successful Failover/Reboot Recovery; Failed Failover/Reboot Recovery, Non-Critical - Intel NetStructure MPCMM0001 Software Instructions

Chassis management module
Table of Contents

Advertisement

Process Monitoring and Integrity
6.7.4

Successful Failover/Reboot Recovery

In this scenario, PMS detects a process fault. The configured recovery action is: failover to the
standby CMM and upon successfully executing the failover, reboot the now standby CMM. The
recovery actions are successful.
Table 9.

Successful Failover/Reboot Recovery

PMS detects a faulty process. The
mechanism (existence, thread
watchdog, or integrity) used to detect
the fault will determine which of the
event type strings will be used.
The recovery action specified is
"failover & reboot"
PMS executes a failover.
Note this step is skipped when
running on the standby CMM.
PMS is running on the standby CMM
(failover was successful or already
running on the standby), PMS
recovers the CMM by rebooting.
Upon initialization of PMS after the
reboot. The monitor will de-assert the
event.
6.7.5

Failed Failover/Reboot Recovery, Non-Critical

In this scenario, PMS is running on the active CMM and detects a monitored process fault. The
severity of the process is configured to a value that is not critical. The configured recovery action
is: failover to the standby CMM and upon successfully executing the failover, reboot the now
standby CMM. The failover recovery action is unsuccessful (standby is not available, etc.). The
process being monitored is not of a critical severity and therefore the reboot of the CMM will not
be performed.
48
MPCMM0001 Chassis Management Module Software Technical Product Specification
Download from Www.Somanuals.com. All Manuals Search And Download.
Description
Process existence fault;
attempting recovery or
Thread watchdog fault; attempting
recovery or
Process integrity fault; attempting
recovery
Attempting failover & reboot
recovery action
The existing code generates the
events for failover. They are
separate from process monitoring
events and are not described
here.
Monitoring initialized
Event String
UID
#
#
-
#
Assert
Severity
Assert
Configure
N/A
Configure
N/A
N/A
De-assert
OK

Advertisement

Table of Contents
loading

Table of Contents