IBM BladeCenter S SAS RAID Controller Module Installation And User Manual page 131

1105
Controller controller_ID critical failure detected - Unrecoverable HW error
Explanation: An unrecoverable hardware error has been detected in the controller. The controller continually
monitors the health of the RAID Storage Processor and microprocessor subsystem. When this alert is generated, the
controller can no longer continue normal operation.
System action: By default, the SAS RAID Module system information indicator is turned on and the operator is
notified of this alert by e-mail. If the failure occurs in a redundant controller, the remaining controller enters survivor
mode and assumes control of all disk drives and associated LUNs while the failed controller enters service mode. If
this failure occurs on a controller that is already in survivor mode, the controller attempts to flush all dirty cache and
dismount all volumes in an orderly manner before entering service mode.
Operator response: Reboot the controller. All alerts are cleared. The controller reevaluates the system health status
and regenerates all alerts that apply. If alert 1105 is generated again after the reboot, replace the controller. See
"Replacing a single controller using the CLI" on page 154 or "Replacing a single controller using the SCM" on page
155.
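The mode changes described under "System action" can be sketched as a small state transition. This is an illustrative model only, not controller firmware: the `Mode` enum, the `on_critical_failure` function, and the name "normal" for ordinary redundant operation are assumptions introduced here. Only survivor mode and service mode are named in this manual.

```python
from enum import Enum
from typing import List, Tuple


class Mode(Enum):
    NORMAL = "normal"      # assumed name for ordinary redundant operation
    SURVIVOR = "survivor"  # named in the manual
    SERVICE = "service"    # named in the manual


def on_critical_failure(failing: Mode, peer: Mode) -> Tuple[Mode, Mode, List[str]]:
    """Return (new mode of failing controller, new mode of peer, steps attempted)."""
    if failing is Mode.SURVIVOR:
        # No healthy peer remains: try to protect data before going to service mode.
        steps = ["flush dirty cache", "dismount all volumes"]
        return Mode.SERVICE, peer, steps
    if peer is Mode.NORMAL:
        # Redundant pair: the remaining controller becomes the survivor.
        steps = ["peer assumes control of all disk drives and LUNs"]
        return Mode.SERVICE, Mode.SURVIVOR, steps
    # Any other combination is not described in the manual; no steps are assumed.
    return Mode.SERVICE, peer, []
```

For example, `on_critical_failure(Mode.NORMAL, Mode.NORMAL)` models a failure in a redundant pair: the failing controller ends in service mode and its peer in survivor mode.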
1106
Controller controller_ID critical failure detected - Recurring system error
Explanation: The controller attempts to automatically recover from unexpected system errors, including both
software and hardware errors. If, after repeated recovery attempts (during which the controller might perform
multiple restarts), the failure persists or recurs within a predetermined time window, the controller aborts further
recovery attempts and generates this alert. When this alert is generated, the controller can no longer continue normal
operation.
System action: By default, the SAS RAID Module system information indicator is turned on and the operator is
notified of this alert by e-mail. If the failure occurs in a redundant controller, the remaining controller enters survivor
mode and assumes control of all disk drives and associated LUNs while the failed controller enters service mode. If
this failure occurs on a controller that is already in survivor mode, the controller attempts to flush all dirty cache and
dismount all volumes in an orderly manner before entering service mode.
Operator response: Replace the controller. See "Replacing a single controller using the CLI" on page 154 or
"Replacing a single controller using the SCM" on page 155.
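The "recurs within a predetermined time window" condition in the explanation of alert 1106 resembles a sliding-window failure counter. The sketch below shows that general technique only. The manual does not state the real threshold, the window length, or how the controller counts failures, so `RecoveryTracker`, `max_failures`, and `window_seconds` are hypothetical names and values.

```python
from collections import deque


class RecoveryTracker:
    """Counts failures inside a sliding time window (hypothetical values)."""

    def __init__(self, max_failures: int = 3, window_seconds: float = 600.0):
        self.max_failures = max_failures      # assumed threshold, not from the manual
        self.window_seconds = window_seconds  # assumed window, not from the manual
        self._failures = deque()

    def record_failure(self, now: float) -> bool:
        """Record a failure at time `now` (seconds); return True when
        recovery should be abandoned and an alert such as 1106 raised."""
        self._failures.append(now)
        # Drop failures that have aged out of the window.
        while self._failures and now - self._failures[0] > self.window_seconds:
            self._failures.popleft()
        return len(self._failures) >= self.max_failures
```

With these assumed defaults, three failures at 0, 100, and 200 seconds trip the condition on the third call, whereas failures spaced 700 seconds apart never do.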
1107
Controller critical failure detected - Serial port failed
Explanation: The system uses a dedicated serial port to exchange status and control between redundant controllers.
If this port fails, each controller might attempt to become the survivor controller and reset the other controller. Thus,
the other controller might actually reset the controller that reports alert 1107 before you can take additional corrective
action. When this alert is generated, the controller can no longer continue normal operation.
System action: By default, the SAS RAID Module system information indicator is turned on and the operator is
notified of this alert by e-mail. If the failure occurs in a redundant controller, the remaining controller enters survivor
mode and assumes control of all disk drives and associated LUNs while the failed controller enters service mode. If
this failure occurs on a controller that is already in survivor mode, the controller attempts to flush all dirty cache and
dismount all volumes in an orderly manner before entering service mode.
Operator response: Replace the controller. See "Replacing a single controller using the CLI" on page 154 or
"Replacing a single controller using the SCM" on page 155.
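The operator responses for alerts 1105 through 1107 reduce to a short decision rule, which a monitoring script could encode roughly as follows. The function name `operator_response` and its return strings are illustrative assumptions. They are not part of the SAS RAID Module CLI or the SCM.

```python
def operator_response(alert_code: int, seen_after_reboot: bool = False) -> str:
    """Suggest the documented operator response for alerts 1105-1107.

    seen_after_reboot applies to alert 1105 only: it is True when the
    alert was generated again after the controller was rebooted.
    """
    if alert_code == 1105:
        # 1105: reboot first; replace only if the alert comes back.
        return "replace controller" if seen_after_reboot else "reboot controller"
    if alert_code in (1106, 1107):
        # 1106 and 1107: the manual goes straight to replacement.
        return "replace controller"
    raise ValueError(f"alert {alert_code} is not covered by this sketch")
```

Replacement itself follows the procedures referenced above, "Replacing a single controller using the CLI" or "Replacing a single controller using the SCM".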
Chapter 11. Troubleshooting and support