Service Processor Reboot/Restart Policy Controls; Processor Boot-Time Deconfiguration (Cpu Repeat Gard); Memory Boot-Time Deconfiguration (Memory Repeat Gard) - IBM 6C1 Service Manual

Pseries 610
Table of Contents

Advertisement

Service Processor Reboot/Restart Policy Controls

The operating system's automatic restart policy (see operating system documentation)
defines the operating system's response to a system crash. The service processor can
be instructed to refer to that policy by the Use OS-Defined Restart Policy setup menu.

Processor Boot-Time Deconfiguration (CPU Repeat Gard)

Processor boot time deconfiguration allows for the removal of processors from the
system configuration at boot time. The objective is to minimize system failure or data
integrity exposure due to a faulty processor.
This function uses processor hardware built-in self-test (BIST) and firmware power-on
self-test (POST) to discover and isolate processor hardware failures during boot time. It
also uses the hardware error detection logic in the processor to capture run-time
recoverable and irrecoverable error indications. The firmware uses the error signatures
in the hardware to analyze and isolate the error to a specific processor.
The processors that are deconfigured remain offline for subsequent reboots until the
faulty processor hardware is replaced.
This function allows users to manually deconfigure or re-enable a previously
deconfigured processor through the service processor menu. The user can also enable
or disable this function through the service processor.

Memory Boot-Time Deconfiguration (Memory Repeat Gard)

Memory boot-time deconfiguration allows for the removal of a memory segment or
DIMM from the system configuration at boot time. The objective is to minimize system
failures or data integrity exposure due to faulty memory hardware. The hardware
resource(s) to be removed are the ones that experienced the following failures:
v A boot-time test failure
v Run-time recoverable errors over the error threshold prior to the current boot phase
v Run-time irrecoverable errors prior to the current boot phase
This function uses firmware power-on self-test (POST) to discover and isolate memory
hardware failures during boot time. It also uses the hardware error detection logic in the
memory controller to capture run-time recoverable and irrecoverable errors. The
firmware uses the error signatures in the hardware to analyze and isolate the error to
the specific memory segment or DIMM.
The memory segment or DIMM that is deconfigured remains offline during subsequent
reboots until the faulty memory hardware is replaced.
The function allows users to manually deconfigure or re-enable a previously
deconfigured memory segment/DIMM through the service processor menu. The user
can also enable or disable this function through the service processor menu.
208
Service Guide

Advertisement

Table of Contents
loading

This manual is also suitable for:

6e1

Table of Contents