IBM Power 780 Technical Overview And Introduction page 196

Hide thumbs Also See for Power 780:
Table of Contents

Advertisement

Service Focal Point (SFP)
A critical requirement in a logically partitioned environment is to ensure that errors are not lost
before being reported for service, and that an error should only be reported once, regardless
of how many logical partitions experience the potential effect of the error. The Manage
Serviceable Events task on the management console is responsible for aggregating duplicate
error reports, and ensures that all errors are recorded for review and management.
When a local or globally reported service request is made to the operating system, the
operating system diagnostic subsystem uses the Resource Monitoring and Control
Subsystem to relay error information to the Hardware Management Console. For global
events (platform unrecoverable errors, for example) the service processor will also forward
error notification of these events to the Hardware Management Console, providing a
redundant error-reporting path in case of errors in the Resource Monitoring and Control
Subsystem network.
The first occurrence of each failure type is recorded in the Manage Serviceable Events task
on the management console. This task then filters and maintains a history of duplicate
reports from other logical partitions on the service processor. It then looks at all active service
event requests, analyzes the failure to ascertain the root cause and, if enabled, initiates a call
home for service. This methodology ensures that all platform errors will be reported through
at least one functional path, ultimately resulting in a single notification for a single problem.
Extended error data
Extended error data (EED) is additional data that is collected either automatically at the time
of a failure or manually at a later time. The data that is collected is dependent on the
invocation method but includes information like firmware levels, operating system levels,
additional fault isolation register values, recoverable error threshold register values, system
status, and any other pertinent data.
The data is formatted and prepared for transmission back to IBM either to assist the service
support organization with preparing a service action plan for the service representative or for
additional analysis.
System-dump handling
In certain circumstances, an error might require a dump to be automatically or manually
created. In this event, it is off-loaded to the management console. Specific management
console information is included as part of the information that can optionally be sent to IBM
support for analysis. If additional information relating to the dump is required, or if it becomes
necessary to view the dump remotely, the management console dump record notifies the IBM
support center regarding on which management console the dump is located.
182
IBM Power 770 and 780 (9117-MMD, 9179-MHD) Technical Overview and Introduction

Hide quick links:

Advertisement

Table of Contents
loading

This manual is also suitable for:

Power 770

Table of Contents