Managing Server Hardware Faults Through The Oracle Ilom Fault Management Shell; Troubleshooting Using A Cmod Fault Remind Test Circuit; Troubleshooting System Cooling Issues - Oracle X7-8 Service Manual

Hide thumbs Also See for X7-8:
Table of Contents

Advertisement

Clear Hardware Fault Messages (Oracle ILOM)
Managing Server Hardware Faults Through the
Oracle ILOM Fault Management Shell
The Oracle ILOM Fault Management Shell enables Oracle Services personnel to view and
manage fault activity on a managed servers and other types of devices.
For more information about how to use the Oracle ILOM Fault Management Shell, see the
Oracle ILOM User's Guide for System Monitoring and Diagnostics Firmware Release 4.0.x in
the Oracle Integrated Lights Out Manager (ILOM) 4.0 Documentation Library at
http://www.
oracle.com/goto/ilom/docs.
The purpose of the Oracle ILOM Fault Management Shell is to help Oracle Service
Caution -
personnel diagnose system problems. Customers should not launch this shell or run fault
management commands in the shell unless requested to do so by Oracle Service personnel.
Troubleshooting Using a CMOD Fault Remind Test
Circuit
The CMODs have an internal test circuit that you can use to locate failed DIMMs and
verify a failed CPU after removing the CMOD from the server. The DIMM and CPU Fault
Remind circuits hold an electrical charge for 10 minutes after power is removed from the
server, allowing enough time to remove the CMOD and use the circuit. Each CMOD has a
motherboard-mounted Fault Remind button. The button is part of the CMOD Fault Remind
circuit. The circuit is charged and allows you to identify a failed DIMM or CPU after the
CMOD has been removed for the server. You must remove the CMOD from the front panel to
access the button.
For more information, see
"Identify and Remove a Faulty DIMM" on page 189
and
"Identify
and Remove a Faulty Processor" on page
171.

Troubleshooting System Cooling Issues

Maintaining the proper internal operating temperature of the server is crucial to the health of the
server. To prevent server shutdown and damage to components, address over temperature and
hardware-related issues as soon as they occur. If your server has a temperature-related fault, use
the information in the following table to troubleshoot the issue.
Troubleshooting and Diagnostics
55

Advertisement

Table of Contents
loading

Table of Contents