Troubleshooting Power Issues - Oracle Exadata X5-2 Service Manual

High capacity storage server

Troubleshoot Hardware Faults Using the Oracle ILOM Web Interface
for the server to function as a sealed system. If internal cooling areas are compromised, the
server cooling system, which relies on the movement of cool air through the server, cannot
function properly, and the airflow inside the server becomes chaotic and non-directional.
Action: Inspect the server interior to ensure that the air baffle is properly installed. Ensure that
all external-facing slots (storage drive, DVD, PCIe) are occupied with either a component or a
component filler panel. Ensure that the server top cover is in place and sits flat and snug on top
of the server.
Prevention: When servicing the server, ensure that the air baffle is installed correctly and that
the server has no unoccupied external-facing slots. Never operate the server without the top
cover installed.
Hardware Component Failure
Components, such as power supplies and fan modules, are an integral part of the server cooling
system. When one of these components fails, the server internal temperature can rise. This rise
in temperature can cause other components to enter into an over-temperature state. Additionally,
some components, such as processors, might overheat when they are failing, which can also
generate an over-temperature event.
To reduce the risk related to component failure, power supplies and fan modules are installed
in pairs to provide redundancy. Redundancy ensures that if one component in the pair fails,
the other functioning component can continue to maintain the subsystem. For example, power
supplies serve a dual function; they provide both power and airflow. If one power supply fails,
the other functioning power supply can maintain both the power and the cooling subsystems.
Action: Investigate the cause of the over-temperature event, and replace failed components
immediately. For hardware troubleshooting information, see "Troubleshooting Server Hardware
Faults" on page 22.
Prevention: Component redundancy is provided to allow for component failure in critical
subsystems, such as the cooling subsystem. However, once a component in a redundant
system fails, the redundancy no longer exists, and the risk of server shutdown and further
component failures increases. Therefore, it is important to maintain redundant systems and
replace failed components immediately.
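
The redundancy reasoning above can be illustrated with a minimal sketch. This is not an Oracle tool or an Oracle ILOM interface. The component names (PS0, PS1, FM0, FM1), the group labels, and the status strings are hypothetical and chosen only for illustration. The sketch assumes that each group contains at least two components, matching the paired installation described above.

```python
from dataclasses import dataclass


@dataclass
class Component:
    name: str   # hypothetical label, for example "PS0"
    group: str  # hypothetical redundancy group, for example "power_supply"
    ok: bool    # True if the component is functioning


def redundancy_report(components):
    """Classify each redundancy group as redundant, degraded, or failed."""
    groups = {}
    for component in components:
        groups.setdefault(component.group, []).append(component)

    report = {}
    for group, members in groups.items():
        healthy = sum(1 for member in members if member.ok)
        if healthy == len(members):
            # Every member works, so one failure can still be absorbed.
            report[group] = "redundant"
        elif healthy > 0:
            # The subsystem still runs, but the next failure stops it.
            report[group] = "degraded"
        else:
            report[group] = "failed"
    return report


# Hypothetical inventory: one of the two power supplies has failed.
inventory = [
    Component("PS0", "power_supply", True),
    Component("PS1", "power_supply", False),
    Component("FM0", "fan_module", True),
    Component("FM1", "fan_module", True),
]
report = redundancy_report(inventory)
print(report)
```

A group reported as degraded corresponds to the situation described in the Prevention note: the subsystem is still maintained by the remaining component, but redundancy no longer exists, so the failed component should be replaced immediately.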

Troubleshooting Power Issues

If your server does not power on, the cause of the problem might be:
"AC Power Connection" on page 34
