Fault Detection And Diagnostics Overview - Oracle X6 series Administration Manual

Table of Contents

Advertisement

2.
Use the
hwmgmtcli list subsystem
Where subsystem is one of the following: all, server, cooling, processor, memory, power,
storage, network, firmware, device, bios, or iomodule
Related Information
Displaying Hardware Information (hwmgmtcli), Oracle Server CLI Tools User's Guide at
http://www.oracle.com/goto/ohmp/docs

Fault Detection and Diagnostics Overview

The server supports multiple fault detection and diagnostics tools. Fault detection tools, such
as the Oracle ILOM Fault Manager, automatically poll the system to detect hardware faults and
adverse environmental conditions. Diagnostics tools, such as Oracle VTS must be run manually
and can assist you in troubleshooting server issues. The following table provides an overview of
the fault detection and diagnostics tools supported by the server.
Tool
Description
Oracle ILOM Fault
The Oracle ILOM Fault Manager is part of the Oracle ILOM
Manager
firmware embedded on the server service processor (SP). The
fault manager automatically detects system hardware faults and
environmental conditions on the server. If a problem occurs on the
server, Oracle ILOM identifies the problem in the Open Problems
table and logs information about the fault in the Event log.
Oracle Linux
Oracle Linux FMA software can be optionally installed on the
Fault Management
server through Oracle Hardware Management Pack. Oracle Linux
Architecture (FMA)
FMA can be used to manage faults detected at the operating system
(OS) level in much the same way that you manage faults in Oracle
ILOM. Fault diagnosis messages from Linux FMA are maintained
on a fault management database, which is shared with Oracle
ILOM.
Oracle Solaris
Oracle Solaris FMA is included with the Oracle Solaris operating
Fault Management
system (OS). The fault manager receives data related to hardware
Architecture (FMA)
and software errors, automatically diagnoses the underlying
problem, and responds by trying to take faulty components offline.
Auto Service Request
ASR is an optional support service for Oracle hardware. ASR
(ASR)
collects hardware telemetry data from telemetry sources (such as
Oracle ILOM) on ASR-enabled systems in your data center. ASR
filters this telemetry data and forwards what it determines to be
potential faults directly to Oracle, and then automatically initiates a
service request. You can configure features of the ASR service from
Oracle ILOM.
command:
hwmgmtcli list

Fault Detection and Diagnostics Overview

Documentation
Refer to Protecting Against Hardware
Faults: Oracle ILOM Fault Manager,
Oracle ILOM User's Guide for System
Monitoring and Diagnostics, Firmware
Release 3.2.x at:
http://www.oracle.com/goto/ilom/
docs
Refer to the Oracle Linux Fault
Management Architecture User's Guide
at:
http://docs.oracle.com/cd/
E52095_01
Refer to Oracle Solaris Administration:
Common Tasks at:
http://docs.oracle.com/cd/
E23824_01/index.html
Go to:
http://www.oracle.com/us/support/
auto-service-request/index.html
Monitoring Server Inventory and Health
81

Advertisement

Table of Contents
loading

Table of Contents