Online System Health Management; About Online System Health Management - HP Cisco MDS 9216 - Fabric Switch Configuration Manual

Cisco mds 9000 family fabric manager configuration guide, release 3.x (ol-8222-10, april 2008)
Hide thumbs Also See for Cisco MDS 9216 - Fabric Switch:
Table of Contents

Advertisement

Online System Health Management

S e n d d o c u m e n t a t i o n c o m m e n t s t o m d s f e e d b a c k - d o c @ c i s c o . c o m
switch# show cores vdc vdc2
VDC No Module-num
------ ----------
2
2
2
Online System Health Management
The Online Health Management System (system health) is a hardware fault detection and recovery
feature. It ensures the general health of switching, services, and supervisor modules in any switch in the
Cisco MDS 9000 Family.
For information on most Online Health Management System procedures, refer to the Cisco MDS 9000
Note
Family CLI Configuration Guide.
This section includes the following topics:

About Online System Health Management

The Online Health Management System (OHMS) is a hardware fault detection and recovery feature. It
runs on all Cisco MDS switching, services, and supervisor modules and ensures the general health of
any switch in the Cisco MDS 9000 Family. The OHMS monitors system hardware in the following ways:
The OHMS application launches a daemon process in all modules and runs multiple tests on each module
to test individual module components. The tests run at preconfigured intervals, cover all major fault
points, and isolate any failing component in the MDS switch. The OHMS running on the active
supervisor maintains control over all other OHMS components running on all other modules in the
switch.
On detecting a fault, the system health application attempts the following recovery actions:
Cisco MDS 9000 Family CLI Configuration Guide
70-6
Process-name
------------
5
radius
5
radius
5
radius
About Online System Health Management, page 70-6
Performing Internal Loopback Tests, page 70-7
Performing External Loopback Tests, page 70-7
The OHMS component running on the active supervisor maintains control over all other OHMS
components running on the other modules in the switch.
The system health application running in the standby supervisor module only monitors the standby
supervisor module—if that module is available in the HA standby mode. See the
Characteristics" section on page
Performs additional testing to isolate the faulty component
Attempts to reconfigure the component by retrieving its configuration information from persistent
storage.
If unable to recover, sends Call Home notifications, system messages and exception logs; and shuts
down and discontinues testing the failed module or component (such as an interface)
Chapter 70
PID
Core-create-time
---
----------------
6100
Jan 29 01:47
6103
Jan 29 01:55
6104
Jan 29 01:57
17-2.
Monitoring System Processes and Logs
"HA Switchover
OL-16184-01, Cisco MDS SAN-OS Release 3.x

Advertisement

Table of Contents
loading

Table of Contents