Displaying Diagnostic Test Results; System Health Checking-Modular Switches Only; Understanding The System Health Checker-Blackdiamond 10K Switch Only - Extreme Networks ExtremeWare XOS Guide Manual

Concepts guide
Hide thumbs Also See for ExtremeWare XOS Guide:
Table of Contents

Advertisement

Displaying Diagnostic Test Results

To display the status of the last diagnostic test run on the switch, use the following command:
show diagnostics {slot [<slot> | A | B]}
NOTE
The slot, A, and B parameters are available only on modular switches.
System Health Checking—Modular Switches Only
System health check is a useful tool to monitor the overall health of your system. The software performs
a proactive, preventive search for problems by polling and reporting the health of system components,
including I/O and management module processes, power supplies, power supply controllers, and fans.
By isolating faults to a specific module, backplane connection, control plane, or component, the system
health checker notifies you of a possible hardware fault.
This section describes the system health check functionality of the following switches:
BlackDiamond 10K
BlackDiamond 8800 family
Understanding the System Health Checker—BlackDiamond 10K
Switch Only
The BlackDiamond 10K switch supports extensive error-checking and monitoring capabilities. Packet
and system memories are protected by an error correction code (ECC). ECC is capable of correcting all
single-bit errors and detecting all other memory errors. The data path is protected by checksums and
parity checks. The system automatically corrects correctable memory errors and kills packets that
encounter checksum and parity errors during processing. Errored packets are not propagated through
the system.
The primary responsibility of the system health checker is to monitor and poll the ASIC error registers.
The system health checker processes, tracks, and reads the memory, parity, and checksum error counts.
The ASICs maintain counts of correctable and uncorrectable memory errors, as well as packets that
encountered checksum and parity errors. In a running system, some of these error counts may show
non-zero values. Occasional increments of these counters does not mean faulty hardware is detected or
that hardware requires replacement. If you see persistent increments of these counters, please contact
Extreme Networks Technical Support.
In addition, you can enable the system health checker to check the backplane, CPU, and I/O modules
by periodically sending diagnostic packets and checking the validity of the looped back diagnostic
packets.
In summary, two modes of health checking are available: polling and backplane diagnostic packets.
These methods are briefly described in the following:
Polling is always enabled on the system and occurs every 60 seconds by default. The system health
checker polls and tracks the ASIC counters that collect correctable and uncorrectable packet memory
errors, checksum errors, and parity errors on a per ASIC basis. By reading and processing the
registers, the system health check detects and associates faults to specific system ASICs.
ExtremeWare XOS 11.3 Concepts Guide
System Health Checking—Modular Switches Only
189

Advertisement

Table of Contents
loading

This manual is also suitable for:

Extremeware xos 11.3

Table of Contents