Troubleshooting A Failed Supervisor - Cisco MDS 9000 Series Troubleshooting Manual

Cisco family switch troubleshooting guide


Chapter 2
Troubleshooting Switch System Issues
Send comments to mdsfeedback-doc@cisco.com.

Troubleshooting a Failed Supervisor

This section provides a workaround for a failed supervisor under certain conditions. An example
situation is used to describe the problem and the workaround.
In this case, the supervisor failed when the standby was reloaded, or when the supervisor was replaced
with a new one. It was discovered that either the failed supervisor had its version of code changed, or
the running configuration on the active supervisor had not been saved with the appropriate boot parameters.
In either case, the problem was mismatched code on the active and standby supervisors. One clue that
indicated the mismatched code was a "heartbeat" error on the active supervisor. Because of this error,
the current flash images could not be copied from the active supervisor to the standby.
The workaround was to copy the images to compact flash, switch consoles, and load code from compact
flash onto the second supervisor. The second supervisor was at a "loader" prompt, which indicates
missing boot statements. When a dir slot0: command was executed, none of the images appeared. This
may have been due to mismatched images on the supervisors, or to the supervisor not having current
images in its flash. Performing a copy slot0: bootflash: command copied the images anyway. Once the
images were loaded on the second supervisor and the boot statements were confirmed and saved on the
active supervisor, the second supervisor loaded and came up in "standby-ha" mode.
As a best practice, we recommend the following to keep the switch from ending up in this situation:
1. Make sure both supervisors have their flash loaded with the same versions of kickstart and system
   images.
2. Make sure that the proper boot statements for Sup1 and Sup2 are set to run the same code.
3. Once the boot statements are configured on the active supervisor, make sure to perform a copy run
   start.
4. Make a copy of the running configuration to compact flash as a safe backup.
5. Always perform a copy run start after modifying the running configuration, once the system is
   operating the way you want.
6. Never "init" the switch unless you understand that the switch will lose everything.
7. Keep backup copies of the running kickstart and system images on compact flash.
OL-5183-02, Cisco MDS SAN-OS Release 1.3
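The first two recommendations amount to a consistency check between the two supervisors. The following is only an illustrative sketch in Python, not a Cisco tool: the dictionaries, the function name find_mismatches, and the image file names are all hypothetical, and the values are assumed to be entered by hand after inspecting each supervisor's boot statements. Nothing here parses real switch output.

```python
from typing import Dict, List

# Hypothetical model: each supervisor is described by the image file
# its boot statements point to, keyed by image type.
BootVars = Dict[str, str]


def find_mismatches(active: BootVars, standby: BootVars) -> List[str]:
    """Return a description of each kickstart/system discrepancy."""
    problems: List[str] = []
    for image in ("kickstart", "system"):
        a = active.get(image)
        s = standby.get(image)
        if not a or not s:
            # A missing boot statement is what leaves a supervisor
            # sitting at the "loader" prompt.
            problems.append(
                f"{image}: boot statement missing on at least one supervisor"
            )
        elif a != s:
            # Different images on the two supervisors is the
            # mismatched-code condition described in this section.
            problems.append(f"{image}: active boots {a}, standby boots {s}")
    return problems


if __name__ == "__main__":
    # Placeholder file names, not real image names.
    sup1 = {"kickstart": "kickstart-A.bin", "system": "system-A.bin"}
    sup2 = {"kickstart": "kickstart-A.bin", "system": "system-B.bin"}
    for line in find_mismatches(sup1, sup2):
        print(line)
```

An empty result means both supervisors are set to run the same code; any entry in the list is a reason to correct the boot statements and save the configuration before reloading or replacing a supervisor.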
Cisco MDS 9000 Family Troubleshooting Guide
