Performing Cluster Recovery Using The Service Assistant - IBM Storwize V7000 Troubleshooting And Maintenance Manual

Table of Contents

Advertisement

To remove cluster information from a node canister with an error 550 or 578, follow this procedure using
the service assistant:
1. Point your browser to the service IP address of one of the node canisters.
If you do not know the IP address or if it has not been configured, you must assign an IP address
using the initialization tool.
2. Log on to the service assistant.
3. Select Manage cluster.
4. Click Remove cluster data.
5. Click Confirm Remove Cluster.
6. Go to the Home page to view the error condition and node status for the node canisters in the cluster.
All node canisters for this cluster must be in candidate status. The error conditions must be None.
Note: A node that is powered off might not show up in this list of nodes for the cluster. Diagnose
hardware problems directly on the node using the service assistant IP address and by physically
verifying the LEDs for the hardware components.
7. Resolve any hardware errors until the error condition for all node canisters in the cluster is None.
8. Ensure that all node canisters in the cluster display a status of candidate.
When all node canisters display a status of candidate and all error conditions are None, you can run the
cluster recovery procedure.

Performing cluster recovery using the service assistant

Start recovery when all node canisters that were members of the cluster are online and are in candidate
status. Do not run the recovery procedure on different node canisters in the same cluster. This restriction
includes remote clusters also.
Attention: This service action has serious implications if not performed properly. If at any time during
the procedure, you encounter an error, stop and call IBM Support.
You might see any one of the following categories of messages:
4
v T3 successful. The volumes are back online. Use the final checks to get your environment operational
again.
4
v T3 incomplete. One or more of the volumes is offline because there was fast write data in the cache.
Further actions are required to bring the volumes online again. Contact IBM Support for more details
regarding how to bring the volumes online again.
v T3 failed. Call IBM Support. Do not attempt any further action.
The recovery can be run from any node canister in the cluster. The node canisters must not have
participated in any other cluster. To receive optimal results in maintaining the I/O group ordering, run
the recovery from a node canister that was in I/O group 0.
1. Point your browser to the service IP address of one of the node canisters.
If you do not know the IP address or if it has not been configured, you must assign an IP address
using the initialization tool.
2. Log on to the service assistant.
3. Select Recover cluster.
4. Follow the online instructions to complete the recovery procedure.
Verify the date and time of the last quorum time. The time stamp must be less than 10 minutes before
the cluster failure. The time stamp format is YYYYMMDD hh:mm, where YYYY is the year, MM is the
month, DD is the day, hh is the hour, and mm is the minute.
Chapter 6. Recovery procedures
55

Advertisement

Table of Contents
loading

Table of Contents