Performing Cluster Recovery Using The Service Assistant - IBM Storwize V7000 Troubleshooting And Maintenance Manual

Hide thumbs Also See for Storwize V7000:

Table of Contents

To remove cluster information from a node canister with an error 550 or 578, follow this procedure using

the service assistant:

1. Point your browser to the service IP address of one of the node canisters.

If you do not know the IP address or if it has not been configured, you must assign an IP address

using the initialization tool.

2. Log on to the service assistant.

3. Select Manage cluster.

4. Click Remove cluster data.

5. Click Confirm Remove Cluster.

6. Go to the Home page to view the error condition and node status for the node canisters in the cluster.

All node canisters for this cluster must be in candidate status. The error conditions must be None.

Note: A node that is powered off might not show up in this list of nodes for the cluster. Diagnose

hardware problems directly on the node using the service assistant IP address and by physically

verifying the LEDs for the hardware components.

7. Resolve any hardware errors until the error condition for all node canisters in the cluster is None.

8. Ensure that all node canisters in the cluster display a status of candidate.

When all node canisters display a status of candidate and all error conditions are None, you can run the

cluster recovery procedure.

Performing cluster recovery using the service assistant

Start recovery when all node canisters that were members of the cluster are online and are in candidate

status. Do not run the recovery procedure on different node canisters in the same cluster. This restriction

includes remote clusters also.

Attention: This service action has serious implications if not performed properly. If at any time during

the procedure, you encounter an error, stop and call IBM Support.

You might see any one of the following categories of messages:

v T3 successful. The volumes are back online. Use the final checks to get your environment operational

again.

v T3 incomplete. One or more of the volumes is offline because there was fast write data in the cache.

Further actions are required to bring the volumes online again. Contact IBM Support for more details

regarding how to bring the volumes online again.

v T3 failed. Call IBM Support. Do not attempt any further action.

The recovery can be run from any node canister in the cluster. The node canisters must not have

participated in any other cluster. To receive optimal results in maintaining the I/O group ordering, run

the recovery from a node canister that was in I/O group 0.

1. Point your browser to the service IP address of one of the node canisters.

If you do not know the IP address or if it has not been configured, you must assign an IP address

using the initialization tool.

2. Log on to the service assistant.

3. Select Recover cluster.

4. Follow the online instructions to complete the recovery procedure.

Verify the date and time of the last quorum time. The time stamp must be less than 10 minutes before

the cluster failure. The time stamp format is YYYYMMDD hh:mm, where YYYY is the year, MM is the

month, DD is the day, hh is the hour, and mm is the minute.

Chapter 6. Recovery procedures

Table of Contents

Performing Cluster Recovery Using The Service Assistant - IBM Storwize V7000 Troubleshooting And Maintenance Manual

Performing cluster recovery using the service assistant

Related Manuals for IBM Storwize V7000

Related Content for IBM Storwize V7000

Table of Contents