IBM Power Systems 775 Manual page 299

For aix and linux hpc solution
Table of Contents

Advertisement

Example 5-23 Stopping LoadLeveler
xdsh compute -v -l loadl llrctl stop
xdsh service -v -l loadl llctl stop
Stopping GPFS and unmounting the file system
After LoadLeveler is stopped, GPFS also is stopped. It is important to ensure that all
applications that need to access files within GPFS are stopped before performing this step. A
single command is run on any storage node to complete this step. For this example, we use a
storage node called f01st01 (for frame one storage node one), as shown in Example 5-24.
Example 5-24 Stopping GPFS
xdsh f01st01 -v mmshutdown -a
Important: This command shuts down GPFS everywhere it is running in the cluster. After
the command completes, GPFS is down and no longer available.
Optional: Capturing the HFI lInk status
Before the compute LPARs are shut down, it might be useful to get a state of the HFI link
status. This status is useful if there are any HFI errors before shutting down so that the errors
are understood when the cluster is restarted. This restart is done by listing the connection
state for the BPAs and FSPs and listing the CEC link status.
Verify that CNM successfully contacted all BPAs and FSPs by issuing the command that is
shown in Example 5-25.
Example 5-25 Checking the status of all the BPAs and FSPs
lsnwcomponents
Example 5-26 must match the number of CEC drawers that are in the cluster.
Example 5-26 Matching the number of the CEC drawer
lsnwloc | grep -v EXCLUDED | wc -l
If the number is incorrect, check for any issues that cause a CEC drawer to be excluded by
CNM, as shown in Example 5-27.
Example 5-27 Checking issues with CEC
$ xdsh f01st01 -v mmshutdown -a
$ lsnwloc | grep EXCLUDED
Shutting down compute nodes
Now that LoadLeveler and GPFS are stopped, the next step is to shut down the compute
nodes, as shown in Example 5-28. The compute nodes are shut down first because other
nodes within the cluster do not depend on the compute nodes.
Example 5-28 Shutting down the compute nodes
xdsh compute -v shutdown -h now
Chapter 5. Maintenance and serviceability
285

Advertisement

Table of Contents
loading

Table of Contents