Page 1
Power Systems Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y) GI11-9831-00...
Page 3
Power Systems Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y) GI11-9831-00...
Page 4
Before using this information and the product it supports, read the information in “Notices,” on page 271, “Safety notices” on page v, the IBM Systems Safety Notices manual, G229-9054, and the IBM Environmental Notices and User Guide, Z125–5823. This edition applies to IBM Power Systems servers that contain the POWER7 processor and to all associated models.
Page 6
Installing the disk drive tray . . 253 Terms and conditions. . 280 Removing the tier 2 management card . . 255 Installing the tier 2 management card . . 256 Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 8
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Page 9
Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet.
Page 10
Pack the rack cabinet in the original packaging material, or equivalent. Also lower the leveling pads to raise the casters off of the pallet and bolt the rack cabinet to the pallet. (R002) (L001) (L002) viii Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 11
(L003) All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product.
Page 12
(C030) Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: The equipment is suitable for installation in the following:...
Tier 1 CRU at your request, you are charged for the installation. v Tier 2 customer replaceable unit: You can install a Tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for your blade server.
Features and specifications of the IBM BladeCenter PS700 blade server are summarized in this overview. The PS700 Type 8406 is a single-wide (non-expandable) blade server. The PS700 blade server is used in an IBM BladeCenter H (8852 and 7989), BladeCenter HT (8740 and 8750), or BladeCenter S (8886 and 7779) chassis unit.
Page 15
Four core, single socket (4-way) management module v Front panel LEDs processors @ 3.0 GHz v Automatic server restart (ASR) v IBM Director v 64 GB maximum in 8 very low v SOL through FSP v Hardware Management Console profile (VLP) DIMM slots; Supports...
PS700 from a minimum of 4 GB to a maximum of 64 GB. See Chapter 3, “Parts listing, Type 8406,” on page 229 for memory modules that you can order from IBM. Memory module rules: v Install DIMM fillers in unused DIMM slots for proper cooling.
Blade server control panel buttons and LEDs Blade server control panel buttons and LEDs provide operational controls and status indicators. Note: Figure 2 shows the control-panel door in the closed (normal) position. To access the power-control button, you must open the control-panel door. Figure 2.
The information LED can be turned off through the Web interface of the management module or through IBM Director Console. 3 Blade-error LED: When this amber LED is lit, it indicates that a system error has occurred in the blade server.
You can start the blade server in any of the following ways. v Start the blade server by pressing the power-control button on the front of the blade server. The power-control button is behind the control panel door, as described in “Blade server control panel buttons and LEDs”...
High-Speed (CFFh) expansion card connector (P1-C12) DIMM 5-8 connectors (See Figure 4 on page 9 for individual connectors.) 3V lithium battery connector (P1-E1) Figure 4 on page 9 shows individual DIMM connectors. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Figure 4. DIMM connectors. Base unit connectors System-board LEDs Use the illustration of the LEDs on the system board to identify a light emitting diode (LED). Remove the blade server from the BladeCenter unit, open the cover, press the blue button to see any error LEDs that were turned on during error processing, and use Figure 5 to identify the failing component.
Page 22
Table 3. PS700 LEDs (continued) Callout Base unit LEDs CIOv (1Xe) expansion card connector LED High-Speed (CFFh) expansion card connector LED HDD2 LED DIMM 5-8 LEDs Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Error checking hardware ranges from parity error detection coupled with processor instruction retry and bus retry, to ECC correction on caches and system buses. IBM hardware error checkers have these distinct attributes: v Continuous monitoring of system operations to detect potential calculation errors...
Page 24
In some circumstances, an error might require a dump to show more data. The Integrated Virtualization Manager (IVM) or Hardware Management Console (HMC) sets up a dump area. Specific IVM or HMC information is included as part of the information that can optionally be sent to IBM support for analysis.
Note: If you power off the blade through the management module while the service processor is performing a dump, platform dump data is lost. You might be asked to retrieve a dump to send it to IBM Support for analysis. The location of the dump data varies by operating system.
Un-P1-T4 Ethernet HEA0_B Un-P1-T5 Machine Location Code Utttt.mmm.sssssss Um codes are for firmware. The format is the same as for a Un location code. Um = Utttt.mmm.sssssss Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Table 4. Location codes (continued) Components Physical Location Code CRU LED Firmware version Um-Y1 Reference codes Reference codes are diagnostic aids that help you determine the source of a hardware or operating system problem. To use reference codes effectively, use them in conjunction with other service and support procedures.
D1513901 Created at: 2007-11-13 19:30:20 SRC Version: 0x02 Hex Words 2-5: 020110F0 52298910 C1472000 200000FF Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
B1xxxxxx - Service processor error, such as a boot problem v B6xxxxxx - Licensed Internal Code or hardware event error v B9xxxxxx - Software installation error or IBM i IPL error. See "Recovering from IPL or system failures" in the IBM i Information Center at http://publib.boulder.ibm.com/infocenter/powersys/v3r1m5/ index.jsp?topic=/ipha5_p5/iplprocedure.htm.
Page 30
Blade power latch fault 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 31
Table 7. 1xxxyyyy SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 32
VRM voltage adjustment failure 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Table 7. 1xxxyyyy SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 34
Refer to the hosting partition for problem analysis. 632CC110 SCSD command timeout Refer to the hosting partition for problem analysis. occurred. 632CC210 Informational system log entry No corrective action is required. only. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 35
Table 8. 6xxxyyyy SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Replace the management card, as described in or using system VPD. “Removing the tier 2 management card” on page 255 and “Installing the tier 2 management card” on page 256. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Table 11. A700yyyy Licensed internal code SRCs (continued) Reference Code Description Action A7004721 The World Wide Port Name (WWPN) Prefix is https://www-912.ibm.com/supporthome.nsf/ not valid. document/51455410 A7004730 Informational system log entry only. No corrective action is required. A7004740 Informational system log entry only.
Page 38
For a Linux operating system, boot the blade server using the stand-alone diagnostics CD or a NIM server; then, run diagnostics against the failing adapter. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 39
Table 12. AA00E1A8 to AA260005 Partition firmware attention codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
720A Power-off reset occurred. FipsDump should be analyzed: Possible software problem Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
B200xxxx Logical partition SRCs A B200xxxx SRC is a logical partition reference code that is related to logical partitioning. Table 14 describes system reference codes that might be displayed if system firmware detects a problem. Suggested actions to correct the problem are also listed. Note: For problems persisting after completing the suggested actions, see “Checkout procedure”...
Page 42
Set the partition to Normal. startup of a partition. The partition could not start at the Timed Power On setting because the partition was not set to Normal. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 43
Table 14. B200xxxx Logical partition SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 44
Verify that the adapter type is supported. startup of a partition. The adapter type might not be supported. B2003088 Informational system log entry only. No corrective action is required. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 45
Table 14. B200xxxx Logical partition SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 46
Look for other errors and resolve them. startup of a partition. There was an error writing the partition main storage dump to the partition load source. The main store dump startup will continue. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 47
B2006006 A problem occurred during the Contact IBM support. startup of a partition. The partition could not reserve the memory required for IPL. B2006012 During the startup of a partition, the Go to “Isolating firmware problems”...
Page 48
2. Check for server firmware updates; then, install the updates if available. B2008111 A problem occurred during the Check for server firmware updates; then, install the startup of a partition. updates if available. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 49
Table 14. B200xxxx Logical partition SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 50
B200E0AA A problem occurred during the Go to “Isolating firmware problems” on page 218. power off of a partition. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Table 14. B200xxxx Logical partition SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 52
260. 0443 Service processor failure. Replace the system-board and chassis assembly, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 53
Table 15. B700xxxx Licensed internal code SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 54
218. 4402 A system firmware error occurred while Go to “Isolating firmware problems” on page attempting to allocate the memory 218. necessary to create a platform dump. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 55
Table 15. B700xxxx Licensed internal code SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 56
6906 System bus error Replace the system-board and chassis assembly, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 57
Table 15. B700xxxx Licensed internal code SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 58
Remote I/O (RIO), high-speed link Replace the system-board and chassis (HSL), or 12X connection failure. assembly, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 59
Table 15. B700xxxx Licensed internal code SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
The device data structure is corrupted 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 61
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 62
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 63
2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. BA00E830 Failure when initializing ibm,event-scan 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly”...
Page 64
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 65
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 66
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 67
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 68
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 69
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 70
DHCP configuration files. BA01D053 DHCP::discover received a reply, but Verify that the DHCP server is properly without a message type configured. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 71
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 72
1. Go to “Checkout procedure” on page 184. watchdog timer failed. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 73
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 74
2. If the problem remains: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 75
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 76
3. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 77
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 78
4. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 79
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 80
If the problem persists: 1) Go to “Checkout procedure” on page 184. 2) Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 81
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 82
2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. BA153002 Gigabit Ethernet adapter failure Verify that the MAC address programmed in the FLASH/EEPROM is correct. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 83
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 84
1. Go to “Checkout procedure” on page 184. contains a null character 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 85
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 86
1. Go to “Checkout procedure” on page 184. time-of-day reported an error 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 87
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 88
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 89
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 90
Failed to flash a firmware update lid Download a new firmware update image and retry the update. BA278006 Unable to unlock the firmware update Restart the blade server. lid manager Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 91
Failed to reboot the system after a Restart the blade server. firmware flash update BA278009 The operating system's server firmware Go to the IBM download site at update management tools are www14.software.ibm.com/webapp/set2/sas/ incompatible with this system. f/lopdiags/home.html to download the latest version of the service aids package for Linux.
Page 92
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 93
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
Page 94
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 95
Table 16. BA000010 to BA400002 Partition firmware SRCs (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, then you can stop performing the remaining actions. v See Chapter 3, “Parts listing, Type 8406,”...
D1513901 Created at: 2007-11-13 19:30:20 SRC Version: 0x02 Hex Words 2-5: 020110F0 52298910 C1472000 200000FF Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
C1001F00 to C1645300 Service processor checkpoints The C1xx progress codes, or checkpoints, offer information about the initialization of both the service processor and the server. Service processor checkpoints are typical reference codes that occur during the initial program load (IPL) of the server. Table 18 lists the progress codes that might be displayed during the power-on self-test (POST), along with suggested actions to take if the system hangs on the progress code.
Page 98
Asset protection IPL step in progress 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 99
Table 18. C1001F00 to C1645300 checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 100
Processor enable machine check test in 1. Go to “Checkout procedure” on page 184. progress 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 101
Table 18. C1001F00 to C1645300 checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 102
ASIC interface alignment step in 1. Go to “Checkout procedure” on page 184. progress 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 103
Table 18. C1001F00 to C1645300 checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 104
2. Replace the system-board, as described in byte (xx) will increment up from 00 to “Replacing the FRU system-board and 1F every second while it waits. chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Table 18. C1001F00 to C1645300 checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 106
1. Go to “Recovering the system firmware” on page 220. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 107
Table 19. C2001000 to C20082FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 108
1. Go to “Recovering the system firmware” on operational page 220. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 109
Table 19. C2001000 to C20082FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 110
1. Go to “Recovering the system firmware” on page 220. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 111
Table 19. C2001000 to C20082FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 112
1. Go to “Recovering the system firmware” on page 220. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 113
Table 19. C2001000 to C20082FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
In some cases, a server might hang (or stall) at one of these progress codes without displaying an 8-character system reference code (SRC). Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 115
Table 21 lists the progress codes that might be displayed during the power-on self-test (POST), along with suggested actions to take if the system hangs on the progress code. Only when you experience a hang condition should you take any of the actions described for a progress code. In the following progress codes, x can be any number or letter.
Page 116
1. Go to “Checkout procedure” on page 184. GUI package 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 117
Table 21. CA000000 to CA2799FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 118
Create HCA node 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 119
Table 21. CA000000 to CA2799FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 120
Probing for adapter FCODE; evaluate if 1. Go to “Checkout procedure” on page 184. present 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 121
Table 21. CA000000 to CA2799FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 122
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 123
260. CA00E198 The system is rebooting to enact changes Go to “Boot problem resolution” on page 190. that were specified in ibm,client-architecture-support CA00E199 The system is rebooting to enact changes 1. Verify that: that were specified in the boot image v The bootp server is correctly configured;...
Page 124
1. Go to “Checkout procedure” on page 184. mode boot list 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 125
Table 21. CA000000 to CA2799FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 126
Privileged-access password prompt 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 127
Table 21. CA000000 to CA2799FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 128
Initializing lpevent 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 129
Table 21. CA000000 to CA2799FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 130
2. If the problem persists: a. Go to “Checkout procedure” on page 184. b. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 131
Table 21. CA000000 to CA2799FF checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
(POST), along with suggested actions to take if the system hangs on the progress code. Only when you experience a hang condition should you take any of the actions described for a progress code. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 133
Table 22. D1001xxx to D1xx3FFF dump codes v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 134
1. Go to “Checkout procedure” on page 184. dump if enough space) 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 135
Table 22. D1001xxx to D1xx3FFF dump codes (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 136
Store information about existing core 1. Go to “Checkout procedure” on page 184. files 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Table 22. D1001xxx to D1xx3FFF dump codes (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Page 138
Get optimized cache 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 139
Table 23. D1xx3y01 to D1xx3yF2 checkpoints (continued) v If the system hangs on a progress code, follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. If an action solves the problem, you can stop performing the remaining actions.
Hypervisor handshaking is complete 1. Go to “Checkout procedure” on page 184. 2. Replace the system-board, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Service request numbers (SRNs) Service request numbers (SRNs) are error codes that the operating system generates. The codes have three digits, a hyphen, and three or four digits after the hyphen. SRNs can be viewed using the AIX diagnostics or the Linux service aid “diagela” if it is installed. Note: The “diagela”...
Page 142
3. If the 8-digit error and location codes were NOT reported, then run diagnostics in problem determination mode and record and report the 8-digit error and location codes for this SRN. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 143
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 651-140 Display Character test failed. Note: Diagnostic will provide this SRN but there is no action to be taken. Do not perform operator panel test from diagnostics. 651-151 152 2E2 Sensor indicates a voltage is outside the normal range.
Page 144
Correctable error threshold exceeded. Go to “Performing the checkout procedure” on page 184. 651-669 Correctable error threshold exceeded. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 145
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 651-66A Correctable error threshold exceeded. Go to “Performing the checkout procedure” on page 184. 651-66B Correctable error threshold exceeded. Go to “Performing the checkout procedure” on page 184. 651-674 Failed memory module. Go to “Performing the checkout procedure” on page 184. 651-675 Failed memory module.
Page 146
Uncorrectable memory error. Go to “Performing the checkout procedure” on page 184. 651-785 303 214 Uncorrectable memory error. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 147
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 651-786 304 214 Uncorrectable memory error. Go to “Performing the checkout procedure” on page 184. 651-789 2CD 214 Uncorrectable memory error. Go to “Performing the checkout procedure” on page 184. 651-78A 2CE 214 Uncorrectable memory error.
Page 148
Go to “Performing the checkout procedure” on page 184. 652-66B A non-critical error has been detected: correctable error threshold exceeded. Schedule deferred maintenance. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 149
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 652-731 A non-critical error has been detected: intermediate or system bus address parity error. Schedule deferred maintenance. Go to “Performing the checkout procedure” on page 184. 652-732 A non-critical error has been detected: intermediate or system bus data parity error. Schedule deferred maintenance.
Page 150
External loopback fairness and parity tests failed. Go to “Performing the checkout procedure” on page 184. 887-112 External loopback (twisted pair) test failed. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 151
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 887-113 External loopback (twisted pair) parity test failed. Go to “Performing the checkout procedure” on page 184. 887-114 Ethernet loopback (twisted pair) fairness test failed. Go to “Performing the checkout procedure”...
Page 152
PCI bus error detected by controller. Go to “Performing the checkout procedure” on page 184. 2506-7001 Temporary disk data error. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 153
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 2506-8008 A permanent Cache Battery Pack failure occurred. Go to “Performing the checkout procedure” on page 184. 2506-8009 Impending Cache Battery Pack failure. Go to “Performing the checkout procedure” on page 184. 2506-8150 2506 Controller failure.
Page 154
1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. Replace any parts reported by the diagnostic program. 3. Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 155
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 252B-710 252B Permanent adapter failure. 1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. Replace any parts reported by the diagnostic program. 3.
Page 156
Go to “Performing the checkout procedure” on page 184. 2567-xxx 2567 USB integrated system-board and chassis assembly. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 157
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 256D-201 256D 221 Adapter configuration error. 1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. Replace any parts reported by the diagnostic program. 3.
Page 158
Error Log Analysis indicates that a parity error has been detected for the Fibre Channel adapter card. The adapter must be replaced immediately. Failure to do so could result in data being read or written incorrectly. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 159
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 2604-706 2604 Error Log Analysis indicates that a fatal hardware error has occurred for the Fibre Channel adapter card. This adapter was successfully taken off-line. It will remain off-line until reconfigured or the system is rebooted.
Page 160
Command timeouts threshold exceeded. Go to “Performing the checkout procedure” on page 184. 2640-133 2640 Command timeout with error condition. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 161
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 2640-134 2640 Hardware command or DMA failure. Go to “Performing the checkout procedure” on page 184. 2640-136 2640 2631 Timeout waiting for controller or drive with no busy status. Go to “Performing the checkout procedure”...
Page 162
1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. Replace any parts reported by the diagnostic program. 3. Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 163
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 2E13-605 2E13 Error Log Analysis indicates permanent adapter failure is reported on the other port of this adapter. 1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)”...
Page 164
1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. Replace any parts reported by the diagnostic program. 3. Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 165
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 2E15-605 2E15 Error Log Analysis indicates permanent adapter failure is reported on the other port of this adapter. 1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)”...
Page 166
1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. Replace any parts reported by the diagnostic program. 3. Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 167
Table 25. 101-711 through FFC-725 SRNs (continued) Description and Action 2E23-106 2E23 External Wrap with IP Checksum Test Failure 1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2.
Page 168
“POST progress codes (checkpoints)” on page 84. 2. Replace any parts reported by the diagnostic program. 3. Go to “Performing the checkout procedure” on page 184. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
A00-FF0 through A24-xxx SRNs AIX might generate service request numbers (SRNs) from A00-FF0 to A24-xxx. Note: Some SRNs in this sequence might have 4 rather than 3 digits after the dash (–). Table 26 shows the meaning of an x in any of the following SRNs, such as A01-00x. Table 26.
Page 170
“POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 171
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A02-06x Memory Data error (Bad data going to 1. Check the BladeCenter management-module memory). event log; if an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2.
Page 172
“POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 173
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A03-16x I/O Expansion unit not in an operating 1. Check the BladeCenter management-module state. event log; if an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2.
Page 174
“POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 175
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A05-21x System shutdown due to Over 1. Make sure that: temperature condition. a. The room ambient temperature is within the system operating environment. b. There is unrestricted air flow around the system.
Page 176
“POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 177
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A0D-36x Other IPL Diagnostic Error. 1. Check the BladeCenter management-module event log; if an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board.
Page 178
If no for an unrecoverable error. entry is found, replace the system-board and chassis assembly. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 179
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A11-550 Recoverable errors on resource indicate 1. If repair is not immediately available, reboot a trend toward an unrecoverable error. and the resource will be deconfigured; However, the resource could not be operations can continue in a degraded mode.
Page 180
“POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 181
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A12-16x A non-critical error has been detected, a 1. Check the BladeCenter management-module system bus internal hardware/switch event log; if an error was recorded by the error. system, see “POST progress codes (checkpoints)”...
Page 182
“POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 183
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A13-16x A non-critical error has been detected, 1. Check the BladeCenter management-module an I/O expansion unit not in an event log; if an error was recorded by the operating state. system, see “POST progress codes (checkpoints)”...
Page 184
“POST progress codes (checkpoints)” on page 84. 3. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 185
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A15-22x Fan failure and Over temperature 1. Check the BladeCenter management-module condition. event log; if an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board.
Page 186
“POST progress codes (checkpoints)” on page 84. 2. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 187
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A1D-19x A non-critical error has been detected, a 1. Check the BladeCenter management-module service processor error accessing real event log; if an error was recorded by the time clock/time-of-day clock. system, see “POST progress codes (checkpoints)”...
Page 188
“POST progress codes (checkpoints)” on page 84. 2. Replace part numbers reported by the diagnostic program. 3. If no entry is found, Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Table 27. A00-FF0 through A24-xxx SRNs (continued) Description FRU/action A24-xxx Spurious interrupts have exceeded 1. Check the BladeCenter management-module threshold. event log; if an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2. Replace part numbers reported by the diagnostic program.
Page 190
2. Replace any parts reported by the diagnostic program. 3. Replace the system board and chassis assembly, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 191
Table 28. ssss-102 through ssss-640 SRNs (continued) Description and action ssss-122 ssss A SCSD reservation conflict error. 1. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84. 2.
Page 192
2. Replace any parts reported by the diagnostic program. 3. Replace the system board and chassis assembly, as described in “Replacing the FRU system-board and chassis assembly” on page 260. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 193
Failing function codes 151 through 2E33 Failing function codes (FFCs) identify a function within the system unit that is failing. Table 29 describes the component that each function code identifies. Note: When replacing a component, perform system verification for the component. See “Using the diagnostics program”...
Page 194
System-board and chassis assembly 25C4 Broadcom Ethernet adapter 2607 Emulex 8Gb PCI-Express Fibre Channel Expansion Card 2624 System-board and chassis assembly (InfiniBand Host Channel Adapter) 2631 System-board and chassis assembly Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
1. If the firmware hangs on an eight-digit progress code, see “POST progress codes (checkpoints)” on page 84. 2. If the firmware records an eight-digit error code, see “System reference codes (SRCs)” on page Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 197
3. If the AIX operating system records a service request number (SRN), see “Service request numbers (SRNs)” on page 129. 4. Check the BladeCenter management-module event log. If an error was recorded by the system, see “POST progress codes (checkpoints)” on page 84 or “System reference codes (SRCs)”...
Select the resources to be tested and record any SRNs; then go to “Service request numbers (SRNs)” on page 129. This ends the Linux procedure. For more information about installing and using all supported operating systems, search the IBM Support Site. Verifying the partition configuration Perform this procedure if there is a configuration problem with the system or a logical partition.
3. When testing is complete, press F3 until the Diagnostic Operating Instructions panel is displayed, then press F3 to exit the diagnostic program. Starting stand-alone diagnostics from a CD Perform these procedures to start the stand-alone diagnostics from a CD. These procedures can be used if the blade server is running a Linux operating system or if an AIX operating system cannot start the concurrent diagnostics program.
“vs100” as the terminal type is recommended; however, the function keys (F#) may not work. In this case, press Esc and the number in the screen menus. For example, instead of F3 you can press Esc and Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
7. When testing is complete, press F3 until the Diagnostic Operating Instructions screen is displayed; then press F3 again to exit the diagnostic program. Using the diagnostics program Follow the basic procedures for running the diagnostics program. 1. Start the diagnostics from the AIX operating system, from a CD, or from a management server. See “Starting AIX concurrent diagnostics”...
If replacing the CD or DVD drive does not resolve the problem, replace the media tray. f. If booting on all servers fails using the new media tray, replace the following in the BladeCenter unit: v Management module Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
v Midplane 4. If you are attempting to boot from a hard disk drive. a. Verify that the hard disk drive is installed. b. Select the CD or DVD drive as the boot device. c. Go to “Performing the checkout procedure” on page 184. d.
If there is no airflow, the blower is not working. This causes the blade server to overheat and shut down. v Ensure that the self configuring SCSI device (SCSD) bus and devices are configured correctly. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Management module service processor problems Determine if a problem is a management module service processor problem and, if so, the corrective action to take. v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved.
The hardware that controls PCI adapters and PCI card slots detected an error. The direct select address (DSA) portion of the system reference code (SRC) identifies the location code of the failing component. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 207
The following table shows the syntax of a nine-word B700xxxx SRC as it might be displayed in the event log of the management module. The first word of the SRC in this example is the message identifier, B7001111. This example numbers each word after the first word to show relative word positions.
1. Use the BladeCenter management module to verify that local power control for the blade server is enabled. 2. Reseat the control-panel connector. 3. Replace the bezel assembly. 4. Replace the system-board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 209
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Type 8406,” on page 229 to determine which components are CRUs and which components are FRUs.
Verify that the partition has load source and console I/O resources. 4. Check the IPL mode of the system or failing partition. 5. For further assistance, contact IBM Support. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 211
3. Install new memory DIMMs, as described in “Installing a memory module” on page 245. See “Supported DIMMs” on page 4 for more information. NEXTLVL Contact IBM Support. Symbolic CRU PIOCARD The hardware that 1. Collect the error log information.
2 and 3: (SN#YL31W7120029) SYS F/W: CEC Hardware VPD. See procedure FSPSP07, FSPSP28 then FSP0200 (50000014 B15A3303 XX 44444444 55555555 66666666 77777777 888888888 99999999) Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 213
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Type 8406,” on page 229 to determine which components are CRUs and which components are FRUs.
Page 214
If you have replaced all of the memory DIMM pairs, then continue with the next step. v No: This ends the procedure. 5. Replace the system-board and chassis assembly. This ends the procedure. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 215
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Type 8406,” on page 229 to determine which components are CRUs and which components are FRUs.
Page 216
3. Install new memory DIMMs, as described in “Installing a memory module” on page 245. See “Supported DIMMs” on page 4 for more information. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 217
Collect the dump for support and power off and power on the blade server. 5. If an A1xx SRC has not remained more than 40 minutes, call IBM Support. FSPSP16 Save any error log Contact IBM Support.
Page 218
Product Data (VPD) 2. Call IBM Support to find out what CRU the resource ID represents. table. 3. Replace the CRU that the resource ID represents. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 219
Reason code A45F 1. Set the enclosure feature code using SMS, which automatically resets the service processor. 2. If the problem persists, call IBM Support. If you do not see your reason code listed, call IBM Support. Chapter 2. Diagnostics...
Page 220
IBM Support. within the JTAG path. An error Contact IBM Support. FSPSP42 communicating between two system processors was detected. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 221
PSI link. FSPSP48 A diagnostics If the CRUs called out before this procedure do not fix the problem, function detects an contact IBM Support. external processor interface problem. FSPSP49 A diagnostic function If the CRUs called out before this procedure do not fix the problem, detects an internal contact IBM Support.
Page 222
Replace the system board and chassis assembly, as described in occurred between the “Replacing the FRU system-board and chassis assembly” on page 260. service processor and the network switch on the blade server. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 223
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Type 8406,” on page 229 to determine which components are CRUs and which components are FRUs.
Page 224
Replace the battery, as described in “Removing the battery” on page 250 Symbolic CRU time-of-day battery is and “Installing the battery” on page 251. low or failing. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Software problems Use this information to recognize software problem symptoms and to take corrective actions. v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Type 8406,” on page 229 to determine which components are CRUs and which components are FRUs.
BladeCenter unit. The LEDs will remain lit for as long as you press the switch, to a maximum of 25 seconds. Figure 6 on page 215 shows the locations of LEDs on the system board. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Figure 6. LED locations on the system board of the PS700 blade server Table 34 shows LED descriptions. Table 34. PS700 LEDs Callout Base unit LEDs 3V lithium battery LED DIMM 1-4 LEDs Management card LED Light path power LED System board LED HDD1 LED Interposer LED...
Page 228
“Removing and installing an P1-C11 PCIe I/O expansion card” on page 246. 3. Replace the I/O expansion option. If you are still having problems, see theServerProven Web site. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 229
1. Make sure that the I/O expansion option is card error supported, as described on the ServerProven Web site. See P1-C12 1Xe http://www.ibm.com/servers/eserver/ serverproven/compat/us/. 2. Reseat the I/O expansion option, as described in “Removing and installing an I/O expansion card” on page 246.
FC loc code:U78AF.001.startSN-P1-C35-L1-T1 Ports logged in 1 Flags a<LOGGED_IN,STRIP_MERGE> VFC client name fcs1 VFC client DRC:U9999.999.9999999-V4-C32-T1 In this example, the original serial number is represented as startSN. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
2. To save the location code information for each Fibre Channel adapter (fcs), enter the lsdev command as follows: lsdev -vpd | grep fcs | tee LSDEV_OUTPUT_before The output might look like the following example: fcs0 U78AF.001.startSN-P1-C35-L1-T1 Dual Port 8Gb FC Mezzanine Card (7710322577107501) fcs1 U78AF.001.startSN-P1-C35-L1-T2 Dual Port 8Gb FC Mezzanine Card (7710322577107501) In this example, the original serial number is represented as startSN.
3. Click the appropriate PS700 blade server in the list of blade servers in the BladeCenter unit. 4. Select Permanent to force the system to start from the PERM image. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
This function also displays which image the blade server used to start up. 1. Start the diagnostics program. See “Running the diagnostics program” on page 186. The online BladeCenter information center is available in the IBM BladeCenter Information Center at http://publib.boulder.ibm.com/infocenter/bladectr/documentation/index.jsp. Chapter 2. Diagnostics...
To check the general function of shared BladeCenter resources, complete the following operations. 1. Verify that the BladeCenter unit has the required power modules installed and is connected to a working power source. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
2. Verify that power management is set correctly for your BladeCenter unit configuration. 3. Verify whether the problem is being experienced on more than one blade server. 4. Perform a test of the failing function on a blade server that is known to be operational. 5.
Page 236
10. Replace the management module. See the online information center or the Problem Determination and Service Guide or the Hardware Maintenance Manual and Troubleshooting Guide for your BladeCenter unit. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
If these steps do not resolve the problem, it is likely a problem with the blade server. See “Universal Serial Bus (USB) port problems” on page 213 for more information. Solving shared network connection problems Problems with BladeCenter shared resources might appear to be in the blade server, but might actually be a problem in a BladeCenter unit network connection resource.
Problems with BladeCenter shared resources might appear to be in the blade server, but might actually be a problem in a BladeCenter unit video component. Some IBM monitors have their own self-tests. If you suspect a problem with the monitor, see the information that comes with the monitor for instructions for adjusting and testing the monitor.
7. Replace the monitor cable, if applicable. 8. Replace the monitor. 9. Replace the management module. See the online information center or the Problem Determination and Service Guide or the Hardware Maintenance Manual and Troubleshooting Guide for your BladeCenter unit. Solving undetermined problems When you are diagnosing a problem in the PS700 blade server, you must determine whether the problem is in the blade server or in the BladeCenter unit.
Calling IBM for service Call IBM for service after you collect as much as possible of the following information. Before calling for service, collect as much as possible of the following available information:...
Page 242
Tier 1 CRU at your request, you will be charged for the installation. v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for your blade server.
Page 243
(Tier 1) (Tier 2) code (FFC) Hard drive filler 40K5928 Service Label 46K5891 IBM FRU/CRU Label 46K5893 OEM IBM FRU/CRU Label 46K5894 Cover warning label 90P4799 Miscellaneous parts kit 32R2451 3.0V Battery 33F8354 RFID Tag for North America, Latin America, Asia...
Page 244
Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Tier 1 CRU at your request, you will be charged for the installation. v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for your blade server.
Returning a device or component If you are instructed to return a device or component, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Removing the blade server from a BladeCenter unit Remove the blade server from the BladeCenter unit to access options, connectors, and system-board indicators. Figure 8. Removing the blade server from the BladeCenter unit Attention: v To maintain proper system cooling, do not operate the BladeCenter unit without a blade server, expansion unit, or blade filler installed in each blade bay.
Reinstall a blade server in the same blade bay to preserve configuration information and update options that are established by blade bay. Reinstalling into a different blade bay can have unintended consequences, which might include re-configuring the blade server. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Removing and replacing Tier 1 CRUs Replacement of Tier 1 customer-replaceable units (CRUs) is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. The illustrations in this documentation might differ slightly from your hardware.
Page 250
Statement 21 CAUTION: Hazardous energy is present when the blade server is connected to the power source. Always replace the blade server cover before installing the blade server. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Installing and closing the blade server cover Install and close the cover of the blade server before you insert the blade server into the BladeCenter unit. Do not attempt to override this important protection. Figure 11. Installing the cover Statement 21 CAUTION: Hazardous energy is present when the blade server is connected to the power source.
8. If you are instructed to return the bezel assembly, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you. Installing the bezel assembly Install the bezel assembly. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Figure 13. Installing the bezel assembly 1. Connect the control-panel cable ( 1 in Figure 13) to the control-panel connector ( 2 ) on the system board. 2. Carefully slide the bezel assembly ( 4 ) onto the blade server until the two bezel-assembly releases ( 3 ) click into place in the bezel assembly.
Lift the drive 3 out of the drive tray. Installing a drive You can install a hard disk drive in drive tray. Figure 15 on page 243 shows how to install the disk drive. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 255
Figure 15. Installing a drive All drive connectors are on the same bus. If the two drives are both SAS hard disk drives, you can use them to implement and manage a redundant array of independent disks (RAID) level-1 array. See “Configuring a RAID array”...
Note: Install a DIMM filler in any location where a DIMM is not present to avoid machine damage. 8. If you are instructed to return the DIMM, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Qlogic 4Gb SFF Fibre Channel Expansion card (CIOv) See the ServerProven Web site for information about supported operating-system versions and all PS700 blade server optional devices. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Removing a CIOv form-factor expansion card You can remove a CIOv form-factor expansion card from the 1Xe connector. Figure 18. Removing a CIOv form factor expansion card from the 1Xe connector 1. Read the Safety topic and the “Installation guidelines” on page 233. 2.
Page 260
236. 9. Use the documentation that comes with the expansion card to install device drivers and to perform any configuration that the expansion card requires. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Removing a combination-form-factor expansion card Complete this procedure to remove a combination-form-factor expansion card. Figure 20. Removing a combination-form-factor expansion card 1. Read the Safety topic and the “Installation guidelines” on page 233. 2. Shut down the operating system, turn off the blade server, and remove the blade server from the BladeCenter unit.
2. Shut down the operating system, turn off the blade server, and remove the blade server from the BladeCenter unit. See “Removing the blade server from a BladeCenter unit” on page 235. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
When replacing the battery, you must replace it with a lithium battery of the same type from the same manufacturer. v To order replacement batteries, call 1-800-426-7378 within the United States, and 1-800-465-7999 or 1-800-465-6666 within Canada. Outside the U.S. and Canada, call your IBM marketing representative or authorized reseller. v After you replace the battery: 1.
CAUTION: When replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer. The battery contains lithium and can explode if not properly used, handled, or disposed of.
Figure 24. Removing the disk drive tray Perform the following procedure to remove the disk drive tray. 1. Read the Safety topic and the “Installation guidelines” on page 233. 2. Shut down the operating system, turn off the blade server, and remove the blade server from the BladeCenter unit.
Page 266
4. Install the blade server into the BladeCenter unit. See “Installing the blade server in a BladeCenter unit” on page 236. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Removing the tier 2 management card You can remove this tier 2 CRU yourself or request IBM to remove it, at no additional charge, under the type of warranty service that is designated for the blade server. Remove the management card to replace the card or to reuse the card in a new system board and chassis assembly.
Installing the tier 2 management card You can install this tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for the blade server. Use this procedure to install the management card into the currently installed system board.
7. Install and close the blade server cover. See “Installing and closing the blade server cover” on page 239. Statement 21 CAUTION: Hazardous energy is present when the blade server is connected to the power source. Always replace the blade server cover before installing the blade server. 8.
Page 270
In the Capacity on Demand window, select Advanced Functions from the Select On Demand Type list, and then select PowerVM. e. Click View Code Information. The following is an example of the PowerVM output: Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 271
3. Send a request for the VET activation code for your replacement management card to the System p Capacity on Demand mailbox at pcod@us.ibm.com. If the HMC or SDMC was used to get the VET information, include the following fields and their...
18 on page 262, the manual backup completed in step 3 is recommended as a precaution and best practice before system board replacement. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 273
4. Does the blade server have Fibre Channel adapters? “Save vfchost map data” on page 218. Then, continue with the next step. Continue with the next step. 5. Shut down the operating system, turn off the blade server, and remove the blade server from the BladeCenter unit.
Page 274
Continue with the next step. 20. Reset the system date and time through the operating system that you installed. For additional information, see the documentation for your operating system. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
PS700 blade server. Updating the firmware IBM periodically makes firmware updates available for you to install on the blade server, the management module, or expansion cards in the blade server. Important: To avoid problems and to maintain proper system performance, always verify that the blade server BIOS, service processor, and diagnostic firmware levels are consistent for all blade servers within the BladeCenter unit.
This utility is for advanced users of the IEEE 1275 specifications only. v Management module Use the management module to change the boot list, determine which firmware image to boot, and perform other configuration tasks. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Using the SMS utility Use the System Management Services (SMS) utility to perform a variety of configuration tasks on the PS700 blade server. Starting the SMS utility Start the SMS utility to configure the blade server. 1. Turn on or restart the blade server, and establish an SOL session with it. See the BladeCenter Management Module Command-Line Interface Reference Guide or the BladeCenter Serial-Over-LAN Setup Guide for more information.
Ethernet controller. See the operating-system device-driver documentation for information about configuring for failover. Important: To support failover on the blade server Ethernet controllers, the Ethernet switch modules in the BladeCenter unit must have identical configurations. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Blade server Ethernet controller enumeration The enumeration of the Ethernet controllers in a blade server is operating-system dependent. You can verify the Ethernet controller designations that a blade server uses through the operating-system settings. The routing of an Ethernet controller to a particular I/O-module bay depends on the type of blade server. You can verify which Ethernet controller is routed to which I/O-module bay by using the following test: 1.
528 bytes to 512 bytes. Updating IBM Director If you plan to use IBM Director to manage the blade server, you must check for the latest applicable IBM Director updates and interim fixes.
Page 281
d. Click BladeCenter PS700 to display the list of downloadable files for the blade server. Chapter 5. Configuring...
Page 282
Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the Web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
Class A Notices The following Class A statements apply to the IBM servers that contain the POWER7 processor and its features unless designated as electromagnetic compatibility (EMC) Class B in the feature information.
Page 286
Declaration: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may need to perform practical action. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)
Page 287
Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information: Electromagnetic Interference (EMI) Statement - Korea Appendix. Notices...
Page 288
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for any radio or television interference caused by unauthorized changes or modifications to this equipment.
Page 290
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards.
Page 291
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw.
PUBLICATIONS. THESE PUBLICATIONS ARE PROVIDED "AS-IS" AND WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO IMPLIED WARRANTIES OF MERCHANTABILITY, NON-INFRINGEMENT, AND FITNESS FOR A PARTICULAR PURPOSE. Power Systems: Problem Determination and Service Guide for the IBM Power PS700 (8406-70Y)