Page 6
Dynamic System Analysis diagnostic programs and Hardware service and support . 168 messages . . 108 IBM Taiwan product service . . 168 Running the diagnostic programs . 109 Diagnostic text messages . 110 Notices ....169 Viewing the test log .
Use the information in this section to help you identify potential unsafe conditions in an IBM product on which you are working. Each IBM product, as it was designed and manufactured, has required safety items to protect users and service technicians from injury.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not been removed or tampered with. Guidelines for servicing electrical equipment Observe the following guidelines when servicing electrical equipment: v Check the area for electrical hazards such as moist floors, nongrounded power extension cords, and missing safety grounds.
Safety statements Important: Each caution and danger statement in this documentation begins with a number. This number is used to cross reference an English-language caution or danger statement with translated versions of the caution or danger statement in the Safety Information document.
Page 11
Statement 2 CAUTION: When replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer.
Page 12
BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for your server.
To check for updated documentation and technical updates, complete the following steps. Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document. 1. Go to http://www.ibm.com/bladecenter/.
USB flash device, such as dual-channel DIMMs, supporting drives the IBM USB key for VMware ESXi up to 64 GB of total memory. v One internal USB connector for a supported USB flash device, such...
Page 16
2. The following illustration shows a single-width type of blade server. The MPE also interacts with the control panel on the blade server. Activity LED Location LED KVM select button Information LED Blade-error LED Media-tray select button Power-control button Power-on LED KVM select button: Press this button to associate the shared BladeCenter unit keyboard, video, and mouse port (KVM) with the blade server.
When this LED is lit, the location LED on the BladeCenter unit is lit also. The location LED can be turned off through the management-module Web interface or through IBM Director Console. Information LED: When this amber LED is lit, it indicates that information about a system error for the blade server has been placed in the management-module event log.
– Through the management-module Web interface, you can also configure the management module to turn off the blade server. For additional information, see the IBM BladeCenter Management Module User’s Guide. System board layouts The following illustrations show the connectors, LEDs, and switches on the blade server system board and the multiprocessor expansion (MPE) unit.
System-board connectors The following illustration shows the connectors on the blade server system board. The following illustration shows the connectors on the MPE unit. Chapter 1. Introduction...
System-board switches The following illustration shows the switches on the blade server system board. The following table defines the function of each switch on the blade server system board. Switch number Description Light path diagnostics switch: Press and hold to relight the system-board LEDs that were lit before you removed the blade server from the BladeCenter unit.
The following table defines the function of the light path diagnostics switch on the MPE unit. Switch number Description Light path diagnostics switch – Press and hold to relight the MPE-unit LEDs that were lit before you removed the blade server from the BladeCenter unit.
Page 22
NMI error LED Microprocessor mismatch error LED SBRD System-board error LED TEMP Over temperature error LED Light path diagnostics LED Light path diagnostics switch The following illustration shows the LEDs on the MPE unit. To see these LEDs, remove the blade server from the BladeCenter unit, open the cover, and press the light path diagnostics switch to light any error LEDs that were lit during processing.
The IBM Remote Deployment Manager (RDM) Version 4.4 program is available for purchase. You can use IBM RDM to install a BIOS code update onto a blade server by following the instructions in the documentation that comes with the RDM program.
Setup Utility program menu choices The following choices are on the Setup Utility main menu. Depending on the version of the BIOS code in the blade server, some menu choices might differ slightly from these descriptions. v System Summary Select this choice to display configuration information, including the type, speed, and cache sizes of the microprocessors and the amount of installed memory.
Page 25
Select this choice to view or change the start options. Changes in the start options take effect when you start the blade server. – Startup Sequence Options Select this choice to view or change the startup device sequence that is set for the blade server.
Web site at http://www.ibm.com/systems/management/serverguide/ sub.html. To download the free image, click IBM Service and Support Site. Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is in this document. The ServerGuide program has the following features to make setup easier:...
When you use the ServerGuide Setup and Installation CD, you do not need setup diskettes. You can use the CD to configure any supported IBM blade server model. The setup program provides a list of tasks that are required to set up the blade server.
IBM Web site. Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document. 1. Go to http://www.ibm.com/systems/support/.
Gigabit Ethernet Software CD that comes with the blade server. To find updated information about configuring the controller, complete the following steps. Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
Two SAS drives can be used to implement and manage RAID level-0 (striping) or RAID level-1 (mirror) arrays in operating systems that are listed at http://www.ibm.com/servers/eserver/serverproven/compat/us/. For more information, see the Installation and User’s Guide. Important: You must create the RAID array before you install the operating system on the blade server.
Type 7901 and LS42 Type 7902 blade server. For an updated parts listing on the Web, complete the following steps. Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
Page 32
Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for your server.
Page 33
Table 1. Customer and field replaceable units, Type 7901 and Type 7902 CRU part CRU part FRU part Index Description number number number (Tier 1) (Tier 2) Cover (all models) 46C7201 MPE board assembly (type 7902 models 3Qx, CQx, EDx, EMx, 46M6832 EUx, HVx, JVx) Bezel, MPE unit (type 7902 models 3Qx, CQx, EUx, EMx, HVx,...
Page 34
IBM ServeRAID 8k-l SAS Controller expansion card (option) 25R8079 IBM ServeRAID 8k SAS Controller expansion card (option) 25R8076 IBM USB key for VMware ESXi (type 7901 model CTO) (type 7902 42D0545 model HVx) InfiniBand 4X DDR CFF expansion (option) 43W4425...
Page 35
Table 1. Customer and field replaceable units, Type 7901 and Type 7902 (continued) CRU part CRU part FRU part Index Description number number number (Tier 1) (Tier 2) Kit, miscellaneous parts (all models) 44X2229 v 2 socket alignment pins v 1 CPU socket protective cover v 4 HSDC support pins v 2 HSDC support studs v 2 SFF HSDC brackets...
Page 36
Windows Server 2008 Enterprise Edition 32/64 Bit (1-8 Processors), 49Y0897 Traditional Chinese Consumable parts are not covered by the IBM Statement of Limited Warranty. The following consumable parts are available for purchase from the retail store. Table 2. Consumable parts, Type 7901 and Type 7902...
Tier 1 customer replaceable unit (CRU): Replacement of Tier 1 CRUs is your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation. v Tier 2 customer replaceable unit: You may install a Tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for your server.
When you are finished working on the blade server, reinstall all safety shields, guards, labels, and ground wires. v For a list of supported optional devices for the blade server, see http://www.ibm.com/servers/eserver/serverproven/compat/us/. System reliability guidelines To help ensure proper cooling and system reliability, make sure that the following...
v Remove the device from its package and install it directly into the blade server without setting it down. If it is necessary to set down the device, put it back into its static-protective package. Do not place the device on the blade server cover or on a metal surface.
Attention: Wait at least 30 seconds, until the hard disk drives stop spinning, before you proceed to the next step. The activity LED on the control panel turns off when the hard disk drive has stopped spinning (see “Blade server controls and LEDs”...
Installation and User’s Guide. Removing and replacing consumable parts and Tier 1 CRUs Replacement of consumable parts and Tier 1 CRUs are your responsibility. If IBM installs a Tier 1 CRU at your request, you will be charged for the installation.
To remove the blade server cover, complete the following steps. 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit”...
CAUTION: Hazardous energy is present when the blade server is connected to the power source. Always replace the blade cover before installing the blade server. Attention: You cannot insert the blade server into the BladeCenter unit until the cover is installed and closed. Do not attempt to override this protection. To install the blade server cover, complete the following steps.
1. Read the safety information beginning on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit” on page 27). 3.
To install an expansion unit, complete the following steps. 1. Remove the cover from the blade expansion connector, if one is installed. Attention: If a high-speed expansion card is installed in the blade server system board, you cannot install an expansion unit until the high-speed expansion card is removed.
To remove the bezel assembly, complete the following steps. 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. Remove the blade server cover (see “Removing the blade server cover” on page 29).
To install the bezel assembly, complete the following steps. 1. If you are installing the bezel assembly on a blade server, connect the control-panel cable to the control-panel connector. If you are installing a bezel assembly on an MPE unit, go to step 2. Note: Align the control-panel cable with the notch in the air baffle on the blade server in front of the control-panel connector.
To remove a SAS drive, complete the following steps. 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit”...
3. Install the MPE unit or expansion unit, if you removed one while you removed the SAS drive (see “Installing an expansion unit” on page 32). 4. Install the blade server cover (see “Installing the blade server cover” on page 30).
Installing a memory module Note: If you are replacing a faulty DIMM, make sure that the replacement DIMM is the correct type of memory. For each installed microprocessor, a set of four DIMM connector is enabled (see the following table for more information). If you are adding DIMMs, install the DIMMs in pairs into the DIMM connectors that are enabled.
To install a DIMM, complete the following steps. 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. Read the documentation that comes with the DIMM. 3. Locate the DIMM connectors (see “System-board connectors” on page 7). Determine the connectors into which you will install the DIMMs.
Page 52
Removing a small-form-factor expansion card The following illustration shows how to remove a small-form-factor expansion card from the blade server; removing a small-form-factor expansion card from the MPE unit is similar. To remove a small-form-factor expansion card, complete the following steps. 1.
Page 53
To remove a standard-form-factor expansion card, complete the following steps. 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit”...
Page 54
2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit” on page 27). 3. Remove the blade server cover (see “Removing the blade server cover” on page 29). 4.
a. Locate the CFFv card in the I/O-expansion card connector (see “System-board connectors” on page 7). b. Pull up on the CFFv card to remove it from the expansion connector; then, lift the card out of the blade server. 6. If you are instructed to return the expansion card, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you.
Page 57
To install a small-form-factor expansion card, complete the following steps. 1. Touch the static-protective package that contains the expansion card to any unpainted metal surface on the BladeCenter unit or any unpainted metal surface on any other grounded rack component; then, remove the expansion card from the package.
Page 58
1. Touch the static-protective package that contains the expansion card to any unpainted metal surface on the BladeCenter unit or any unpainted metal surface on any other grounded rack component; then, remove the expansion card from the package. 2. Orient the expansion card over the expansion card connectors. 3.
Page 59
6. Install the blade server into the BladeCenter unit (see “Installing the blade server in a BladeCenter unit” on page 28). Installing a combination-form-factor expansion card The combination-form-factor expansion card consists of two parts, the combination-form-factor vertical (CFFv) card and the combination-form-factor horizontal (CFFh) card.
To order replacement batteries, call 1-800-426-7378 within the United States, and 1-800-465-7999 or 1-800-465-6666 within Canada. Outside the U.S. and Canada, call your IBM marketing representative or authorized reseller. v After you replace the battery, you must reconfigure the server and reset the system date and time.
CAUTION: When replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer. The battery contains lithium and can explode if not properly used, handled, or disposed of.
To remove the LS22 optimizer card, complete the following steps: 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25 2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit”...
4. Press the LS22 optimizer card down into the socket until it clicks into place. 5. Install the blade server cover (see “Installing the blade server cover” on page 30). 6. Install the blade server into the BladeCenter unit (see “Installing the blade server in a BladeCenter unit”...
To install a concurrent KVM feature card, complete the following steps: 1. Touch the static-protective package that contains the expansion card to any unpainted metal surface on the BladeCenter unit or any unpainted metal surface on any other grounded rack component; then, remove the concurrent KVM feature card from the package.
To remove a USB flash drive, complete the following steps: 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit”...
BladeCenter unit” on page 28). Removing and replacing Tier 2 CRUs You may install a Tier 2 CRU yourself or request IBM to install it, at no additional charge, under the type of warranty service that is designated for the server.
7. If you are instructed to return the drive tray, follow all packaging instructions, and use any packaging materials for shipping that are supplied to you. Installing the SAS drive tray The following illustration shows how to install the SAS drive tray in the blade server;...
To remove an expansion-card bracket, complete the following steps. 1. Read the safety information that begins on page “Safety statements” on page viii and “Installation guidelines” on page 25. 2. If the blade server is installed in a BladeCenter unit, remove it (see “Removing the blade server from a BladeCenter unit”...
To install an expansion-card bracket, complete the following steps: 1. Remove the expansion-card bracket from the packaging. 2. Locate SAS connector 1 (see “System-board connectors” on page 7). 3. Orient the expansion-card bracket so that the screw holes in the bracket are lined up with the two screw holes in front of SAS connector 1 and closest to the side of the blade server.
Page 70
2. In a dual-microprocessor blade server or MPE unit, the microprocessor and the heat sink are a matched set. First transfer the heat sink and microprocessor from one socket to the new system board; then, transfer the other heat sink and microprocessor.
7. Remove the heat sink. Attention: Do not touch the thermal material on the bottom of the heat sink. Touching the thermal material will contaminate it. If the thermal material on the microprocessor or heat sink becomes contaminated, you must replace the heat sink.
Page 72
To install a microprocessor and heat sink, complete the following steps. Attention: Do not use any tools or sharp objects to lift the locking lever on the microprocessor socket. Doing so might result in permanent damage to the system board. 1.
Page 73
Attention: v Do not press the microprocessor into the socket. v Make sure that the microprocessor is oriented and aligned correctly in the socket before you try to close the microprocessor retainer. 5. Center the microprocessor over the microprocessor socket. Align the notches on both sides of the microprocessor with the notches in the socket and carefully place the microprocessor into the socket.
Removing the system-board assembly This section describes how to remove components from the blade server when the system-board assembly is being replaced; the procedure for removing components from the MPE unit when the MPE board assembly is being replaced is similar. When replacing the system board, you will replace the system board and blade base as one assembly.
Page 75
Note: You might have to remove component fillers or covers before you can install some components (see the Installation and User’s Guide for more information). v Microprocessors and heat sinks. See “Installing a microprocessor and heat sink” on page 59. v DIMMs.
Page 76
BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
If POST detects a problem, more than one beep might sound, or an error message is displayed. See “Beep code descriptions” and “Dynamic System Analysis diagnostic programs and messages” on page 108 for more information. POST beep codes A beep code is a combination of short or long beeps or a series of short beeps that are separated by pauses.
Page 79
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 80
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 81
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 82
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 83
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Management Module User’s Guide or the Management Module Command-Line Interface Reference Guide. The most recent versions of all BladeCenter documentation are at http://www.ibm.com/bladecenter/. The BMC log is limited in size. When the log is full, new entries will not overwrite existing entries;...
Entries that are written to the BMC log during the early phase of POST show an incorrect date and time as the default time stamp; however, the date and time are corrected as POST continues. Each BMC log entry is displayed on its own page. To display all the data for an entry, use the Up Arrow (↑) and Down Arrow (↓) keys or the Page Up and Page Down keys.
Page 86
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 87
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 88
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 89
If the firmware is at latest level, the blade server device is not supported by the BladeCenter unit in which it is installed (see the IBM BladeCenter Management Module User’s Guide for more information). Firmware (BIOS) ROM corruption detected Update the BIOS code (see “Firmware updates”...
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 91
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 92
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 93
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 94
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 95
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 96
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 97
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 98
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 99
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
v When you run the diagnostic programs, a single problem might cause more than one error message. When this happens, correct the cause of the first error message. The other error messages usually will not occur the next time you run the diagnostic programs.
If you have just added new software or a new optional device and the blade server is not working, complete the following steps before you use the troubleshooting tables: 1. Remove the software or device that you just added. 2. Run the diagnostic tests to determine whether the blade server is running correctly.
Intermittent problems v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Memory problems v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Microprocessor problems v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
An IBM optional device that 1. Make sure that: was just installed does not v The device is designed for the blade server (see http://www.ibm.com/ work. servers/eserver/serverproven/compat/us/). v You followed the installation instructions that came with the device and the device is installed correctly.
Page 108
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 109
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Power problems v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 111
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Page 113
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Make sure that the blade server supports the operating system (see The operating system cannot be http://www.ibm.com/servers/eserver/serverproven/compat/us/nos/ installed; the option is not ematrix.shtml). If it does, either no logical drive is defined (SAS RAID servers), or available.
Software problems v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
After you remove the blade server, you can press and hold the light path diagnostics switch for a maximum of 25 seconds to light the LEDs and locate the failing component. The following components have this feature: v Hard disk drives v Light path diagnostics panel v Microprocessors v Memory modules (DIMMs)
Page 117
The following illustration shows the LEDs on the blade server system-board light path diagnostics panel. NMI error LED Microprocessor mismatch error LED SBRD System-board error LED TEMP Over temperature error LED Light path diagnostics LED Light path diagnostics switch The following illustration shows the locations of the error LEDs and light path diagnostics panel on the MPE unit.
The following illustration shows the LEDs on the MPE unit light path diagnostics panel. Microprocessor mismatch error LED SBRD MPE-board error LED LP 1 Light path diagnostics LED (for blade server) Light path diagnostics LED (for MPE) Light path diagnostics switch When you press the light path diagnostics switch, note which LEDs are lit on the blade server system board, MPE unit, and light path diagnostics panels.
Page 119
v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Chapter 3, “Parts listing, Types 7901 and 7902,” on page 19 to determine which components are CRUs and which components are FRUs.
Dynamic System Analysis diagnostic programs and messages IBM Dynamic System Analysis (DSA) Preboot is a system information collection and analysis tool that is used by IBM service and support personnel to aid in the diagnosis of system problems. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
You can output all the collected information as a compressed XML file that you can send to IBM service and support, view the information locally through a generated text report file, or copy the generated HTML pages to removable media and view them in a Web browser.
2. Turn on all attached devices; then, turn on the blade server (see “Turning on the blade server” on page 5). 3. When the prompt Press F2 for Dynamic System Analysis (DSA) is displayed, press F2. If you have set both a power-on password and an administrator password, you must type the administrator password to run the diagnostic programs.
From the System Information page in the DSA graphical user interface, select DSA Error Log. You can send the DSA error log file to IBM service and support to aid in diagnosing the server problems or you can use the DSA CLI copy command to copy the log to an external USB device.
Test program error. 2. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 3. Run the test again. 4. Make sure that the system firmware is at the latest level.
Page 125
59). 8. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. 089–803–000...
Page 126
59). 9. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Page 127
28). 2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level.
Page 128
28). 2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level.
Page 129
28). 2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level.
Page 130
2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level. The installed firmware...
Page 131
28). 2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level.
Page 132
28). 2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level.
Page 133
BladeCenter unit” on page 28). 2. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 3. Make sure that the BMC firmware is at the latest level.
Page 134
28). 2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level.
Page 135
2. Run the test again. bus. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level. The installed firmware...
Page 136
28). 2. Run the test again. 3. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the BMC firmware is at the latest level.
Page 137
SMBIOS 2. Make sure that the DSA code is at the structure data latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 3. Make sure that the system BIOS code is at the latest level. The installed...
Page 138
(see “Using the Setup Utility program” on page 11). 6. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 7. Run the test again. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Page 139
3. Make sure that the DSA code is at the reporting latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the system BIOS code is at the latest level. The installed...
Page 140
3. Make sure that the DSA code is at the scrubbing. latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the system BIOS code is at the latest level. The installed...
Page 141
Quick Memory 3. Make sure that the DSA code is at the Menu Item latest level. For the latest level of DSA selection code, go to http://www.ibm.com/ problem. support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the system BIOS code is at the latest level. The installed...
Page 142
3. Make sure that the DSA code is at the in the restricted latest level. For the latest level of DSA area of code, go to http://www.ibm.com/ memory. support/docview.wss?uid=psg1SERV- DSA. 4. Make sure that the system BIOS code is at the latest level.
Page 143
DSA event log in the Firmware/VPD section for this component. For the latest level of firmware, go to http://www.ibm.com/support/ docview.wss?uid=psg1 MIGR-4JTS2T and select your system to display a matrix of available firmware. 5. Run the test again.
Page 144
MASK event log in the Firmware/VPD section registers. for this component. For the latest level of firmware, go to http://www.ibm.com/support/ docview.wss?uid=psg1 MIGR-4JTS2T and select your system to display a matrix of available firmware. 3. Run the test again. 4. See “Solving undetermined problems”...
Page 145
DSA event log in the Firmware/VPD section for this component. For the latest level of firmware, go to http://www.ibm.com/support/ docview.wss?uid=psg1 MIGR-4JTS2T and select your system to display a matrix of available firmware. 5. Run the test again.
Page 146
MTRR 3. Make sure that the DSA code is at the functions, latest level. For the latest level of DSA cannot uncache code, go to http://www.ibm.com/ available support/docview.wss?uid=psg1SERV- memory space. DSA. 4. Make sure that the system BIOS code is at the latest level.
Page 147
Memory test 1. Make sure that the DSA code is at the Test failed: latest level. For the latest level of DSA single-bit error code, go to http://www.ibm.com/ in DIMMs: support/docview.wss?uid=psg1SERV- PnDimmx or DSA. multiple-bit 2. Make sure that the system BIOS code error in is at the latest level.
Page 148
Stress Test program error. 2. Make sure that the DSA code is at the latest level. For the latest level of DSA code, go to http://www.ibm.com/ support/docview.wss?uid=psg1SERV- DSA. 3. Make sure that the system BIOS code is at the latest level. The installed...
Page 149
3. Make sure that the DSA code is at the run the test. At latest level. For the latest level of DSA least 1 GB of code, go to http://www.ibm.com/ memory is support/docview.wss?uid=psg1SERV- required to run DSA. the test.
Page 150
BladeCenter unit). 10. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Page 151
11. Replace the CD or DVD drive. 12. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL.
Page 152
7. Replace the CD or DVD drive. 8. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL.
Page 153
4. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. Chapter 5. Diagnostics...
Page 154
4. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Page 155
4. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. Chapter 5. Diagnostics...
Page 156
5. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Page 157
5. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. 405-905-xxx...
Page 158
5. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Page 159
4. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. Chapter 5. Diagnostics...
Page 160
4. If the failure remains, collect the data from the DSA event log and send it to IBM Service. For information about contacting and sending data to IBM Service, see http://www.ibm.com/ support/ docview.wss?uid=psg1SERVCALL. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
DSA event log in the Firmware/VPD section for this component. For the latest level of firmware, go to http://www.ibm.com/ support/docview.wss?uid=psg1 MIGR-4JTS2T and select your system to display a matrix of available firmware.
Page 162
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
Page 163
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
Page 164
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
Page 165
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
Page 166
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
Page 167
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
Page 168
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
Page 169
Table 5. Additional diagnostic error codes (continued) v Follow the suggested actions in the order in which they are listed in the Action column until the problem is solved. v See Table 1 on page 21 to determine which components are customer replaceable units (CRU) and which components are field replaceable units (FRU).
To recover the BIOS code and restore the blade server operation to the primary page, complete the following steps. Note: Changes are made periodically to the IBM Web site. The actual procedure might vary slightly from what is described in this document.
Page 171
If that procedure fails, the blade server might not restart correctly or might not display video. To manually restore the BIOS code, complete the following steps: 1. Read the safety information that begins on page “Safety statements” on page viii and “Handling static-sensitive devices” on page 26. 2.
Service processor (BMC) error codes The baseboard management controller (BMC) log contains up to 512 of the most recent service processor errors, in IPMI format. These messages are a combination of plain text and error-code numbers. You can view the BMC log from the Setup Utility menu by selecting Advanced Setup →...
v The KVM select button LED on the front of the blade server is lit, indicating that the blade server is connected to the shared keyboard and mouse (see “Blade server controls and LEDs” on page 3). v The keyboard or mouse cable is securely connected to the active BladeCenter management-module.
Page 174
d. Continue with step 7. 4. For problems that affect only the diskette drive, complete the following steps: a. Continue with step 6. 5. For problems that affect only the CD or DVD drive, complete the following steps: a. Make sure that the software program is working properly. b.
Network connection problems To check for network connection problems, complete the following steps until the problem is solved: 1. Make sure that the following conditions are met: v The network cables are securely connected to the I/O-module. v Power configuration of the BladeCenter unit supports the I/O module configuration.
Non-IBM monitor cables might cause unpredictable problems. v The monitor works with another blade server. Some IBM monitors have their own self-tests. If you suspect a problem with the monitor, see the information that comes with the monitor for instructions for adjusting and testing the monitor.
Page 177
Damaged data in CMOS memory or damaged BIOS code can cause undetermined problems. To reset the CMOS data, remove and replace the battery to override the power-on password and clear the CMOS memory; see “Removing the battery” on page 48 and “Installing the battery” on page 48. If you suspect that the BIOS code is damaged, see “Recovering from a BIOS update failure”...
Because of the variety of hardware and software combinations that you can encounter, use the following information to assist you in problem determination. If possible, have this information available when you request assistance from IBM. v Machine type and model...
If you need help, service, or technical assistance or just want more information about IBM products, you will find a wide variety of sources available from IBM to assist you. This section contains information about where to go for additional information about IBM and IBM products, what to do if you experience a problem with your system, and whom to call for service, if it is necessary.
Getting help and information from the World Wide Web On the World Wide Web, the IBM Web site has up-to-date information about IBM ® systems, optional devices, services, and support. The address for IBM System x ® and xSeries information is http://www.ibm.com/systems/x/. The address for ®...
IBM makes no representations or warranties with respect to non-IBM products. Support (if any) for the non-IBM products is provided by the third party, not IBM. BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
(TI) que reciclen responsablemente sus equipos cuando éstos ya no les sean útiles. IBM dispone de una serie de programas y servicios de devolución de productos en varios países, a fin de ayudar a los propietarios de equipos a reciclar sus productos de TI.
In the United States, IBM has established a return process for reuse, recycling, or proper disposal of used IBM sealed lead acid, nickel cadmium, nickel metal hydride, and battery packs from IBM equipment. For information on proper disposal of these batteries, contact IBM at 1-800-426-4333.
For proper collection and treatment, contact your local IBM representative. This notice is provided in accordance with Royal Decree 106/2008 of Spain: The retail price of batteries, accumulators, and power cells includes the cost of the environmental management of their waste.
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment.
Attention: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user may be required to take adequate measures. European Community contact: IBM Technical Regulations Pascalstr. 100, Stuttgart, Germany 70569 Telephone: 0049 (0)711 785 1176 Fax: 0049 (0)711 785 1283 E-mail: tjahn@de.ibm.com...
Page 190
installing memory stress test 136 problems (continued) battery 48 messages ServerGuide 102 bezel assembly 33 diagnostic 108 service processor 102 blade server 28 microprocessor software 103 blade server cover 30 heat sink 59 undetermined 164, 166 cKVM card 51 installing 59 USB port 103 combination-form-factor expansion problems 93...
Page 191
start options 12 starting the blade server 5 statements and notices 2 static electricity 26 static-sensitive devices, handling 26 stopping the blade server 6 support, web site 167 switches, system board 8 system board switches 8 system board layouts 6 system board switches description 8 system board, blade server...
Page 192
BladeCenter LS22 Type 7901 and LS42 Type 7902: Problem Determination and Service Guide...
Page 194
Part Number: 49Y0107 Printed in USA (1P) P/N: 49Y0107...