Summary of Contents for IBM TotalStorage NAS Gateway 500
Page 1
IBM TotalStorage NAS Gateway 500 Service Guide GY27-0418-00...
Page 3
IBM TotalStorage NAS Gateway 500 Service Guide GY27-0418-00...
Page 4
You can also submit comments by selecting Feedback at www.ibm.com/storage/support. When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you.
Page 10
Getting customer support and service ....390 Getting help online: www.ibm.com/storage/support ..390 Getting help by telephone ....391 Appendix B.
About this manual ® ® This manual describes the service procedures for the IBM TotalStorage NAS Gateway 500, hereafter referred to as the NAS Gateway 500. Who should read this manual This manual is written for trained service personnel. It can also be used by NAS Gateway 500 customers with root access to replace customer replaceable units (CRUs) (for example, adapters).
Page 12
Table 1. IBM TotalStorage NAS Gateway 500 information library as it supports common user tasks Title User tasks Planning Hardware Software Configuration Operation and Diagnosis, installation installation administration problem determination, and service IBM TotalStorage NAS Gateway 500 Planning Guide, GA27-4335...
This publication identifies the Web site location (www.ibm.com/servers/storage/support/) for the NAS Gateway 500 Release Notes v IBM TotalStorage NAS Gateway 500 Statement of Limited Warranty, GX27-4024 This publication lists the warranty and translations of the warranty for the IBM TotalStorage NAS Gateway 500. Softcopy publications Additional technical publications are provided in PDF format on the NAS Gateway 500 documentation CD-ROM and at www.ibm.com/servers/storage/support/.
RS/6000 eServer pSeries Diagnostic Information for Multiple Bus Systems, SA38-0509 This manual contains diagnostic information, service request numbers (SRNs), and failing function codes (FFCs). It is intended to supplement the service information found in the IBM TotalStorage NAS Gateway 500 Service Guide.
Accessibility The softcopy version of this manual and the other publications in the NAS Gateway 500 library are accessibility-enabled for the IBM Home Page Reader. Web sites This section lists the Web sites where additional technical information is found. Be sure to visit the support page that is specific to your hardware.
Safety notices A danger notice indicates the presence of a hazard that has the potential of causing death or serious personal injury. Danger notices appear on the following pages: v xvi v 58 v 59 v 307 v 351 A caution notice indicates the presence of a hazard that has the potential of causing moderate or minor personal injury.
CAUTION: All IBM laser modules are designed so that there is never any human access to laser radiation above a class 1 level during normal operation, user maintenance, or prescribed service conditions. Data processing environments can contain equipment transmitting on system links with laser modules that operate at greater than class 1 power levels.
Chapter 1. Reference information This chapter provides an overview of the system, including a logical description and a physical overview. The following additional details pertaining to the system are also provided: v System features v Powering the system on and off v Console strategy v Power-on self-test v POST indicators...
System features This section describes the features of the NAS Gateway 500. Bus architecture The NAS Gateway 500 has six PCI-X slots available. The slots are 64-bit capable at 133 MHz, 3.3 volts. All slots are capable of using the full-length PCI-X adapters. The PCI-X slots are numbered on the back of the chassis, from left to right.
Media drives Typical system configuration consists of two media bays. v Media bay 1 has an IDE CD-ROM. v Media bay 2 has a diskette drive. Hot-pluggable disk drives Four hot-plug disk-drive bays with a maximum of two bays used for the operating system and operating system mirroring feature, using only 36.4 GB Ultra3 10K RPM 1-inch drives.
If you decide to disconnect clustering, you must reset the service processor using the pinhole reset switch before running another application. The service processor pinhole reset switch is located on the operator panel. 2. If you are using both serial port S2 (for example, to support the call home function) and serial port S3 (for clustering) and you require an uninterruptible power supply as well, you must order the uninterruptible power supply with an Ethernet adapter and connect it to the Service/Management LAN.
Front view of the NAS Gateway 500 (with bezel off) 1 Operator panel 2 Front serial connector 3 IDE CD-ROM drive 4 Diskette drive 5 Hot-plug disk drive for operating system 6 Hot-plug disk drive for optional OS mirroring feature Back view of the NAS Gateway 500 1 PCI-X slots (64-bit) 133 MHz 2 Power supply 2 cable receptacle...
Powering the system on and off For procedures to power off and power on the system for various system configurations and operating systems, see “Stopping the system” on page 310 and “Starting the system” on page 311. Console strategy The NAS Gateway 500 is a headless system (that is, without a display interface permanently attached or part of the normal NAS Gateway 500 configuration).
POST indicators POST indicators indicate tests that are being performed as the system is preparing to boot the operating system. The POST indicators are words displayed on the system console. Each time that the system starts a different step in the POST, a POST indicator word appears on the console. Each word is an indicator of the tests that are being performed.
System unit locations Power supply locations 1 Power supply #1 3 AC good LED (green) 2 Power supply #2 4 DC good LED (green) 5 Fault LED (amber) NAS Gateway 500 Service Guide...
Fan locations The following illustration identifies the system cooling fans. The fans have an amber LED located on top of the housing. A lit amber LED indicates that the fan is not operating correctly. The power supplies have two groups of three LEDs located on the front and top. The amber LED on the power supply indicates a problem with the power supply.
CEC backplane locations The following illustration of the CEC backplane identifies the primary connectors used in your system. 1 CEC backplane 17 Serial port #3 connector 2 Power supply #1 connectors 18 Mouse (not supported) 3 Power supply #2 connectors 19 Keyboard (not supported) 4 SPCN1 connector (not supported) 20 Rack-indicator connector (not supported)
PCI riser card locations The slots are 64-bit capable at 133 MHz, 3.3 volts. The PCI-X slots are numbered on the back of the chassis, from left to right. The PCI-X riser card has two integrated SCSI controllers. One SCSI controller (PCI, 64-bit) is a SYM53C1010 that operates at 66 MHz and has dual low voltage differential (LVD) ports.
DIMM locations The dual in-line memory modules (DIMMs) used in the NAS Gateway 500 are located on the processor card. Your system contains either one or two processor cards, each card with eight DIMM slots. Associated with each DIMM slot is an LED, which is located on the edge of the processor card and is visible from the top of the processor book.
Page 33
The following processor card illustration shows the DIMM slot locations, as well as the LED location associated with each DIMM slot. Table 6. Fault LEDs for processor book and memory module slots (1-8) Number Description Location code M1 Memory slot 1 (J2A) U0.1-P1-C1-M1 M2 Memory slot 2 (J2B) U0.1-P1-C1-M2...
Reading the operator panel and system LEDs The following illustration shows the component location on the operator panel. 3 4 5 1 Power-on button 6 Operator-panel display 2 Power LED 7 (FS1) front serial connector (RJ-48 connector) 3 Attention LED 8 Service processor reset switch (pinhole) 4 SCSI port activity 9 System reset button...
SCSI IDs and bay locations Note: To show the disk drive and disk drive bay locations, the NAS Gateway 500 is shown with its bezel removed. Index Bay location Drive name SCSI ID Operator panel IDE CD-ROM IDE (Non-SCSI) Diskette drive SCSI ID 0 D10 (base OS) Disk drive...
Location codes This system unit uses physical location codes in conjunction with AIX location codes to provide mapping of the failing field replaceable units. The location codes are produced by the system unit’s firmware and AIX. Physical location codes Physical location codes provide a mapping of logical functions in a platform (or expansion sites for logical functions, such as connectors or ports) to their specific locations within the physical structure of the platform.
Page 38
For PCI adapters where x is equal to or greater than 1. The x and y are characters in the range of 0-9, A-H, J-N, P-Z (O, I, and lowercase are omitted) and are equal to the parent bus’s ‘ibm, aix-location’ open firmware property.
Page 39
v Processor card C2 attached to CEC backplane P1: U0.1-P1-C2 v DIMM in slot 2 of processor card C1 attached to CEC backplane P1: U0.1-P1-C1-M2 Examples of AIX location codes displayed are as follows: v Integrated PCI-X adapter: 10-80 Ethernet 10-60 Integrated SCSI port 1 (internal) 10-88...
Page 40
Any identify LED in the system can also be flashed by using the ″Identify and Attention Indicators″ task through diagnostics. The procedure to operate the ″Identify and Attention Indicators″ task in diagnostics is outlined in the RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems. NAS Gateway 500 Service Guide...
Mapping AIX and physical location codes Use the following illustration to aid you in the mapping of a location code to its connector on the system backplane. Physical Physical Location Location Location Location Codes Codes Codes Codes U0.1-V1 U0.1-F1 U0.1-L1 U0.1-F2 U0.1-V2 U0.1-P1-C1...
Page 42
Use the following illustration to aid you in the mapping of a location code to its DIMM slot on the processor cards. Physical Location Codes U0.1-P1-C1-M1 U0.1-P1-C1-M2 Processor Card 1 U0.1-P1-C1-M3 U0.1-P1-C1-M4 U0.1-P1-C1-M5 U0.1-P1-C1-M6 U0.1-P1-C1-M7 U0.1-P1-C1-M8 Physical Location Codes U0.1-P1-C2-M1 U0.1-P1-C2-M2 Processor Card 2 U0.1-P1-C2-M3...
Use the following illustration to aid you in the mapping of a location code to its slot on the PCI riser card. PCI riser card location codes Physical Location Location Codes Codes 11-08 U0.1-P2/I6 1D-08 U0.1-P2/I5 1Z-08 U0.1-P2/I4 1n-08 U0.1-P2/I3 1H-08 U0.1-P2/I2 1V-08...
AIX and physical location code table Note: The NAS Gateway 500 does not support all listed components. Component name Physical location code AIX location code Central electronics complex (CEC) Rack frame CEC drawer U0.1 CEC backplane U0.1-P1 PCI riser card U0.1-P2 Disk drive backplane U0.1-P3...
Page 46
Component name Physical location code AIX location code Hot-swap disk drive bay 1 U0.1-P2/Z1-A8 1S-08-00-8,0 Hot-swap disk drive bay 2 U0.1-P2/Z1-A9 1S-08-00-9,0 Hot-swap disk drive bay 3 U0.1-P2/Z1-AA 1S-08-00-10,0 Hot-swap disk drive bay 4 U0.1-P2/Z1-AB 1S-08-00-11,0 Fans Fan 1 (processor fan) U0.1-F1 Fan 2 (processor fan) U0.1-F2...
System cables The following diagrams show the system cable connections. The following diagram illustrates the routing of the internal power cables. Internal power cable routing diagram Operator panel IDE CD-ROM SCSI media device (not supported) CEC backplane to operator panel signal/power cable Two-drop power cable from CEC backplane to IDE CD-ROM and other SCSI media devices (second connector unused) CEC backplane to disk drive backplane power cable...
Page 48
Internal signal cable routing diagram The following diagram illustrates the routing of the signal cables. Operator panel SCSI media device (50-pin, not supported) SCSI media device (68-pin, not supported) SCSI media device (68-pin, not supported) CEC backplane to operator panel signal cable SCSI media (68-pin) to 50-pin converter cable (not supported) SCSI media device signal cable to integrated single-ended SCSI connector on the PCI riser card (not supported)
Specifications This section contains system specifications for the NAS Gateway 500. Note: Refer to the 7014 Series Model T00 and T42 Rack Installation and Service Guide, for information on cabling power from the rack to the I/O subsystem. Table 7. Specifications for the NAS Gateway 500 Dimensions Height 178 mm (7.0 in., 4 EIA units)
External AC power cables To avoid electrical shock, power cables with grounded attachment plugs are provided. Use only properly grounded outlets. Power cables used in the United States and Canada are listed by Underwriter’s Laboratories (UL) and certified by the Canadian Standards Association (CSA). These power cords consist of the following: v Electrical cables, Type SVT or SJT.
Service inspection guide Service personnel should perform an inspection on the system when: v The system is inspected for a maintenance agreement. v Service is requested and service has not recently been performed. v An alterations and attachments review is performed. v Changes have been made to the equipment that might affect the safe operation of the equipment.
Chapter 2. Problem determination overview The system uses an integrated set of software diagnostic procedures to help isolate failing components and system maintenance. This book, along with the RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems, which is included on the NAS Gateway 500 documentation CD-ROM, is the basis of the diagnostic procedures for the system.
Checkpoints, error codes, and SRNs The system uses various types of checkpoints, error codes, and SRNs, which are referred to throughout this book (primarily in Chapter 4, “Checkpoints”, on page 93, Chapter 5, “Error code to FRU index”, on page 125, Chapter 6, “Using the online and standalone diagnostics”, on page 239, and Chapter 12, “Parts information”, on page 373).
When certain error events occur, notifications can be sent to monitoring facilities (for example, IBM Service). The NAS Gateway 500 supports two methods for doing this. The first method is the Electronic Service Agent (ESA), which is a pre-loaded application on the NAS Gateway 500.
Electronic Service Agent for the NAS Gateway 500 ESA allows the NAS Gateway 500 to report events to IBM service that require service involvement, such as system failures or when problem-definable thresholds have been reached. If a call is placed, it is analyzed and verified to ensure that this call is covered under the customer’s warranty and maintenance...
Note: This will not reset the system attention LED. 3. If the system error log indicates service support is required, or there are entries in the service processor error log, see “Entry MAP” on page 39 with the information noted from the error log. Resetting the system attention LED As a user with root authority, type diag on the command line, and do the following: 1.
Chapter 3. Maintenance Analysis Procedures (MAPs) This chapter contains Maintenance Analysis Procedures (MAPs) for the NAS Gateway 500. Entry MAP When possible, run online diagnostics in service mode. Online diagnostics perform additional functions compared to standalone diagnostics. This ensures that the error state of the system that was captured in NVRAM is available for your use in fixing the problem.
Quick Entry MAP The following is a Quick Entry MAP table of contents. Table 8. Quick Entry MAP Problem description Page Service actions The system attention LED on the operator panel is on. OK does not appear in the operator panel before pressing the power-on button. Other symptoms appear in the operator panel or LEDs before the power-on button is pressed.
Page 61
Symptom Action Service actions You have parts to exchange or a corrective action to 1. Go to Chapter 11, “Removal and replacement perform. procedures”, on page 307. 2. Go to “MAP 0410: Repair Checkout” on page 45. You need to verify that a part exchange or corrective Go to “MAP 0410: Repair Checkout”...
Page 62
Symptom Action The system stops with a 3-digit error code. Add 101- to the left of the three digits to create an SRN. Go to the RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems. If there is a location code displayed under the 3-digit error code, then look at the location to see if it matches the failing component that the SRN pointed to.
Page 63
Symptom Action The system stops and the message STARTING Go to Chapter 4, “Checkpoints”, on page 93. SOFTWARE PLEASE WAIT... is displayed on the firmware console. Symptom: The system does not respond to the password being entered, or the system login prompt is displayed when booting in service mode.
Page 64
Symptom Action Symptom: The System Management Services menu is displayed. Action: If you are loading diagnostics from the CD-ROM, you might not have pressed the correct key when you were trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step.
MAP 0410: Repair Checkout Purpose of this MAP This MAP is used to check out the system after a repair is completed.. Note: Use Standalone Diagnostics for repair checkout only when no other diagnostics are available on the system. Standalone Diagnostics do not log repair actions. Step 0410-1 Did you use a hot-swap operation to change the FRU? Go to “Step 0410-2”.
Page 66
Step 0410-6 Exit service mode diagnostics, and reboot the system. Wait for the login prompt or until the activity on the operator panel or display stops. Did the login prompt display? Go to “MAP 1020: Problem Determination” on page 49. Go to “Step 0410-7”.
Page 67
Do the following: 1. Select the resource that has been replaced from the Resource Repair Action menu. If the repair action was reseating a cable or adapter, select the resource associated with that repair action. If the resource associated with your action is not displayed on the Resource List, select sysplanar0. 2.
Page 68
Step 0410-13 This completes the repair, return the system to the user. NAS Gateway 500 Service Guide...
MAP 1020: Problem Determination Purpose of this MAP Use this MAP to get an error code if you were not provided one by the customer or you are unable to load diagnostics. If you can load the diagnostics, go to MAP 0020 in the RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems.
Page 70
2. Select slow boot mode (select disable fast boot) on the system power control menu from the service processor main menu. 3. Power on the system. Refer to “Starting the system” on page 311. 4. Insert the diagnostic CD-ROM into the CD-ROM drive. 5.
Page 71
Symptom Action Symptom: The System Management Services menu is displayed. Action: If you are loading diagnostics from the CD-ROM, you might not have pressed the correct key when you were trying to indicate a service mode IPL of the diagnostic programs. If this is the case, start again at the beginning of this step.
MAP 1240: Memory Problem Resolution Note: The firmware or error code that sent you here could be one of the following: v Service Processor Checkpoints 910B, 910C, 910E, or 95xx v Firmware Checkpoints E134 or E701 v System Power Control Network Error Code 1011840C v Firmware Error Codes 25Cyy001, 25Cyy002, 25B00001, or 25B00004 v Service Processor Error Codes 406x001x, 406x010x, 406x013x, 406x0CA6, or 450x30xx v Common Firmware Error Codes B1xx466x...
Page 73
Attention: Go to the service processor menus. In the System Information Menu, select Memory Configuration/ Deconfiguration. Verify on the first panel that memory repeat gard is disabled. If it is enabled, disable it for the duration of this MAP. When exiting this MAP to go to MAP 0410 in the RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems;...
Page 74
Are any DIMMs deconfigured or does the error code recorded in “Step 1240-1” on page 53 still appear? Go to “Step 1240-4”. Go to “Step 1240-6”. Step 1240-4 1. Turn off the power. Remove the power cords. 2. Install processor card 1 in slot 2. 3.
Page 75
4. Turn on the power. Attempt to boot to the SMS menus. (The system might stop with an 8-digit error code before it reaches the SMS menus.) 5. Power off the system using the white power button. After OK is displayed in the operator panel, look at the memory configuration/deconfiguration menu.
Page 76
Step 1240-9 The failing DIMMs is in the quad of DIMMs that was removed in “Step 1240-7” on page 55. Do the following: 1. Turn off the power. Remove the power cords. 2. Remove the quad of DIMMs that is on the processor card and reinstall the quad of DIMMs that had been removed.
Page 77
Go to “Step 1240-12”. Step 1240-12 Have all the DIMMs in the quad located on the processor card been replaced? Go to “Step 1240-11” on page 56 to test the next DIMM of the quad on the processor card. Go to “Step 1240-13”. Step 1240-13 Did the processor originally have only one quad of DIMMs at the start of this map? Call service support.
MAP 1520: Power Notes: 1. This is not a start-of-call MAP. Use this Power MAP only if you have been directed here from another MAP, an SRN in the RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems, or an error code.
DANGER To prevent electrical shock hazard, disconnect all power cables from the electrical outlet before relocating the system. (D07) FRU identify LEDs This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to the following: v Various FRU identify LEDs –...
MAP 1521: System with Power Failure and No Error Codes Step 1521-1 For NAS Gateway 500, do the following: 1. Place the system into the service position as described in “Placing the NAS Gateway 500 into the service position” on page 312. 2.
Page 81
Step 1521-4 Because you are servicing a drawer unit that is mounted in a rack, do the following: 1. Unplug the the system unit power cables from the power distribution panel. 2. Plug in the main power cables into the power distribution panel. 3.
Page 82
Go to “MAP 0410: Repair Checkout” on page 45. Step 1521-7 1. Unplug the power cables from the system unit. 2. Exchange the operator panel assembly. 3. Plug the power cables into the system unit and wait for the OK prompt located on the operator panel. 4.
Page 83
Does the power LED on the operator panel come on and stay on? Reinstall the original FRU. Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.
Page 84
1. Power supply 2. CEC backplane 3. PCI riser card Restart the system and perform error log analysis. Do you get an SRN indicating that a voltage sensor is out of range? The last FRU exchanged is defective. Go to “MAP 0410: Repair Checkout” on page 45. Reinstall the original FRU.
Page 85
If the symptom changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the “Quick Entry MAP” on page 40 and follow the instructions for the new symptom. Step 1521-15 The problem is within the system unit hardware or with the line voltage or wiring. Shut down the system and remove the power cables from the system unit.
Map 1523: The SPCN Error Codes Table Directed You Here and the Error Code is 1011 1B0x A system power supply is reporting a load fault. Note: Read the danger and caution notices under “Safety notices” on page xv before continuing with this procedure.
Page 87
This ends the procedure. Go to “MAP 0410: Repair Checkout” on page 45. Step 1523-4 1. Turn off the power. 2. Reinstall system power supply 2 that was removed in “Step 1523-2” on page 66. 3. Reconnect the AC power cable to the system power supply 2. 4.
Page 88
d. Media bay devices 4. Reconnect the AC power cables to the system. 5. Turn on the power. Is the error code 1011 1B0x displayed? Go to “Step 1523-8”. The last assembly that you installed is defective. Replace the last card that you installed. This ends the procedure.
MAP 1540: Problem Isolation Procedures These procedures are used to locate problems in the processor subsystem, I/O subsystems, or rack. If a problem is detected, these procedures help you isolate the problem to a failing unit. Find the symptom in the following table;...
Page 90
This system is configured with an arrangement of LEDs that help identify various components of the system. These include, but are not limited to, the following LEDs: Problem isolation procedures Symptom/reference code/checkpoint Action You were sent here by the RS/6000 Eserver pSeries Go to “MAP 1542: I/O Problem Isolation”...
MAP 1541: JTAG Problem Isolation Step 1541-1 Record the error code and location codes that sent you to this MAP. Step 1541-2 Turn off the power. Step 1541-3 Replace the following, if present, one at a time and in the order listed: v First location code item recorded, if any, “Step 1541-1”.
MAP 1542: I/O Problem Isolation This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs.
Page 93
Step 1542-1 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 3. Select slow boot mode (select Disable Fast Boot) on the System Power Control menu from the service processor main menu. 4.
Page 94
Go to “Entry MAP” on page 39. Use the new symptom. 6. Record any devices missing from the list of all adapters and devices. Continue with this MAP. When you have fixed the problem, use this record to verify that all devices appear when you run System Verification.
Page 95
Step 1542-9 1. Boot standalone diagnostics from CD-ROM. 2. If the Please Define the System Console prompt is displayed, follow directions to select the system console. 3. Use the Display Configuration and Resource List to list all attached devices and adapters (refer to the RS/6000 Eserver pSeries Diagnostic Information for Multiple Bus Systems, if necessary).
Page 96
Is the Please Define the System Console prompt displayed and are all attached devices and adapters listed? Go to “Step 1542-12”. Reinstall the next adapter or device and return to the beginning of this step. Continue repeating this process until an adapter or device causes the Please Define the System Console prompt to not be displayed or all attached devices and adapters to not be listed.
MAP 1548: Processor Subsystem Problem Isolation Notes: 1. Be sure to unplug the power cords before removing or installing processor cards, the PCI riser card, or the CEC backplane to avoid damage to it. 2. This MAP assumes that a diagnostic CD-ROM is available. 3.
Page 98
Purpose of this MAP This MAP is used to locate defective FRUs not found by normal diagnostics. For this procedure, diagnostics are run on a minimally configured system. If a failure is detected on the minimally configured system, the remaining FRUs are exchanged one at a time until the failing FRU is identified. If a failure is not detected, FRUs are added back until the failure occurs.
Page 99
10. Remove processor card #2 (if installed). If processor card #2 is removed, ensure that processor card #1 is installed and contains at least one quad of DIMMs. The processor card that is left in the system must contain at least one quad of DIMMs. For example, with a system configuration having two processor cards, processor card #2 contains all of the DIMMs, processor card #1 contains no DIMMs.
Page 100
Does the system stop with code E1F2, E1F3, STBY, 20EE000B or 4BA00840 displayed in the operator panel? A DIMM in the quad that you just replaced in the system is defective. Turn off the power, remove the power cords, and exchange the DIMMs in that quad, one at a time with new or previously removed DIMMs.
Page 101
Step 1548-7 No failure was detected with this configuration. 1. Turn off the power and remove the power cords. 2. Reinstall processor card #2. 3. Plug in the power cords and wait for the operator panel to display OK. 4. Turn on the power. Does the system stop with code E1F2, E1F3, STBY or 20EE000B, 4BA00830 displayed on the operator panel? One of the FRUs remaining in the system unit is defective.
Page 102
Step 1548-9 1. Turn off the power and remove the power cords. 2. Plug the IDE cable into the IDE connector on the CEC backplane. 3. Connect the signal and power connectors to the IDE CD-ROM drive. 4. Plug in the power cords and wait for the operator panel to display OK. 5.
Page 103
Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. If the symptom has changed, check for loose cards, cables, and obvious problems. If you do not find a problem, go to the return to “Quick Entry MAP”...
Page 104
Exchange the FRUs that have not been exchanged, in the following order: 1. Last disk drive installed 2. Disk drive backplane Repeat this step until the defective FRU is identified or all the FRUs have been exchanged. If the symptom did not change and all the FRUs have been exchanged, call service support for assistance.
Page 105
4. Turn on the power. 5. If the Console Selection panel is displayed, choose the system console. 6. After the word keyboard is displayed, press 5 on the ASCII terminal keyboard. 7. If you are prompted to do so, enter the appropriate password. Is the Please Define the System Console prompt displayed? The last device or cable that you attached is defective.
Page 106
Is the Please Define the System Console prompt displayed? Repeat this step until the defective device or cable is identified or all devices and cables have been disconnected. If all the devices and cables have been removed, then one of the FRUs remaining in the system unit is defective.
Page 107
If the symptom did not change and all the FRUs have been exchanged, call service support for assistance. Step 1548-19 Does the system have adapters or devices that require supplemental media? Go to “Step 1548-20”. Go to “Step 1548-21”. Step 1548-20 Consult the PCI adapter configuration documentation for your operating system to verify that all installed adapters are configured correctly.
Page 108
Step 1548-23 1. Ensure that the diagnostics and the operating system are shut down. 2. Turn off the power. 3. If you have not already done so, configure the service processor with the instructions in note 7 on page 77 and then return here and continue. 4.
Page 109
2. Starting with the last installed adapter, disconnect one attached device and cable. 3. Plug in the power cords and wait for the operator panel to display OK. 4. Turn on the power. 5. If the Console Selection panel is displayed, choose the firmware console. 6.
MAP 1549: Attention Problem Isolation Step 1549-1 Record the error code and the location codes that sent you to this MAP. Step 1549-2 Turn off the power. Step 1549-3 Examine the following table and locate the error code that sent you to this MAP to determine your next step.
Page 111
Step 1549-6 Did the system stop with the same error code as recorded in “Step 1549-1” on page 90? The card that you just replaced was defective. This ends the procedure. Return the system to its original configuration. Go to “MAP 0410: Repair Checkout” on page 45. If all of the cards listed in “Step 1549-4”...
Chapter 4. Checkpoints Checkpoints are displayed on the media subsystem operator panel and the virtual terminal while the system is powering on and going through the initial program load (IPL). This chapter explains the IPL flow of the system and provides a table that lists checkpoints that you might see in the operator panel on the ASCII terminal.
hang at one of the checkpoints without displaying an 8-character error code. It is for these hang conditions, only, that any action should be taken with respect to checkpoints. The most appropriate action is included with each checkpoint. Before taking actions listed with a checkpoint, check for additional symptoms in the service processor error log.
Page 115
Table 9. Service processor checkpoints Checkpoint Description Action or possible failing FRU 8000 Test of the service processor DRAM See note 1 on page 100. 8004 Verify base code checksum See note 1 on page 100. 8008 Verify base code mirror checksum See note 1 on page 100.
Page 116
Table 9. Service processor checkpoints (continued) Checkpoint Description Action or possible failing FRU 9033 I/O base initialization See note 1 on page 100. 9034 I2C initialization See note 1 on page 100. 9035 ISA initialization See note 1 on page 100. 9036 Mailbox initialization See note1 on page 100.
Page 117
Table 9. Service processor checkpoints (continued) Checkpoint Description Action or possible failing FRU 910A Non-processor scan interface BATs and See note 1 on page 100. LBIST 910B Wire test See note 1 on page 100. 910C Initialize main storage (system memory) See note 1 on page 100.
Page 118
Table 9. Service processor checkpoints (continued) Checkpoint Description Action or possible failing FRU 9410 Service processor is issuing request to start See note 1 on page 100. instruction execution 9411 Service processor is issuing request to stop See note 1 on page 100. instruction execution 9420 Service processor is issuing request to start...
Page 119
Table 9. Service processor checkpoints (continued) Checkpoint Description Action or possible failing FRU 99FD Service processor receiving firmware See note 2 on page 100. update module Attention: This checkpoint might stay in the operator panel for some period of time during a firmware update or hypervisor dump with no indication of activity.
Page 120
Table 9. Service processor checkpoints (continued) Checkpoint Description Action or possible failing FRU A810-A8FF Description: Scan log dump in progress Action/Possible Failing FRU Informational message. (See “Scan Log Dump Policy” on page 262.) The last two characters of the checkpoints will change as the scan log dump progresses. If the last two characters do not change after 5 minutes, the service processor is hung and must be reset.
d. If this procedure does not fix the problem, call service support. Firmware checkpoints Firmware uses checkpoints (progress codes) in the range of Exxx to EFFF. These checkpoints occur during system startup and can be useful in diagnosing certain problems. Service processor checkpoints are listed in “Service processor checkpoints”...
Page 122
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E138 Create options node Go to “MAP 1540: Problem Isolation Procedures” on page 69. E139 Create node aliases and system aliases Go to “MAP 1540: Problem Isolation Procedures” on page 69. E13A Create packages node Go to “MAP 1540: Problem Isolation...
Page 123
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E174 Description: Establishing host connection Action or possible failing FRU: If the system is not connected to an active network, or if the target server is inaccessible (which can result from incorrect IP parameters being specified), the system still attempts to boot and because time-out durations are necessarily long to accommodate retries, the system might appear to hang.
Page 124
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E1A4 User requested boot to SMS menus using Go to “MAP 1540: Problem Isolation CSP command Procedures” on page 69. E1A5 User requested boot to open firmware Go to “MAP 1540: Problem Isolation prompt using CSP command Procedures”...
Page 125
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E1DC Description: Dynamic console selection. Action possible failing FRU If selection panels can be seen on the terminals and the appropriate key on the input device is pressed within 60 seconds but there is no response to the keystroke, suspect the terminal or serial cable.
Page 126
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E1FB Scan SCSI bus for attached devices 1. If a location code is available, follow the repair actions listed for firmware error code 21A0 00XX. 2. If no location code is available, go to “MAP 1540: Problem Isolation Procedures”...
Page 127
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E502 Set up I/O data structure pointers 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 69. E503 Initialize location code data structures 1.
Page 128
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E50F Clean up the error registers 1. Check for system firmware updates. 2. Replace the riser card specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures”...
Page 129
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E527 Enable the memory MAP and I/O MAP for 1. Check for system firmware updates. the current RIO hub port 2. Replace the riser card specified by the location code.
Page 130
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E537 Configuring the I/O subsystem 1. Replace the riser card that the failing drawer is attached to. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 69. E538 Detected the RIO hub on the other end of 1.
Page 131
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E54A Configure the I/O drawer 1. Check the RIO cable specified by the location code. If the location code specifies an I/O subsystem or its RIO interface, check the RIO cables going into and out of the I/O subsystem.
Page 132
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E561 Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2.
Page 133
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E566 Enable device arbitration on the RIO If the location code associated with this interface chip checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2.
Page 134
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E56B Set up RIO interface chip registers If the location code associated with this checkpoint is either U0.1-P1 or U0.1-P2, do the following: 1. Replace the PCI riser card in the CEC drawer, location: U0.1-P2.
Page 135
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E57F End of RIO interface chip initialization 1. Check for system firmware updates. 2. Go to “MAP 1540: Problem Isolation Procedures” on page 69. E580 Begin initializing the EADS module. 1.
Page 136
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E588 Set up EADS function register 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures”...
Page 137
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E591 Set up EADS function registers 1. Check for system firmware updates. 2. Replace the I/O subsystem backplane specified by the location code. 3. Go to “MAP 1540: Problem Isolation Procedures”...
Page 138
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E5E3 Not in quick-restart: load and set up open 1. Check for system firmware updates. firmware code in memory 2. Go to “MAP 1548: Processor Subsystem Problem Isolation” on page 77.
Page 139
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E602 SSA PCI open firmware has started. Replace the adapter. E603 SSA PCI adapter BIST has completed with Replace the adapter. an error. E604 SSA PCI adapter BIST and subsequent Replace the adapter.
Page 140
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E70D Create VPD. 1. Check for system firmware updates. 2. Call service support. E70F Collect CEC VPD. Go to “MAP 1540: Problem Isolation Procedures” on page 69. E7EE Delaying while waiting for power subsystem 1.
Page 141
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU E853 Start cable ID request process 1. Check the SPCN and RIO cabling to the I/O subystems. 2. Remove, then reapply power to the system. 3. Go to “MAP 1542: I/O Problem Isolation”...
Table 10. Firmware checkpoints (continued) Checkpoint Description Action or possible failing FRU EAA1 Probe PCI-PCI bridge bus Follow the repair actions for checkpoint E151. Boot problems or concerns Depending on the boot device, a checkpoint might be displayed on the operator panel for an extended period of time while the boot image is retrieved from the device.
Page 143
Step 3 Try to boot and run standalone diagnostics against the system, particularly against the intended boot device. Standalone diagnostics can be run from CD-ROM. To boot standalone diagnostics, follow the detailed procedures in “Considerations for running standalone diagnostics from the diagnostic CD-ROM” on page 242.
Page 144
5. If you replaced the indicated FRUs and the problem is not corrected, or the above descriptions did not address your particular situation, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 77. If the problem has been corrected, go to “MAP 0410: Repair Checkout” on page 45. Step 4 It is possible that another installed adapter is causing the problem.
Chapter 5. Error code to FRU index Attention: Start your activity in this chapter by going to “Checkpoint and error code index” on page 127. In this section, you might be asked to perform additional isolation procedures before looking up the actions for a four-character checkpoint or 8-character error code.
Replacing a PCI-X adapter If a network adapter is replaced, the network administrator must be notified so that the client MAC addresses of the new adapter card can be propagated to any required locations in the Ethernet. In addition, the operating system configuration of the network adapter will need to be completed based on the information used on the previous network adapter.
Checkpoint and error code index The following table is an index to the checkpoints or error codes that you might receive to help diagnose a failure. Locate the code that you received, and follow the instructions to determine your next step. If you receive a software error, or the 8-digit error code is not listed in this index, call service support.
Operator panel error codes Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 77 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to “MAP 0410: Repair Checkout” on page 45. Note: Not all operator panel error codes are supported.
SPCN error codes Attention: If a power supply or fan is hot-swapped, wait five minutes, and then recheck the operating system error log. If the system has to be shut down to replace a power supply or fan, reboot the system, wait five minutes after the operating system login prompt appears, and then recheck the operating system error log.
Page 150
Table 13. SPCN error codes for the CEC (system unit) (continued) Error code Description Action or possible failing FRU 1011 1521 Power supply 2 fault 1. System power supply 2 Location code: U0.1-V2 2. CEC backplane Location code: U0.1-P1 1011 1522 Power supply 2 non-power fault 1.
Page 151
Table 13. SPCN error codes for the CEC (system unit) (continued) Error code Description Action or possible failing FRU 1011 1B0F 1.5 V/1.3 V load fault 1. Replace power supply 1. Location code: U0.1-V1 2. If power supply 2 is present, use removed power supply 1 to replace power supply 2.
Page 152
Table 13. SPCN error codes for the CEC (system unit) (continued) Error code Description Action or possible failing FRU 1011 7631 System PCI fan 3 missing 1. PCI fan 3 Location code: U0.1-F3 2. CEC backplane Location code: U0.1-P1 1011 7640 Slow system PCI fan 3 failure 1.
Page 153
Table 13. SPCN error codes for the CEC (system unit) (continued) Error code Description Action or possible failing FRU 1011 8409 No valid processor card VPD found. 1. Verify that the processor card is of the right type for your system and is properly installed.
Page 154
Table 13. SPCN error codes for the CEC (system unit) (continued) Error code Description Action or possible failing FRU 1011 9280 SPCN interface failure 1. Check for system firmware updates. Apply the update if it is available. 2. Replace the system backplane, location: U0.1-P1.
Page 155
Table 14. SPCN error codes for I/O subsystems (continued) Error code Description Action or possible failing FRU 1nn1 1517 Power supply 1 thermal fault 1. Check for obstructions to the air flow around the I/O subsystem. 2. I/O subsystem power supply 1, U0.dd-V1. 3.
Firmware error codes Attention: If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 77 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to “MAP 0410: Repair Checkout” on page 45. Note: Not all firmware error codes are supported.
Page 158
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 20D0 0A02 Error initializing RTAS debugging token 1. Check for system firmware updates. Apply update if available. 2. Call service support. 20D0 0A03 Error initializing interface/service processor 1.
Page 159
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 20D0 0A10 Error initializing interface/cache access 1. Check for system firmware updates. Apply update if available. 2. Call service support. 20D0 0A11 Error initializing interface/power access 1.
Page 160
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 20D0 9008 Invalid subsystem serial number found in VPD. 1. Power off, and then power on the system. 2. If the error persists, call service support. 20D1 0001 Failed to send state-change message to service 1.
Page 161
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 20D1 000D Function to get time-of-day failed. 1. Check for system firmware updates. Apply update if available. 2. Call service support. 20D1 000E Function to get time-of-day failed. 1.
Page 162
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 20EE 0008 No configurable adapters were found in the This warning occurs when the remote IPL menu system by the remote IPL menu in the SMS in the SMS utilities cannot locate any LAN utilities.
Page 163
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 20EE 000F Unable to find the package node. 1. Reboot the system. 2. Reflash the system with the latest level of system firmware. 3. Call service support. 20EE 0010 Unable to execute the root VPD method.
Page 164
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 2100 0xxx SCSI controller fault 1. Before replacing any system components, refer to the notes in error code 21A00001. 2. Replace the SCSI device. 2101 0010 The IDE device remained busy longer than 1.
Page 165
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 21E0 0003 SCSI tape - send diagnostic failed - sense data Before replacing any system components, refer available to the notes in error code 21A0 0001. Replace the SCSI device.
Page 166
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 21F0 0001 SCSI CD-ROM - test unit ready failed - Before replacing any system components, refer hardware error to the notes in error code 21A0 0001. 1.
Page 167
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 2201 0001 PCI Auto LANstreamer token-ring adapter failed Replace the adapter. to complete hardware initialization. See “Checkpoint and error code index” on page 127 for location code information related to this error.
Page 168
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 25A8 0998 Description: NVRAMRC script evaluation error - command line execution error Action: Execution of a command line within the nvram configuration variable “nvramrc” (script) resulted in a “throw” being executed. This script can be modified by the system firmware SMS utilities, the operating system, PCI adapter ROM code or utility, or an operator (using the open firmware script editing command nvedit).
Page 169
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 2602 0001 Description: Invalid PCI adapter vendor ID Action: Attention: Before moving the adapter to another slot, verify with the customer or system administrator that the slot that the adapter is being moved to is available to the partition in which the adapter presently resides.
Page 170
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 2602 0008 Description: PCI device Fcode evaluation error Action: 1. If the location code identifies a slot: a. Check for adapter firmware updates. Apply update if available. b.
Page 171
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 2602 0012 Description: PCI device run-time error; bridge in freeze state. Action: If the location code identifies a slot: 1. Check the cabling to the adapter, in particular, the adapters that have serial ports. Serial ports might require null modems or special cabling configurations to avoid connecting driver outputs.
Page 172
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 2803 0003 RTC not updating - not correctable Replace CEC backplane. Location code: U0.1-P1 2803 0004 RTC operating mode parameters (for example, 1. Set time and date. data mode) changed.
Page 173
Table 15. Firmware error codes (continued) Error code Description Action or possible failing FRU 2B20 8885 Unrecoverable internal hardware error Run diagnostics. Look for an operating system error log entry in which the description says I/O bus time out, access, or other error or I/O bridge/device internal error.
Service processor error codes Attention: Follow the procedure defined in the “Checkpoint and error code index” on page 127. If you replace FRUs or perform an action and the problem is still not corrected, go to “MAP 1548: Processor Subsystem Problem Isolation” on page 77 unless otherwise indicated in the tables. If you replace FRUs and the problem is corrected, go to “MAP 0410: Repair Checkout”...
Page 176
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4064 0B02 Problem with internal RIO interface (I/O slot 0, 1. Check for system firmware updates. I/O port 2) 2. Check the cabling between the CEC and the I/O subsystems.
Page 177
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4064 0EA4 The service processor detected a problem, but Call service support. was unable to isolate the source. 4064 0EAF Service processor code and system type are 1.
Page 179
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4065 00B3 Processor/VPD problem detected 1. Check for system firmware updates. 2. Processor card 1 Location code: U0.1-P1-C1 4065 00B4 Processor/VPD problem detected 1. Check for system firmware updates. 2.
Page 180
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4065 0BF0 Problem with RIO interface (I/O slot 0, I/O port 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems.
Page 181
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4065 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69. 4065 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 183
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4066 0101 Memory extender 2 on processor card 1 Processor card, Location code: U0.1-P1-C1 4066 0130 Memory extender 0 on processor card 2 Processor card, Location code: U0.1-P1-C2 4066 0131 Memory extender 2 on processor card 2 Processor card,...
Page 184
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4066 0CA5 Invalid system configuration 1. Check that the processor cards and DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time.
Page 185
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4066 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69. 4066 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 187
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4067 0A04 Reboot message on VPD channel 4 processor Reboot the system in slow mode. Run diagnostics, then go to ″MAP 0235: System card 2. Reboot is necessary for run-time array gard to take effect.
Page 188
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4067 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 4067 0EA1 Operating system code on main processor 1.
Page 189
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4067 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 4068 0001 Scan interface basic assurance tests detected a CEC backplane CEC backplane fault.
Page 190
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4068 001F DIMM fault DIMM Location code: U0.1-P1-C2-M4 4068 00B0 Bus controller failure 1. Check for system firmware updates. 2. PCI riser card Location code: U0.1-P2 4068 00B1 Bus controller failure 1.
Page 191
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4068 0B03 Problem with internal RIO interface (I/O slot 0, 1. Check for system firmware updates. I/O port 3) 2. Check the cabling between the CEC and the I/O subsystems.
Page 192
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4068 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 71. 4068 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 194
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4069 00B4 Processor/VPD problem detected 1. Check for system firmware updates. 2. Processor card 2 Location code: U0.1-P1-C2 4069 00B7 Insufficient hardware resources were detected to Go to “MAP 1540: Problem Isolation Procedures”...
Page 195
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4069 0BF0 Problem with RIO interface (I/O slot 0, I/O port 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems.
Page 196
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4069 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69. 4069 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 198
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406A 0101 Memory extender 2 on processor card 1 Processor card, Location code: U0.1-P1-C1 406A 0130 Memory extender 0 on processor card 2 Processor card, Location code: U0.1-P1-C2 406A 0131 Memory extender 2 on processor card 2 Processor card,...
Page 199
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406A 0CA5 Invalid system configuration 1. Check that the processor cards and DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time.
Page 200
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406A 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69. 406A 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 202
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406B 0A04 Reboot message on VPD channel 4 processor Reboot the system in slow mode. Run diagnostics, then go to ″MAP 0235: System card 2. Reboot is necessary for run-time array gard to take effect.
Page 203
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406B 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406B 0EA1 Operating system code on main processor 1.
Page 204
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406B 0EE5 Firmware error, no callout available 1. Check for system firmware updates. 2. Call service support. 406C 0001 Scan interface basic assurance tests detected a CEC backplane CEC backplane fault Location code: U0.1-P1...
Page 205
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406C 001F DIMM fault DIMM Location code: U0.1-P1-C2-M4 406C 00B0 Bus controller failure 1. Check for system firmware updates. 2. PCI riser card Location code: U0.1-P2 406C 00B1 Bus controller failure 1.
Page 206
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406C 0B03 Problem with internal RIO interface (I/O slot 0, 1. Check for system firmware updates. I/O port 3) 2. Check the cabling between the CEC and the I/O subsystems.
Page 207
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406C 0EB0 JTAG problem Go to “MAP 1541: JTAG Problem Isolation” on page 71. 406C 0EB1 Attention line 0 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 209
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406D 00B4 Processor/VPD problem detected. 1. Check for system firmware updates. 2. Processor card 2 Location code: U0.1-P1-C2 406D 00B7 Insufficient hardware resources were detected to Go to “MAP 1540: Problem Isolation Procedures”...
Page 210
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406D 0BF0 Problem with RIO interface (I/O slot 0, I/O port 1. Check for system firmware updates. 2. Check the cabling between the CEC and the I/O subsystems.
Page 211
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406D 0EB4 Attention line 3 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69. 406D 0EB5 Attention line 4 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 213
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406E 0101 Memory extender 2 on processor card 1 Processor card, Location code: U0.1-P1-C1 406E 0130 Memory extender 0 on processor card 2 Processor card, Location code: U0.1-P1-C2 406E 0131 Memory extender 2 on processor card 2 Processor card,...
Page 214
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406E 0CA5 Invalid system configuration 1. Check that the processor cards and DIMMs are properly installed and configured. 2. If there are any FRUs listed in the service processor error log entry, replace them one at a time.
Page 215
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406E 0EB9 Attention line 8 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69. 406E 0EBA Attention line 9 stuck fault Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 217
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406F 0A04 Reboot message on VPD channel 4 processor Reboot the system in slow mode. Run diagnostics, and then go to ″MAP 0235: System card 2. Reboot is necessary for run-time array gard to take effect.
Page 218
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406F 0EA0 MOPS service processor code error. 1. Check for system firmware updates. 2. Call service support. 406F 0EA1 Operating system code on main processor 1.
Page 219
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 406F 0EE5 Firmware error; no callout available. 1. Check for system firmware updates. 2. Call service support. 4503 26F6 Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor.
Page 220
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 26FA Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1.
Page 221
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 26FE Description: 2-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1.
Page 222
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 301B Description: 1 GB DIMM fault Action:This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1.
Page 223
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 301E Description: 1 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1.
Page 224
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 303B Description: 1 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1.
Page 225
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 309A Description: 128 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1.
Page 226
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 309D Description: 512 MB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1.
Page 227
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4503 30AA Description: 2 GB DIMM fault Action: This error code indicates that the memory repeat gard function has deconfigured a quad of memory. Do the following: 1.
Page 228
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4504 303C 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 4504 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 229
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4505 302F 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 4505 303B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 230
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4506 301E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 4506 302E 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 231
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4507 301C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 4507 301D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 232
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4508 301A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 4508 301B 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 233
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4509 26FE 2-way processor card fault Replace the processor card specified by the location code. 4509 26FF 1-way processor card fault Replace the processor card specified by the location code.
Page 234
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 450A 26FB 2-way processor card fault Replace the processor card specified by the location code. 450A 26FD 1-way processor card fault Replace the processor card specified by the location code.
Page 235
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 450B 26F8 1-way processor card fault Replace the processor card specified by the location code. 450B 26FA 2-way processor card fault Replace the processor card specified by the location code.
Page 236
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 450C 26F6 1-way processor card fault Replace the processor card specified by the location code. 450C 26F7 2-way processor card fault Replace the processor card specified by the location code.
Page 237
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 450C 309F 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 450C 30AA 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 238
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 450D 309D 512 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 450D 309E 1 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 239
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 450E 309B 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 450E 309C 256 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 240
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 450F 303D 2 GB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52. 450F 309A 128 MB DIMM fault Go to “MAP 1240: Memory Problem Resolution” on page 52.
Page 241
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 460B 28C4 CEC backplane fault Replace the CEC backplane, Location code: U0.1-P1 460C 287A CEC backplane Replace the CEC backplane, Location code: U0.1-P1 460C 28C4 CEC backplane fault Replace the CEC backplane, Location code: U0.1-P1 460D 287A CEC backplane...
Page 242
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4B23 26F8 Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1.
Page 243
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4B23 26FD Description: 1-way processor card fault Action: This error code indicates that the processor repeat gard function has deconfigured a processor. Do the following: 1.
Page 244
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4B24 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B24 26FD 1-way processor card fault Replace the processor card specified by the location code.
Page 245
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4B27 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B27 26FB Description: 2-way processor card fault Action: 1. If the system has one 2-way processor card or two 2-way processor cards, look at word 13 in the service processor error log entry detail data.
Page 246
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4B29 26FF 1-way processor card fault Replace the processor card specified by the location code. 4B2A 26F6 1-way processor card fault Replace the processor card specified by the location code.
Page 247
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4B2C 26FB 2-way processor card fault Replace the processor card specified by the location code. 4B2C 26FD 1-way processor card fault Replace the processor card specified by the location code.
Table 16. Service processor error codes (continued) Error code Description Action or possible failing FRU 4B2F 26FA 2-way processor card fault Replace the processor card specified by the location code. 4B2F 26FB 2-way processor card fault Replace the processor card specified by the location code.
Error code Description Action or possible failing FRU A1FD 0001 Service processor flash recovery Insert Diskette #1 A1FD 0002 Service processor flash recovery Insert Diskette #2 A1FD 0003 Service processor flash recovery Insert Diskette #3 A1FD 0004 Service processor flash recovery Insert Diskette #4 Scan dump messages Error code...
Page 250
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B006 1408 Service processor program error 1. Update system firmware. 2. Check for operating system updates. 3. CEC backplane Location code: U0.1-P1 B006 1409 Service processor program error 1.
Page 251
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B10F 1675 Service processor failure CEC backplane Location code: U0.1-P1 B10F 1681 Service processor firmware update invalid Call service support. B10F 1682 Service processor firmware update invalid Call service support.
Page 252
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B1xx 4620 Modem configuration failure 1. Check modem configuration files. 2. Check for system firmware update. 3. Modem 4. CEC backplane Location code: U0.1-P1 B1xx 4621 Modem disconnect failure 1.
Page 253
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B1xx 4660 Memory subsystem failure Go to “MAP 1540: Problem Isolation Procedures” on page 69. B1xx 4661 Memory card VPD failure Go to “MAP 1540: Problem Isolation Procedures” on page 69.
Page 254
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B1xx 4690 Description: Operating system surveillance timeout (AIX to service processor interface failure). Action: 1. This error code indicates that the operating system terminated early (which usually implies an operating system crash).
Page 255
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B1xx 4695 System power control network to service 1. Check for system firmware updates. processor interface failure 2. CEC backplane Location code: U0.1-P1 B1xx 4696 System firmware interface failure 1.
Page 256
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B1xx 4699 (Continued from the previous page.) (continued) A806: Loss of the surveillance heartbeat between the HMC and the service processor. Actions: 1. Disconnect HMC (not supported). 2.
Table 17. Common firmware error codes (continued) Error code Description Action or possible failing FRU B1FD 0015 Too many firmware update diskettes inserted (or 1. Ensure the firmware update image is correct wrong or faulty update image) for the specific system type and model. 2.
Problem determination generated error codes Table 19. Problem determination-generated error codes Error code Description Action or possible failing FRU M0BT 0000 The system hung during speaker POST. This Go to “MAP 1540: Problem Isolation Procedures” error code is generated by the customer on page 69.
Chapter 6. Using the online and standalone diagnostics The NAS diagnostics consist of online diagnostics and standalone diagnostics. Online diagnostics, when they are installed, reside with AIX in the file system. They can be booted: v In single user mode (referred to as service mode) v To run in maintenance mode (referred to as maintenance mode) v To run concurrently with other applications (referred to as concurrent mode) Standalone diagnostics must be booted before they can be run.
Running the online diagnostics in service mode To run online diagnostics in service mode, perform the following steps: 1. Verify with the system administrator and users that all programs may be stopped, then do so. If the system software is running, shut down the system as described in “Stopping the system” on page 310, without removing power.
Running the online diagnostics in concurrent mode To run online diagnostics in concurrent mode, do the following: 1. Log in to the system as root user. 2. Enter the diag command. 3. When the Diagnostic Operating Instructions are displayed, follow the instructions to check out the resources that you want.
Standalone diagnostics operations Use standalone diagnostics to test the system when the online diagnostics are not installed and as a method of testing the disk drives that cannot be tested by the online diagnostics. Note: No error log analysis is done by the standalone diagnostics. The CD-ROM drive and the controller that controls it cannot be tested by the standalone diagnostics.
Chapter 7. Verifying the hardware operation The system verification procedure checks the system for correct hardware operation. If you have a problem with your system in the future, use this procedure to test the system hardware to help you determine if you have a hardware problem. Run the system verification procedure as described in the following steps.
The following figure shows the operator panel and the locations of operator panel components that are referred to in this procedure. 3 4 5 1 Power-on button 6 Operator panel display 2 Power LED 7 (FS1) front serial connector (RJ-48 connector) 3 Attention LED 8 Service processor reset switch (pinhole) 4 SCSI port activity...
2. To check other resources, select the resource. When you have checked all of the resources that you need to check, go to “Step 5. Stop the diagnostics”. Step 5. Stop the diagnostics To stop the diagnostics, do the following: 1.
Chapter 8. Hardware problem determination This chapter provides information about using standalone or online diagnostics to help you solve hardware problems. Problem determination using the standalone or online diagnostics Use this procedure to obtain a service request number (SRN) when you are able to load the standalone or online diagnostics.
Page 268
Step 5 Diagnostic tests can be run on many resources while the operating system is running. However, you can obtain more extensive problem isolation by running online diagnostics in service mode. Do you want to run the online diagnostics in service mode? Go to “Step 6”.
Page 269
Step 10 The diagnostics loaded correctly. Press Enter. Is the Function Selection menu displayed? Go to “Step 11”. Go to “Step 12”. Step 11 There is a problem with the ASCII-terminal keyboard. Go to the problem determination documentation for your ASCII terminal. Step 12 1.
Page 270
Step 13 Is the Diagnostic Selection Menu displayed? If Problem Determination was selected from the Diagnostic Mode Selection menu, and if a recent error has been logged in the error log, the diagnostics automatically begin testing the resource. Follow the displayed instructions. v If the No Trouble Found screen is displayed, press Enter.
Page 271
Step 17 This step loads the standalone diagnostics. If you are unable to load the diagnostics, go to “Step 8” on page 248. 1. Turn on the power. 2. Insert the diagnostic CD-ROM into the CD-ROM drive. 3. When the keyboard indicator is displayed, press 5 on the ASCII keyboard to indicate that diagnostics are to be loaded.
Problem determination when unable to load diagnostics Use this procedure to obtain an error code. Service personnel use the error code to determine which field replaceable units (FRUs) are needed to restore the system to correct operation. Step 1. Considerations before running this procedure v The diagnostics uses an ASCII terminal attached to serial port 1.
Page 273
Symptom Action The power LED does not come on, or comes on and Check the power cable to the outlet. Check the circuit does not stay on. breakers and check for power at the outlet. Ensure that the room temperature is within 16 to 32°C (60 to 90°F).
Page 274
Symptom Action The System Management Services menu is displayed. The device or media that you are attempting to boot from might be faulty. 1. Check the SMS error log for any errors. To check the error log: v View Error Log v If an error is logged, check the time stamp.
Chapter 9. Using the service processor Note: The information in this chapter regarding the configuring of serial ports (and modems attached to those serial ports), applies only to serial port 1 and 2 on the CEC backplane (location U0.1-P1). The service processor runs on its own power boundary and continually monitors hardware attributes and the environmental conditions within the system.
Accessing the service processor menus remotely If your system has a modem connected to serial port 2 and is configured for call-in (see “Modem configuration menu” on page 277), the service processor menus can be accessed remotely as follows: 1. With the system powered off, call in from a remote terminal. 2.
Displays a number of the boot progress indicators, which may include service processor checkpoints, IPL checkpoints, or configuration codes, from the previous system boot. This information can be useful in diagnosing system faults. The progress indicator codes are listed from top (latest) to bottom (oldest). This information is not stored in nonvolatile storage.
Page 278
Note: The information under the Service Processor Firmware heading in the following Main Menu illustration is example information only. Service Processor Firmware VERSION: RH011007 Copyright 2001 IBM Corporation SYSTEM NAME MAIN MENU 1. Service Processor Setup Menu 2. System Power Control Menu 3.
System Management Services (SMS) utilities. See the IBM TotalStorage NAS Gateway 500 Service Guide for information about System Management Services. The general-access password can be set only from service processor menus.
Page 280
Attention: For security purposes, the service processor counts the number of attempts to enter passwords. The results of not recognizing a password within this error threshold are different, depending on whether the attempts are being made locally (at the system) or remotely (through a modem). The error threshold is three attempts.
Page 281
OS Surveillance Setup Menu 1. Surveillance: Currently Enabled 2. Surveillance Time Interval: 2 minutes 3. Surveillance Delay: 2 minutes 98. Return to Previous Menu 0> – Surveillance Can be set to Enabled or Disabled. – Surveillance Time Interval Can be set to any number from 2 through 255. –...
Page 282
SERIAL PORT SNOOP SETUP MENU 1. System reset string: Currently Unassigned 2. Snoop Serial Port: Currently Unassigned 98. Return to Previous Menu 1> Use the Snoop Serial Port option to select the serial port to snoop. Note: Only serial port 1 is supported. Use the system reset string option to enter the system reset string, which resets the machine when it is detected on the main console on serial port 1.
Page 283
Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0> 1 Select from the following options: (As Needed=2, Always=3) Enter New Option: 0>...
Page 284
Option 2 displays the following: Scan Log Dump Setup Menu 1. Scan Log Dump Policy: Currently As Needed 2. Scan Log Dump Content: Currently As Requested 3. Immediate Dump 98. Return to Previous Menu 0> 2 Select from the following options: (As Requested=1, Optimum=2, Complete=3, Minimum=4) Enter New Option: 0>...
System power control menu This menu is used to set power-control options. Other menus that control boot options are available from this menu. SYSTEM POWER CONTROL MENU 1. Enable/Disable Unattended Start Mode: Currently Enabled 2. Ring Indicate Power-On Menu 3. Reboot/Restart Policy Setup Menu 4.
Page 286
v Reboot/Restart Policy Setup Menu The following menu controls the Reboot/Restart Policy: Reboot/Restart Policy Setup Menu 1. Number of reboot attempts: Currently 1 2. Use OS-Defined restart policy? Currently No 3. Enable supplemental restart policy? Currently Yes 4. Call-Out before restart: Currently Disabled 98.
Page 287
v Power-On System Allows immediate power-on of the system. v Power-Off System This option is not available on this system. v Enable/Disable Fast System Boot Allows the user to select the IPL type, mode, and speed of the system boot. Attention: Selecting the fast IPL results in several diagnostic tests being skipped and a shorter memory test being run.
System information menu This menu provides access to system configuration information, error logs, system resources, and processor configuration. SYSTEM INFORMATION MENU 1. Read VPD Image from Last System Boot 2. Read Progress Indicators from Last System Boot 3. Read Service Processor Error Logs 4.
Page 289
v Read System POST Errors This option should only be used by service personnel to obtain additional debug information. v Read NVRAM Displays Non Volatile Random Access Memory (NVRAM) content. v Read Service Processor Configuration Displays current service processor configuration. v Processor Configuration/Deconfiguration Menu Enable/Disable CPU Repeat Gard CPU repeat gard will automatically deconfigure a CPU during a system boot if a processor has...
Page 290
The error status of each processor is indicated by AB, where B indicates the number of errors and A indicates the type of error according to the following: 1. Bring-up failure 2. Run-time non-recoverable failure 3. Run-time recoverable failure 4. Group integrity failure 5.
Page 291
These menus allow the user to change the system memory configuration. If it is necessary to take one of the DIMMs offline, this menu allows you to deconfigure a DIMM, and then reconfigure the DIMM at a later time. When this option is selected, a menu is displayed. The following is an example of this menu: MEMORY CONFIGURATION/DECONFIGURATION MENU 77.
Page 292
To change the memory configuration, select the number of the DIMM. The DIMM state will change from configured to deconfigured or from deconfigured to configured. In the previous example menu, each line shows two DIMMs and indicates whether they are configured. To enable or disable Memory Repeat Gard, use menu option 77 of the Memory Configuration/Deconfiguration Menu.
Page 293
v LED Control Menu This menu displays the state of the system attention LED. Use this menu to toggle the attention LED between identify (blinking) and off. It is not available when the system is in standby. An example of this menu follows: LED Control 1.
Page 294
v Performance Mode Setup Menu If certain types of processor cards are installed in the system, this menu is not available. For other types of processor cards, this menu is active after the first boot. Note: The first time the system is booted after NVRAM is cleared, Not Applicable is displayed under Performance Mode Setup Menu on the panel.
If you want to override the default setting, a brief description of each performance mode follows: Large Commercial System Optimization The setting for systems that do not fall into the other two categories, standard operation and turbo database mode. This setting provides the optimum performance for most applications. Standard Operation Optimizes the system for large-memory-bandwidth applications where minimal sharing of data occurs and the likelihood of significant hardware data-prefetching exists.
Call-in/call-out setup menu Note: The information in this section regarding the configuring of serial ports, and modems attached to those serial ports, applies only to the serial port S2 on the CEC backplane (location U0.1-P1). These serial ports are typically used for call-in and call-out. CALL-IN/CALL-OUT SETUP MENU 1.
30. Save configuration to NVRAM and Configure modem 98. Return to Previous Menu 0> For information on choosing a modem configuration file, see the IBM TotalStorage NAS Gateway 500 Service Guide for information about modem configuration. Serial port selection menu This menu allows you to enable or disable the call-in and call-out functions of each serial port in any combination.
Call-in and ring-indicate power-on cannot be enabled at the same time. If ring-indicate power-on is already enabled and you try to enable call-in, a message prompts you for confirmation. Refer to the message displayed on the console. Serial port speed setup menu This menu allows you to set serial port speed to enhance terminal performance or to accommodate modem capabilities.
v Service Center Telephone Number is the number of the service center computer. The service center usually includes a computer that takes calls from servers with call-out capability. This computer is referred to as the catcher. The catcher expects messages in a specific format to which the service processor conforms.
v Call-Out policy can be set to first or all. If call-out policy is set to first, the service processor stops at the first successful call-out to one of the following numbers in the order listed: 1. Service Center 2. Customer Administrative Center 3.
Service processor parameters in service mode When the system is in service mode, the following service-processor parameters are suspended: v Unattended Start Mode v Reboot/Restart Policy v Call-Out v Surveillance When you exit service mode, the service processor functions are reactivated. System power-on methods This section discusses the following system power-on methods: v Power-on switch...
Service processor reboot/restart recovery Reboot describes bringing the system hardware back up; for example, from a system reset or power-on. The boot process ends when control passes to the operating system process. Restart describes activating the operating system after the system hardware is reinitialized. Restart must follow a successful reboot.
Downloading the firmware update image and update instructions from the Web is preferred. Always check the NAS Gateway 500 Support Web site for the latest firmware images and update instructions. The Web address is: http://www.ibm.com/servers/storage/support/download.html. Notes: 1. Firmware update installation is not concurrent. Installation of the firmware will cause an unconditional reboot of the system.
Download the detailed instructions, as well as the latest flash images, from the following Web address: http://www.ibm.com/servers/storage/support/download.html. Before doing a system firmware update, check the Web site for the latest code and images. Do not attempt a firmware update if you are unsure of the image that you have, or of the procedure required for the update.
Note: This information may be superseded by the information that is available on the Web site listed below. Always check the Web site for the latest images and instructions for checking the firmware level. The Web address is: http://www.ibm.com/servers/storage/support/download.html. The firmware level can be checked using the CLI or in the service processor main menu.
5. Enter y to continue. Press Enter. 6. Follow the on-screen update steps as they are presented. 7. The Rebooting Service Processor message appears on the screen. The NAS Gateway 500 will reboot. This can take up to thirty minutes, depending on the configuration of the target server. Because the update occurs during this shutdown/reboot sequence, it is important to protect the server from interruptions.
Archiving the update files In the event that it becomes necessary to restore the server to a certain firmware level, you should identify and archive the materials for each update you install. If the download process produced diskettes, label and store them in a safe place. If the download process produced files, archive and identify the files for convenient retrieval.
System firmware surveillance System firmware surveillance is automatically enabled during system power-on. It cannot be disabled by the user, and the surveillance interval and surveillance delay cannot be changed by the user. If the service processor detects no heartbeats during system IPL (for a set period of time), it cycles the system power to attempt a reboot.
3. To call out before restart, set Call-out before restart to ENABLED from the Reboot/Restart Policy Setup menu. Note: Some modems, such as IBM 7857-017, are not designed for the paging function. Although they can be used for paging, they will return an error message when they do not get the expected response from another modem.
LCD progress indicator log The following is an example of the LCD progress indicator log. It shows the types of entries that can appear in the log, and is for example purposes only. The progress indicator codes are listed from top (latest) to bottom (oldest). LCD Progress Indicator Log B0FF 0539..17...
Service processor operational phases This section provides a high-level flow of the phases of the service processor. SP Power Applied Pre-Standby Phase Standby Phase SP Menus Available Bring-Up Phase SMS Menus Available Run-time Phase Operating System Login Prompt Available Pre-standby phase This phase is entered when the system is connected to a power source.
Service processor menus are available on ASCII terminals attached to serial port S1. Bring-up phase This phase is entered upon power-on, and exited upon loading of the operating system. The bring-up phase components are as follows: v Retry Request Check The service processor checks to see if the previous boot attempt failed.
Chapter 10. Using System Management Services Use the System Management Services menus to view information about your system, and to perform tasks such as setting a password, changing the boot list, and setting the network parameters. To start the System Management Services, do the following: 1.
On all menus except the Main Menu, there are several navigation keys: Return to the main menu. Return to the previous menu. Exit the System Management Services and start the operating system. If X is entered, you are asked to confirm your choice to exit the SMS menus and start the operating system.
Change password options The Change Password Options menu enables you to select from password utilities. Change Password Options 1 Set Privileged-Access Password 2 Remove Privileged-Access Password -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _...
Setup remote IPL (initial program load) This option allows you to enable and set up the remote startup capability of your system. You must first specify the network parameters. Network Parameters 1. IP Parameters 2. Adapter Parameters 3. Ping Test -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu...
Page 317
Adapter Parameters Device Slot Hardware Address 1. 10/100 Mbps Ethernet PCI Adapt Integrated:U0.1-P1/E2 0002554f50c3 2. Port 1 - IBM 2 PORT 10/100/100 6:U0.1-P2-I6/E1 00096bae31fc 3. Port 2 - IBM 2 PORT 10/100/100 6:U0.1-P2-I6/E2 00096bae31fd 4. 10/100 Mbps Ethernet PCI Adapt Integrated:U0.1-P1/E1...
Page 318
To test a connection to a remote system unit, select Ping Test from the Network Parameters Menu. After selecting the Ping Test option, you must select which adapter communicates with the remote system. Adapter Parameters Device Slot Hardware Address 1. 2-Gbps Fibre Channel HBA 4:U0.1-P1-I4/E1 000629aca72d 2.
After selecting which adapter and its parameters to use to ping the remote system, you must provide the addresses needed to communicate with the remote system. Ping Test 1. Client IP Address [129.132.4.20] 2. Server IP Address [129.132.4.10] 3. Gateway IP Address [129.132.4.30] 4.
Select boot options Use this menu to view and set various options regarding the installation devices and boot devices. 1. Select Install or Boot a Device 2. Select Boot Devices 3. Multiboot Startup <OFF> -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services --------------------------------------------------------------------------------------------------...
Page 321
.---------------------------------------------------------. | THE SELECTED DEVICES WERE NOT DETECTED IN THE SYSTEM ! | | Press any key to continue. `---------------------------------------------------------’ If Hard Drive is selected, the following menu is displayed: Select Hard Drive Type 1. SCSI 2. SSA 3. SAN 4.
Select Task SCSI 18200 MB Harddisk (loc=U0.1-Ethernet) 1. Information 2. Normal Mode Boot 3. Service Mode Boot -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ If either Normal Mode Boot or Service Mode Boot is selected, the next panel will prompt, Are you sure?.
Page 323
Select Device Type 1. Diskette 2. Tape 3. CD/DVD 4. IDE 5. Hard Drive 6. Network 7. None 8. List All Devices -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services -------------------------------------------------------------------------------------------------- Type the number of the menu item and press Enter or Select a Navigation key: _ When a device type is selected, such as item 4, a Select Media Type menu is displayed.
Page 324
When a device type is selected, such as item 1, a Select Task menu is displayed. The following is an example of that menu for an IDE CD-ROM. Select Task IDE CD-ROM 1. Information 2. Set Boot Sequence: Configure as 1st Boot Device -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu...
Display current settings This option displays the current setting of the customized boot list. An example of this menu, with one device in the boot list, follows. Current Boot Sequence 1. IDE CD-ROM None None None None -------------------------------------------------------------------------------------------------- Navigation keys: M = return to main menu ESC key = return to previous screen X = eXit System Management Services...
Chapter 11. Removal and replacement procedures Safety considerations Notes: 1. Servicing of the NAS Gateway 500 is typically performed with the system in the rack and placed into the service position. The service position for the NAS Gateway 500 is described in “Placing the NAS Gateway 500 into the service position”...
Page 328
CAUTION: This unit has more than one power supply cord. To reduce the risk of electrical shock, disconnect two power supply cords before servicing. (D10) NAS Gateway 500 Service Guide...
Procedure list Procedure Procedure name and page location Adapters “PCI-X adapters” on page 324 Battery “Battery” on page 348 Bezel “Bezel” on page 315 Cable-management arm “Cable management arm” on page 323 CD-ROM drive “Media drives (diskette and CD-ROM)” on page 337 CEC backplane “Central electronics complex (CEC) backplane”...
Handling static-sensitive devices Attention: Electronic boards, diskette drives, and disk drives are sensitive to static electricity discharge. These devices are wrapped in antistatic bags to prevent this damage. Take the following precautions: v If you have an antistatic wrist strap available, use it while handling the device. v Do not remove the device from the antistatic bag until you are ready to install the device in the system.
5. After you shut down the operating system, set the power switches of any attached devices to Off. If necessary, disconnect the power sources to both of the NAS Gateway 500’s power supplies. Starting the system To power on the system, perform the following steps in sequence. 1.
Placing the NAS Gateway 500 into the service position Attention: When placing the NAS Gateway 500 into the service position, it is essential that all stability plates are firmly in position to prevent the rack from toppling. Ensure that only one system drawer at a time is in the service position.
3. Release the system-release latches on both the left and right sides. 4. Pull the system drawer out from the rack until the rails are fully extended. Note: When the system rails are fully extended, safety latches on the slide rails lock into place. This prevents the system from being accidentally pulled out too far and dropped.
Service cover Before performing the following procedures, read the “Safety notices” on page xv. Service access cover removal Before performing this procedure, the rack should be correctly stabilized and the system unit placed in the service position, as described in “Placing the NAS Gateway 500 into the service position” on page 312. To remove the service access cover, perform the following steps: 1.
Bezel Before performing the following procedures, read the “Safety notices” on page xv. Front bezel removal To remove the bezel, perform the following steps: 1. Press in on both bezel-release tabs at the same time. See the following illustration for the bezel-release tabs.
System rack removal and replacement Before performing these procedures, read “Safety notices” on page xv. System drawer removal ≥18 kg (37 lbs) ≥32 kg (70.5 lbs) ≥55 kg (121.2 lbs) CAUTION: Use safe practices when lifting. (C24) To remove the system drawer from the rack, do the following: 1.
System drawer replacement CAUTION: The stabilizer must be firmly attached to the bottom rear of the rack to prevent the rack from turning over when the drawers are pulled out of the rack. Do not pull out or install any drawer or feature if the stabilizer is not attached to the rack.
Page 338
5. Pull the system out of the rack, making sure that the inner rails follow the system drawer, until you can see the first screw hole located on the inner rail. Notes: a. If necessary, push the middle rails back to prevent them from following the system drawer. b.
Page 339
8. To secure the third and fourth M4 retaining screws to the system, perform the following steps: a. Pull the system further out from the rack until you can see the fourth screw hole. b. Align the third screw hole located on the middle rail to the third screw hole located on the inner rail, and install the M4 screws.
the rack flange with a screw clip nut. 1 NAS Gateway 500 front bezel 2 M8 16-mm retaining screw 3 System-drawer release latch Drawer-release latch and brackets The following procedure covers the removal and replacement of the system’s drawer-release latch. Drawer-release latch removal To remove one of the NAS Gateway 500 drawer-release latches, do the following: 1.
3. Pull the release latch off the latch bracket post. 1 System drawer-release latch 5 System drawer-release latch 2 System drawer-release latch bracket (right) 6 Retaining screws 3 NAS Gateway 500 7 Release latch retaining screw 4 System drawer-release latch bracket (left) 8 Latch bracket post Drawer-release latch replacement To replace one of the NAS Gateway 500 drawer-release latches, do the following:...
4. Remove the retaining screws that secure the latch bracket to the side of the NAS Gateway 500. 1 System drawer-release latch 5 System drawer-release latch 2 System drawer-release latch bracket (right) 6 Retaining screws 3 NAS Gateway 500 7 Release latch retaining screw 4 System drawer-release latch bracket (left) 8 Latch bracket post Drawer-release latch bracket replacement...
Cable management arm Cable management arm removal 1. Make sure that the system drawer is in the service position as described in “Placing the NAS Gateway 500 into the service position” on page 312. 2. Carefully unbundle the external cables by unfastening the Velcro fastener strips on the cable management arm.
6. Carefully bundle the external cables together, starting with the thickest (most durable first). Fiber optic cable should be gathered last and take extra care not to crimp or crease the cables. 7. Loosely wrap the Velcro strips around the external cables and the cable management arm. Note: To allow for cable movement, do not wrap the Velcro fastener strips tightly around the cable bundle.
PCI-X adapter replacement Note: Use this procedure only when you are replacing an adapter with an identical adapter. If you are replacing an adapter with an adapter that is not identical to the adapter removed, go to “PCI-X adapter removal” on page 328 and “PCI-X adapter installation” on page 329. To replace a PCI-X adapter, do the following: 1.
Page 346
13. Turn the locking latch and lift the plastic retainer seat off the adapter. 1 System unit 2 Adapter retainer assembly 3 Adapter retainer assembly (retainer seat down and the retainer clip in the locked position) 4 Adapter retainer assembly (retainer seat down and the retainer clip in the unlocked position) 5 PCI-X adapter faceplate (dotted lines) 6 Adapter retainer assembly (in the unlocked position) 7 PCI-X adapter...
Page 347
18. Lower the plastic retainer seat over the PCI-X adapter faceplate, and rotate the locking latch clockwise until it clicks into the locked position. 1 System unit 2 Adapter retainer assembly (in the unlocked position) 3 Adapter retainer assembly (retainer seat down and the retainer clip in the unlocked position) 4 Adapter retainer assembly (retainer seat down and the retainer clip in the locked position) 5 PCI-X adapter faceplate (dotted lines) 19.
PCI-X adapter removal To remove a PCI-X adapter, perform the following steps: 1. Determine which adapters you plan to remove. 2. Record the slot number and location of each adapter being removed. Note: Adapter slots are numbered on the back of the system unit. 3.
12. Turn the locking latch and lift the plastic retainer seat off the adapter. 1 System unit 5 PCI-X adapter faceplate (dotted lines) 2 Adapter retainer assembly 6 Adapter retainer assembly (in the unlocked position) 3 Adapter retainer assembly (retainer seat 7 PCI-X adapter down and the retainer clip in the locked position)
Page 350
6. Rotate the locking retainer clip on the identified slot, lift the plastic stop, and remove the expansion slot cover. 1 System unit 5 PCI-X adapter faceplate (dotted lines) 2 Adapter retainer assembly 6 Adapter retainer assembly (in the unlocked position) 3 Adapter retainer assembly (retainer seat 7 PCI-X adapter down and the retainer clip in the locked...
Page 351
8. Lower the tab onto the PCI-X adapter face plate. Rotate the adapter retainer clip clockwise until it covers the tab at approximately a 45-degree angle. See the following illustration. 1 System unit 2 Adapter retainer assembly (in the unlocked position) 3 Adapter retainer assembly (retainer seat down and the retainer clip in the unlocked position) 4 Adapter retainer assembly (retainer seat down and the retainer clip in the locked position) 5 PCI-X adapter face plate (dotted lines)
v If you receive a failure message indicating a hardware error, the problem might be either the adapter or the PCI-X slot. Isolate the problem by swapping an identical adapter into the slot and retrying. If you determine that you have failing hardware, call your service representative. Note: After isolating the problem, be sure to swap the adapters back to their original slots.
Page 353
Chapter 11. Removal and replacement procedures...
Fans Before performing the following procedures, read “Safety notices” on page xv. Fan assembly removal This section contains procedures for removing processor cooling fans and PCI-X adapter cooling fans. Select the removal procedure according to the fan assembly that you are servicing. To remove a fan, do the following: 1.
Processor fan removal If you are servicing one of the processor-cooling fan assemblies, do the following: 1. Lift the processor-fan-access cover. 2. Place your thumb and index finger into the two finger holes, located on the top of the faulty fan. 3.
Page 356
PCI-X adapter fan replacement Attention: The system will power off if the failing PCI cooling fan has not been exchanged within five minutes. When installing a PCI-X adapter cooling fan, do the following: 1. Before inserting the fan assembly into the fan cage, ensure that the fan connector located on the fan housing is facing the front of the system.
Media drives (diskette and CD-ROM) Note: The two media bays are used to house an IDE CD-ROM (media bay 1) and diskette drive (media bay 2). See the illustrations on page 338 for the location of the two media bays. Before performing this procedure, read the following: v “Safety notices”...
4. Put the media drive and carrier-tray assembly in a safe place. 1 Service access cover 4 IDE CD-ROM 2 NAS Gateway 500 5 Media-device support shelf 3 Diskette drive 6 Front bezel Media drive replacement To replace a media drive, perform the following steps: 1.
Disk drives Before performing the following procedure, read “Safety notices” on page xv. The procedure to replace a disk drive depends on whether or not the NAS Gateway 500 has the mirroring feature code installed. Attention: You do not have to power off the system to remove a hot-plug disk drive. Before you perform these procedures, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration.
9. Press F4 to list the available volume groups, then select the volume group name and press Enter. 10. Press F4 to select a physical volume, and follow the instructions on the screen to select the physical volume. Then press Enter. 11.
Disk drive replacement Attention: Before you perform these procedures, ensure that the customer has taken appropriate actions to back up the data for the drive you are removing, and that the drive has been removed from the configuration. Physically removing a hot-plug drive from the system before it has been removed from the system configuration can cause unrecoverable data corruption.
Configuring (mirroring) Attention: You do not have to power off the system to add a hot-plug disk drive. Before you perform these procedures, ensure that you have taken the appropriate actions to back up the data for the drive that you are mirroring. To install a hot-plug disk drive, perform the following steps: 1.
Processor book Before performing the following procedure, read “Safety notices” on page xv. Notes: 1. Installing a new processor book into your system might require the updating of the system firmware. 2. Before handling any card, board, or DIMM, touch any metal surface of the chassis with one hand to minimize static electricity discharge.
Processor book replacement You must first transfer the DIMMs from the old processor book to the new processor book: 1. Lay the old processor book, cover-side up, on a flat, static-protective surface. 2. From the bottom of the old processor book, lift the side access cover. Four hinges hold the cover to the top of the processor book.
Page 365
3. Align the processor-card connector with the docking connector located on the CEC backplane. 4. Carefully insert the processor assembly into the system. 5. As the processor connector is seated into its CEC backplane docking connector, begin lowering the processor-assembly release handles. This action correctly seats and locks the processor assembly to the CEC backplane.
DIMMs Before performing the following procedures, read “Safety notices” on page xv. Notes: 1. Adding an additional processor card or installing a new processor card into your system might require that you update the system firmware. 2. Before handling any card, drive, book, or DIMM, touch any metal surface of the chassis with one hand to minimize static-electricity discharge.
4. Pull the DIMM out of the connector as shown in the following illustration. 1 Connector tabs (unlocked position) 2 DIMM DIMM insertion To insert a DIMM, perform the following steps: 1. Before handling any card, board, or DIMM, touch any metal surface of the chassis with one hand to minimize static-electricity discharge.
Battery Before performing the following procedure, read “Safety notices” on page xv. CAUTION: A lithium battery can cause fire, explosion, or a severe burn. Do not recharge, disassemble, heat above 100 degrees C (212 degrees F), solder directly to the cell, incinerate, or expose cell contents to water.
Page 369
Note: Processor book 1 is the processor book that is closest to the power supplies. See the following illustration. 1 Service access cover 6 Front bezel 2 Processor book 1 7 Battery (positive (+) side facing up) 3 NAS Gateway 500 8 Media device support shelf 4 Diskette drive 9 Captive thumbscrew...
Page 370
12. Gently insert the new battery into its plastic mount. 13. Replace processor book 1 as described in “Processor book insertion” on page 344. 14. Replace the media device support shelf. Tighten the thumbscrew located on the support shelf. Note: When replacing the support shelf, ensure that the lower support leg activates the microswitch located on the CEC backplane.
Power supplies DANGER Do not attempt to open the covers of the power supply. Power supplies are not serviceable and are to be replaced as a unit. (D08) This system drawer has power supplies that have hot-swap capabilities. To use this feature, you must have two power supplies in place.
7. Lift the handle until it is perpendicular (90 degrees) to the top of the power supply. By placing the handle perpendicular to the top of the power supply, the base or hinged portion of each handle acts as a cam and will gently pry the power supply from its connector located on the CEC backplane. 8.
Page 373
3. Lower the power supply handles, carefully pressing the power supply into the connector. The spring latch located on the underside of each handle will snap into place, indicating that the power supply is seated correctly and locked into position. Note: Improper operation of the spring-activated portion of the power supply handles can result in the following: v The retractable spring remains compressed.
PCI riser card Attention: The PCI riser card is a FRU and should be replaced only by trained experienced service personnel. Note: Performing this procedure requires a considerable amount of force when removing or installing the riser card from its docking connectors. It is strongly recommended that the system drawer be removed from the rack.
Page 375
10. Record the slot number and location of each adapter. 1 Service access cover 7 Front bezel 2 PCI-X adapters 8 IDE CD-ROM 3 PCI riser card 9 Processor card assembly #2 4 NAS Gateway 500 10 Processor filler panel 5 Media device 11 Card separators 6 Media-support tray...
Page 376
16. Remove all seven hex-head screws and the two 5.5-mm nuts that secure the PCI riser card to the CEC backplane and chassis. 1 PCI riser card 2 Removal sling 3 CEC backplane 4 CEC backplane connectors to PCI riser card connectors 5 “Push”...
PCI riser card replacement To replace the PCI riser card, do the following: 1. Before handling any card, board, or DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling static-sensitive devices” on page 310. 2.
Page 378
12. Replace all removed media drives, ensuring that the media drives removed are returned to the same media bay locations from which they were removed. Refer to “Media drive replacement” on page 338. 13. Reconnect all power and signal cables to their respective media drives. 14.
Disk drive backplane Attention: The disk drive backplane is a FRU and should be replaced only by trained experienced service personnel. These procedures require you to remove and install the PCI riser card. Removing or Installing a PCI riser card from its docking connectors requires a considerable amount of force. It is strongly recommended that the system drawer be removed from the rack.
Page 380
18. Disconnect all power and signal cables connected to the disk drive backplane. 19. Remove the four retaining screws that secure the disk drive cage to the front of the system chassis. 20. Remove the two screws, located by the backplane card, that secure the disk drive cage to the chassis bottom.
Disk drive backplane replacement To replace the disk drive backplane, do the following: 1. Using the six thread-lock retaining screws removed earlier, secure the backplane to the disk drive cage. 2. Carefully place the disk-drive-cage assembly into the system. 3. Slide the disk-drive-cage assembly toward the front of the system. 4.
Operator panel Attention: The operator panel is a FRU and should be replaced only by trained experienced service personnel. Before performing the following procedure, read “Safety notices” on page xv. Operator panel removal To remove the operator panel, do the following: 1.
Operator panel replacement Attention: If you are replacing the operator panel with a new FRU, you must write the system VPD information (machine type, model number, and serial number) into the VPD module. You must contact the support center to obtain instructions and the necessary password to enter the information into the VPD module.
3. At the command prompt, type the code that accesses the hidden menus. If necessary, call your local support center to obtain the code. This menu is for IBM Authorized use only. If you have not been authorized to use this menu, please discontinue use immediately.
Page 385
5. Type the VPD serial number. Attention: The serial number must be entered correctly. Enter the last seven digits only. Do not include the dash (-) in the serial number as a digit. If the serial number is not entered correctly, a new operator panel must be ordered and installed.
Central electronics complex (CEC) backplane The PCI riser card is a FRU and should be replaced only by trained experienced service personnel. Before performing the following procedure, read the “Safety notices” on page xv. Note: To perform this procedure, you must remove and install the PCI riser card. Removing or installing the PCI riser card from its docking connectors requires a considerable amount of force.
Page 387
10. Remove the PCI-X adapters as described in “PCI-X adapter removal” on page 328. 1 Service access cover (drawer) 11 Front bezel 2 Primary and redundant PCI cooling fans 12 IDE CD-ROM drive 3 PCI cooling fan and stabilizing bracket 13 Operator panel 4 Processor cooling fans 14 Front power supply and processor support bracket...
Page 388
19. Remove the processor-cooling fans as described in “Fan assembly removal” on page 334. 20. Remove the processor-cooling-fan-cage assembly by completely removing the two hex-head screws located at the base of the cage. You have to open the processor-fan-access cover to see these two screws.
Page 389
the back chassis bulkhead. 1 CEC backplane 2 NAS Gateway 500 26. To remove the CEC from the chassis, lift the front edge of the CEC backplane and pull it out and away from the system chassis. 27. Put the CEC backplane in a safe place. Chapter 11.
CEC backplane replacement To replace the CEC backplane (system board), do the following: 1. Before handling any card, board, or DIMM, be sure to use your electrostatic discharge strap to minimize static-electric discharge. Refer to “Handling static-sensitive devices” on page 310. 2.
Page 391
19. Replace the front bezel as described in “Front bezel replacement” on page 315. 20. Replace the service access cover, as described in “Service access cover replacement” on page 314. 21. Push the system drawer back into the operating position as described in “Returning the NAS Gateway 500 to the operating position”...
Chapter 12. Parts information This chapter contains parts information for the NAS Gateway 500, identifying field replacement units (FRUs) and customer replacement units (CRUs). A CRU can be exchanged by a customer; however, a FRU can only be replaced by a trained service representative.
Page 395
Index FRU part Units per number number Yes/No assembly Description – Service access cover (drawer) 09P5865 Processor cooling fans 80P2341 Up to 6 PCI-slot filler – – Processor cooling fan cage and rear bulk head assembly 00P3055 1-port Ethernet SX PCI adapter 00P4289 2-port Ethernet TX PCI adapter 00P4290...
System parts, continued NAS Gateway 500 Service Guide...
Page 397
Index FRU part Units per number number Yes/No assembly Description 21P4312 Cable management arm assembly 00P2806 Cable carrier support bracket 00P2807 Rack flange mounting bracket 00P2736 Up to 2 Processor assembly, 1.45 GHz, 2-way 53P3226 Up to 16 512-MB DIMMs 53P3230 Up to 16 1-GB DIMMs...
System internal cables The following diagrams show the system cable connections. The first diagram illustrates the routing of the internal power cables. The second diagram illustrates the routing of the signal cables. Internal power cable routing diagram Index FRU part Units per number number...
Page 399
Internal signal cable routing diagram Index FRU part Units per number number Yes/No assembly Description 00P3210 Operator panel – – SCSI media device (50-pin) – – SCSI media device (68-pin) – – SCSI media device (68-pin) 09P5870 CEC backplane to operator panel signal and power cable –...
Page 400
Index FRU part Units per number number Yes/No assembly Description External SCSI port 09P5899 CEC backplane to rack indicator signal cable – – Rack indicator port – – CEC backplane to debug port signal cable – Debug parallel port NAS Gateway 500 Service Guide...
Power cable Wall plug power Cord PDU Power Index part number Cord P/N Country or region 1838574 1838573 Bahamas, Barbados, Bolivia, Brazil, Canada, Costa Rica, Dominican Republic, El Salvador, Ecuador, Guatemala, Guyana, Haiti, Honduras, Jamaica, Japan, Netherlands Antilles, Panama, Peru, Philippines, Taiwan, Thailand, Trinidad, Tobago, U.S.A., and Venezuela 86G7648 87G3880...
Chapter 13. Using the system Reading the operator panel display The operator panel display is used to: v Track the progress of the system unit self-tests and configuration program v Display codes when the operating system comes to an abnormal end v Display system messages Operator panel 3 4 5...
System attention LED The system attention LED on the operator panel is turned on when an entry is made in the service processor error log that gets transmitted to the system-level error logs. When the attention light comes on, examine these error logs to see if user intervention is required. If a hardware problem is indicated, call service support.
Page 405
The following sections identify the location, color and status of your system component LEDs. LEDs visible from the front of the NAS Gateway 500 Table 20. LEDs visible from the front of the NAS Gateway 500 Location Name Color Notes When flashing, the unit is in standby.
Page 406
Table 21. LEDs visible from the back of the NAS Gateway 500 (continued) Location Name Color Notes Third LED from the Ethernet 2 Green 10 Mbps – Off right below RIO connector mode 100 Mbps – On connector 1 indicator Fourth LED from the Ethernet 2 Green...
Page 407
LEDs visible from the top of the NAS Gateway 500 with the top cover removed Table 22. LEDs visible from the top of the NAS Gateway 500 with the top cover removed Location Name Color Notes If the AC power-on button is pressed, the LED comes on AC indicator Green solid.
Page 408
LEDs visible on the back of the PCI-X adapter cards Table 23. LEDs visible on the back of the PCI-X adapter cards Location Name Color Notes Green Link is active. Activity Green blinking Data activity FC 5706 10 Mbps FC 5701 Link speed Green 100 Mbps...
Appendix A. Getting help, service, and information If you need help, service, technical assistance, or just want more information about IBM products, you will find a wide variety of sources available from IBM to assist you. IBM maintains pages on the World Wide Web where you can get information about IBM products and services and find the latest technical information.
Access to hints, tips, and frequently asked questions v Access to escalation management 24 hours per day, seven days a week v Ability to authorize any number of knowledgeable IS technical staff who can submit problems to IBM Support Centers on your behalf Refer to your IBM software maintenance agreement for a full explanation of IBM’s warranty terms.
Getting help by telephone During the IBM product hardware or software warranty period, you may call the IBM Support Center (1 800 426-7378 in the U.S.) for product assistance covered under the terms of the IBM hardware or software warranty.
The service processor and Electronic Service Agent are designed to place little demand on an attached modem, thereby increasing the setup and connection success rates. Modem setup This section describes how to configure the IBM modems recommended for use with the Electronic Service Agent and the service processor. The recommended modems are:...
If Yes, use “Configuring the 7852-400 Modem” to set DIP switches on the modem and use modem configuration file modem_m0.cfg and then go to step 3. Note: The IBM 7852-400 modem has DIP switches on the right side of the unit. See “IBM 7852-400 DIP switch settings” on page 396 for the correct switch settings.
Page 415
CONFIGURATIONS D12 Select Factory 0 4. Press Enter to load the predefined factory configuration 0. IBM 7857 AT CMD aa " td_ rd_ dsr_ ec " ll_ 5. Press ↓ 7 times until the S-REGISTER message is displayed. S-REGISTER D7 6.
AIX environment they might have to be modified depending on the actual setup and configuration of your environment. IBM 7852-400 DIP switch settings If you are using a 7852-400 modem to enable Electronic Service Agent and service processor...
Page 417
Some modems, such as the IBM 7857-017, are not designed for the paging function. Although they can be used for paging, they return an error message when they do not get the expected response from another modem.
2. Copy the /usr/share/modems/modem_f.cfg file to a new file with a different name (for example, modem_fx.cfg). 3. In the new file (modem_fx.cfg), change the line Send "ATE0T\r" to Send "ATcccE0T\r" where ccc is the added command as specified in your modem manual, as follows: Change the third line of each of the following stanzas: v condout v condin...
If your system is particularly difficult to access physically, another strategy is to protect it with an Uninterruptible Power Source (UPS) and a phone-line surge protector. If recovery becomes necessary, shut down your system using established procedures. Disconnect the power cable and press the power button to drain capacitance while power is disconnected. Disconnect and reconnect modem power, and then reconnect system power to completely reinitialize your system.
Page 420
To set up a remote terminal session, dial into the service processor and start the NAS Gateway 500. After the operating system is loaded and initialized, the connection is dropped. At this point, call the NAS Gateway 500 and the operating system answers and offer you the login prompt. Prevention strategy The disconnect is caused by the operating system when it initializes the Primary Console.
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # The modem has configuration switches. They should be set to the # factory default settings, except switches 11 and 12.
Page 422
"19\r" or "13\r" or "12\r" or "1\r" busy "7\r" timeout 60 done disconnect: delay 2 # Separate from previous data. send "+++" # Assure command mode. delay 2 # Allow mode switching delay. send "ATH0T\r" # Set modem switch-hook down # (i.e., hang up).
Page 423
Sample file modem_m1.cfg # COMPONENT_NAME: (ESPSETUP) ENTRY SERVICE PROCESSOR SETUP modem_m1 # FUNCTIONS: Modem configuration file specifically for IBM 7857-017 modem with # Auto-Reliable feature. This feature must be turned off for Catcher calls. # This example uses the AT&F reset command to choose the factory defaults.
Page 425
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 426
done connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command.
Page 427
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 429
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 430
# &C1 Detect CD &D2 Respond to DTR (often the default) # %N Call-Out phone number Paging phone number # %S Modem speed (available to users) # Following are common responses from a wide range of modems: # 16, 15, 12, 10, 5 and 1 are connection responses. Add others as required. # 7=busy;...
Page 431
# (C) COPYRIGHT International Business Machines Corp. 1996 # All Rights Reserved # Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Appendix B. Modem configurations...
Page 432
# If the modem has configuration switches, they should be set to the # factory default settings. ICDelay 1 DefaultTO 10 CallDelay 120 # AT Attention Code Inserts delay in dialing commands # &F0 Reset. Restore profile 0 Q0 Turn on responses # E0 Turn echo off Turn off responses # V0 Use numeric responses...
Page 434
# Licensed Materials - Property of IBM # US Government Users Restricted Rights - Use, duplication or # disclosure restricted by GSA ADP Schedule Contract with IBM Corp. # If the modem has configuration switches, they should be set to the # factory default settings.
Page 435
connect: send "ATDT%N\r" # Tone dialing command. # %N from Call Home setup. # Expect a connection response. expect "16\r" or "15\r" or "14\r" or "12\r" or "10\r" or "5\r" or "1\r" busy "7\r" timeout 60 done retry: send "A/" # Repeat the previous command.
Appendix C. General attributes required when using a TTY terminal The following general attributes are the default settings for the diagnostic programs. Be sure your terminal is set to these attributes. Note: Set these attributes before the diagnostic programs are loaded. 3151 /11/31/41 3151 /51/61...
Additional communication attributes The following communication attributes are for the 3151, 3161, and 3164 terminals. 3151/11 Communication setup /31/41) 3151 /51/61, 3161 /3164 attributes settings settings settings Description Operating mode Echo Echo Echo Data entered from the keyboard on the terminal is sent to the system unit for translation and then sent back to the screen.
3151/11 /31/41 3151 /51/61 3161 /3164 Keyboard setup attributes settings settings settings Description New line The Return key generates the carriage return (CR) and the line feed (LF) characters. The line turnaround occurs after the CR and LF characters are generated.
Appendix D. PD data collection worksheets PD data collection worksheet Questions Responses General problem description 1. Problem definition 2. Time/date of failure 3. How was the problem detected? General problem determination questions a. Does problem exist on multiple machines? b. Was the error detected by a user? (for example, a usability problem) c.
PD data collection worksheet (continued) Data Procedure Results System configuration 1. General system information CLI prtconf a. Host name b. System model c. Machine serial d. Processor type e. Number of processors f. Processor clock speed g. CPU type h. Total memory size i.
Page 443
Data Procedure Results External SCSI device Only tape drive File serving network configuration Ethernet device driver level *** lslpp -l devices.pci.* | grep thernet Ethernet 1 configuration SMIT chdev: IP address Select Communications Static/DHCP Select Ethernet Auto negation on/off Select Adapter Link speed Select Change and show characteristics..
Page 444
Data Procedure Results Ethernet 5 configuration SMIT chdev: IP address Select Communications Static/DHCP Select Ethernet Auto negation on/off Select Adapter Link speed Select Change and show characteristics..Frame size Select Adapter to show Network interface (for example, ent0) properties Slot #___ Port #____ Physical destination (for example, Cisco switch) Ethernet 6 configuration...
For information on disposal of batteries, contact your local waste disposal facility. In the United States, IBM has established a collection process for reuse, recycling, or proper disposal of used IBM sealed lead acid, nickel cadmium, nickel metal hydride, and other battery packs from IBM Equipment.
Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead.
Page 450
v OS/2 v POWER4 v pSeries v RS/6000 v TotalStorage Microsoft and Windows are trademarks of Microsoft Corporation in the United States, other countries, or both. Other company, product, and service names may be trademarks or service marks of others. NAS Gateway 500 Service Guide...
Glossary Journaled File System List of abbreviations JFS2 Journaled File System 2 These abbreviations apply to this product, its LDAP Lightweight Directory Access Protocol environment, and associated products. Logical File System Central Electronics Complex logical partition CIFS Common Internet File System licensed program product common information model LPAR logical partition...
If you do not be accessed only by authorized users in authorized find the term you are looking for, see the IBM ways. Glossary of Computing Terms located at: access control list (ACL).
Page 453
See bits per second. orderable part that (a) has no type number, (b) is for buffer. See buffer storage. purchase only, and (c) does not receive normal IBM maintenance. buffer storage. (1) A special-purpose storage or storage area allowing, through temporary storage, the ACL.
Page 454
machine, the server the supplying machine. 2) This is not a common term to me. 3) If we do use this term, shouldn’t it be client/server model for consistency with DASD. See direct access storage device. client/server? data bus. A bus used to communicate data internally cluster.
Page 455
(LAN) that allows multiple access data rate of 100 Mbps. and manages contention by using carrier sense multiple feature code. A code used by IBM to process access with collision detection (CSMA/CD) as the hardware and software orders.
Page 456
File Transfer Protocol (FTP). In the Internet suite of a set of system-wide, shared resources that cooperate protocols, an application layer protocol that uses TCP/IP to guarantee essential services. and Telnet services to transfer bulk-data files between hertz (Hz). A unit of frequency equal to one cycle per machines or hosts.
Page 457
Internet Protocol (IP). A protocol that routes data loop. A closed unidirectional signal path connecting through a network or interconnected networks. IP acts input/output devices to a system. as an intermediary between the higher protocol layers LUN. See logical unit number. and the physical network.
Page 458
Peripheral Component Interconnect (PCI). A local bus for PCs from Intel that provides a high-speed data NetBIOS. A standard interface to networks, IBM path between the CPU and up to 10 peripherals (video, personal computers (PCs), and other compatible PCs. It disk, network, and so on).
Page 459
storage network. An arrangement that provides shared access to a set of logical unit numbers (LUNs) across one - n storage client networks. SAN. See storage area network. storage port. An engine’s connection point to a SCSI. See small computer system interface. storage client network.
Page 460
allows users of one host to log on to a remote host and virtual logical unit number (VLUN). A subset of a interact as directly attached terminal users of that host. logical drive. terabyte. For processor storage, real and virtual VLAN.
Index Numerics covers (continued) access cover (rack) 314 7852–400 modem, configuring 394 customer account setup menu 280 7857–017 modem, configuring 394 7858–336 modem, configuring 394 deconfigure memory 287 about this manual ix processor 287 access cover (rack) 314 deconfiguring replacement 314 disk drives 339 accessibility xiii diagnostics 243...
Page 462
fan assemblies 334 language selection menu 275 removal 334 laser compliance statement xvi replacement 335 laser safety information xvi fan locations 9 LCD progress indicator log 290 fan removal 334 LEDs 383 fan replace 335 component 384 features, system 2 operator panel 383 architecture, bus 2 loading diagnostics 243...
Page 463
menu (continued) passwords (continued) privileged general user 257 change 260 processor configuration/deconfiguration 269 privileged reboot/restart policy setup 266 change 260 ring indicate 265 physical location codes 17, 21 serial port selection 277 POST indicators 7 serial port speed setup 278 POST keys 7 setup 259 1 key 7...
Page 464
removal (continued) service position 312 processor book 343 service processor 255 static–sensitive devices 310 access system board 366 local 255 removal and replacement procedures 307 remote 256 battery, replacing 348 boot mode menu 267 bezels 315 call-in/call-out setup menu 276 bracket, release latch 321 call-out call-home 288 CD-ROM drive 337...
Page 465
system Web sites xiii, 389 resetting attention LED 37, 384 who should read this manual ix stopping 310 system error interrogation 36 system board 366 removal 366 replacement 370 system board locations 10, 11 system cables 27, 378 system error interrogation 36 system features 2 system firmware update messages 228 system firmware updates 283...
Page 467
Thank you for your responses. May we contact you? h Yes h No When you send comments to IBM, you grant IBM a nonexclusive right to use or distribute your comments in any way it believes appropriate without incurring any obligation to you. Name...
Page 468
Readers’ Comments — We’d Like to Hear from You Cut or Fold Along Line GY27-0418-00 Fold and Tape Please do not staple Fold and Tape _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ NO POSTAGE NECESSARY IF MAILED IN THE...
Need help?
Do you have a question about the TotalStorage NAS Gateway 500 and is the answer not in the manual?
Questions and answers