IBM xSeries 440 Troubleshooting Manual

Eserver type 8687

xSeries 440
Type 8687
Troubleshooting Guide

Summary of Contents for IBM xSeries 440

  • Page 1 440 Type 8687 Troubleshooting Guide...
  • Page 3: Troubleshooting Guide

    440 Troubleshooting Guide SC59-P651-50...
  • Page 4 Before using this information and the product it supports, be sure to read the general information in Appendix A, “Notices” on page 51. Second Edition (July 2002) © Copyright International Business Machines Corporation 2002. All rights reserved. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.
  • Page 5: Table Of Contents

    Major components of the xSeries 440 ........
  • Page 6 Index ............57 xSeries 440:Troubleshooting Guide...
  • Page 7: Safety

    Vor der Installation dieses Produkts die Sicherheitshinweise lesen. Prima di installare questo prodotto, leggere le Informazioni sulla Sicurezza. Les sikkerhetsinformasjonen (Safety Information) før du installerer dette produktet. Antes de instalar este produto, leia as Informações sobre Segurança. © Copyright IBM Corp. 2002...
  • Page 8 Turn everything OFF. To connect: First, attach all cables to devices. Attach signal cables to connectors. Attach power cords to outlet. Turn device ON. To disconnect: First, remove power cords from outlet. Remove signal cables from connectors. Remove all cables from devices.
  • Page 9 Statement 2: CAUTION: When replacing the lithium battery, use only IBM Part Number 33F8354 or an equivalent type battery recommended by the manufacturer. If your system has a module containing a lithium battery, replace it only with the same module type made by the same manufacturer.
  • Page 10 The device also might have more than one power cord. To remove all electrical current from the device, ensure that all power cords are disconnected from the power source.
  • Page 11 Statement 8: CAUTION: Never remove the cover on a power supply or any part that has the following label attached. Hazardous voltage, current, and energy levels are present inside any component that has this label attached. There are no serviceable parts inside these components.
  • Page 13: Chapter 1. Introduction

    Documentation CD. • Rack Installation Instructions This printed publication contains the instructions needed to install your server in a rack cabinet. This publication is also provided in PDF format on the IBM xSeries Documentation CD. • Safety Book This multilingual publication is provided in PDF format on the IBM xSeries Documentation CD.
  • Page 14: Server Controls And Indicators

    SCSI activity light: When this green light is on, it indicates that there is activity on the SCSI bus. Locator light: This blue light is used to help you locate other devices connected to the server. CD-ROM drive eject button: Push this button to release a CD-ROM drive from the server.
  • Page 15 CD eject button: Push this button to release a CD from the drive. CD-ROM drive activity light: When this light is on, it indicates that the CD-ROM drive is in use. Diskette drive eject button: Push this button to release a diskette drive from the server.
  • Page 16: Rear View

    [Illustration callouts: Power LED (green), Management port, 10/100 Ethernet port] • External power connector - This connector is not supported on this server. • Error LED - This amber light goes on when a system management error has occurred.
  • Page 17 • ASM interconnect port - Signal cables for managing expansion module resources are connected to this port. • Ethernet link light: This green light, located on the right of the Ethernet port, goes on when there is an active link connection on the Ethernet controller for the Ethernet port.
  • Page 18: Server Power Features

    3. Plug the server power cords into the power source. 4. Press the power-control button on the front of the server. Note: While the server is powering up, the power-on LED on the front of the server is lit.
  • Page 19: Turning Off The Server

    Turning off the server Complete the following steps to manually turn off the server: 1. Review the information in “Safety” on page v. 2. See your operating system documentation for the proper procedure to shut down the operating system. Statement 5: CAUTION: The power control button on the device and the power switch on the power supply do not turn off the electrical current supplied to the device.
  • Page 20: Standby Mode

    You might need to press and hold the power-control button for more than 4 seconds to cause an immediate shutdown of the operating system and to force it into Standby mode. You can use this feature if the operating system stops functioning.
  • Page 21: Major Components Of The Xseries 440

    Major components of the xSeries 440 The following illustration shows the locations of major components in your server. Note: The illustrations in this document might differ slightly from your hardware. [Illustration callouts: Retention bracket, DIMM access doors, SMP baffle, Fan 4, Heat-sink...]
  • Page 22: Center Plane Connectors And Leds

    Module, I/O board, and the Remote Supervisor Adapter. [Illustration callouts: Lightpath, SCSI power, Power (three connectors), Thumbscrews, Center plane error LED, PCI error LED, Lower SMP error LED, Upper SMP error LED, System management error LED, Power good LED, VRM error LED, I/O error LED]
  • Page 23: Smp Expansion Module Connectors And Lights

    SMP Expansion Module connectors and lights The following illustrations identify the connectors, switches, and lights on the SMP Expansion Module.
  • Page 24 [Illustration: SMP Expansion Module lights, showing an error LED and a VRM error LED for each of microprocessors 1 to 4]
  • Page 25: Pci-X Planar Internal Connectors And Leds

    PCI-X planar internal connectors and LEDs The following illustration identifies the internal connectors and LEDs on the PCI-X planar. This planar enables you to install adapters into the server. [Illustration callouts: PCI-X slots 2, 3, 4, 5, and 6...]
  • Page 26: I/O Board Internal Connectors

    The following illustration identifies the connectors and lights on the Remote Supervisor Adapter. [Illustration callouts: Lithium battery, Ethernet speed LED (green), Ethernet link LED (green), External power supply connector, Error LED (amber), Serial port (COM), Ethernet port (RJ-45), Power LED (green), ASM Interconnect port (RJ-14)]
  • Page 27: Chapter 2. Solving Problems

    After you register and profile your xSeries products, you can diagnose problems using the IBM Online Assistant and you can participate in the IBM discussion forum. For more detailed information about registering and creating a customized profile for your IBM products, visit the following addresses on the Web: —...
  • Page 28: Server Support

    Register and profile your server properly? After you register and profile, you will be able to: • Diagnose problems using the IBM Online Assistant • Participate in the IBM discussion forum • Receive e-mail notifications of technical updates related to your profiled products...
  • Page 29: Post

    POST When you turn on the server, it performs a series of tests to check the operation of server components and some of the options installed in the server. This series of tests is called the power-on self-test, or POST. If POST finishes without detecting any problems, a single beep sounds, and the first screen of your operating system or application program appears.
  • Page 30: Post Beep Codes

    RAM refresh verification has failed. 1-3-1 First 64 KB RAM test has failed. 1-3-2 First 64 KB RAM parity test has failed. Action: Reseat the memory modules or install a memory module. If the problem persists, call for service.
  • Page 31 Table 1. POST beep codes (continued) Columns: Beep code, Description, Action. 1-4-3: Interrupt vector loading test has failed. Action: Call for service. 2-1-1: Secondary DMA register test has failed. 2-1-2: Primary DMA register test has failed. 2-1-3: Primary interrupt mask register test has failed. 2-1-4: Secondary interrupt mask register test has failed.
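
The beep codes quoted from Table 1 lend themselves to a small lookup. The following is only a sketch: the dictionary and function names are hypothetical, and it contains just the codes whose descriptions appear in the excerpts above, not the full table from the guide.

```python
# Hypothetical lookup built only from the Table 1 rows quoted above.
# The complete table in the guide contains more codes than these.
POST_BEEP_CODES = {
    "1-3-1": "First 64 KB RAM test has failed.",
    "1-3-2": "First 64 KB RAM parity test has failed.",
    "1-4-3": "Interrupt vector loading test has failed.",
    "2-1-1": "Secondary DMA register test has failed.",
    "2-1-2": "Primary DMA register test has failed.",
    "2-1-3": "Primary interrupt mask register test has failed.",
    "2-1-4": "Secondary interrupt mask register test has failed.",
}

NOT_LISTED = "Not in this excerpt; see Table 1 in the guide."


def describe_beep_code(beeps):
    """Turn counted beep groups, e.g. [1, 3, 1], into (code, description)."""
    code = "-".join(str(count) for count in beeps)
    return code, POST_BEEP_CODES.get(code, NOT_LISTED)


if __name__ == "__main__":
    print(describe_beep_code([1, 3, 1]))
    print(describe_beep_code([3, 3, 3]))
```

Counting the beeps in each group and joining them with hyphens gives the same code format the table uses, so an unknown code falls through to a pointer back to the guide rather than a guess.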
  • Page 32: Post Error Messages

    Set the correct date and time. If the date and time are set correctly and saved, but the 163 error message reappears, call for service. You can use the server until the system is serviced, but any application programs that use the date and time will be affected.
  • Page 33 Table 2. POST error messages (continued) POST message Description A change in the memory configuration occurred. This message might appear after you add or remove memory. Notes: The server can be used with decreased memory capacity. If POST error message 289 also occurred, follow the instructions for that error message first. If you just installed or removed memory, run the Configuration/Setup Utility program;...
  • Page 34 A diskette drive configuration error occurred. Note: If you removed a diskette drive, make sure that the diskette drive setting is correct in the Configuration/Setup Utility program. If the setting is not correct, change it. If the problem persists, call for service.
  • Page 35 Table 2. POST error messages (continued) POST message Description 11xx An error occurred during the system-board serial port test. Note: If you have a modem, serial printer, or other serial device attached to your server, verify that the serial cable is connected correctly. If it is, use the following procedure: 1.
  • Page 36 1. Make sure that the I/O addresses for the PCI adapter and all other adapters are set correctly in the Configuration/Setup Utility program. 2. If the I/O port resource settings are correct, the PCI adapter might be defective. Call for service.
  • Page 37 Table 2. POST error messages (continued) POST message Description 00180300 A PCI adapter has requested a memory address that is not available, or the PCI adapter might be defective. Note: 1. Make sure that the memory addresses for all other adapters are set correctly in the Configuration/Setup Utility program.
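
Table 2 mixes exact message numbers (00180300) with family patterns (11xx). A minimal sketch of matching a reported code against such entries follows. The names are hypothetical, only the two descriptions quoted in the excerpts above are included, and treating each x as a single decimal digit is an assumption, not something the guide states.

```python
import re

# Hypothetical table holding only the Table 2 descriptions quoted above.
POST_MESSAGES = {
    "11xx": "An error occurred during the system-board serial port test.",
    "00180300": (
        "A PCI adapter has requested a memory address that is not available, "
        "or the PCI adapter might be defective."
    ),
}


def lookup_post_message(code):
    """Return the description whose pattern matches code, or None.

    Assumption: each 'x' in a pattern stands for exactly one decimal digit.
    """
    for pattern, description in POST_MESSAGES.items():
        regex = pattern.replace("x", r"\d")
        if re.fullmatch(regex, code):
            return description
    return None


if __name__ == "__main__":
    print(lookup_post_message("1101"))
    print(lookup_post_message("00180300"))
```

Codes that are not in the table return None, which keeps the sketch from implying a description the excerpt does not give.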
  • Page 38: Error Logs

    Start the Configuration/Setup Utility program; then, select Error Logs from the main menu. See "Using the Configuration/Setup Utility program" in the User’s Guide on the IBM Documentation CD. • Start the diagnostic programs; select Hardware Info from the top of the diagnostic programs screen;...
  • Page 39: Light Path Diagnostics Panel

    server again to be able to use the Light Path Diagnostics lights to help locate system errors. To view the lights on the various system boards: 1. Turn off the server and peripheral devices. 2. Press and hold the Light Path Diagnostics (blue) button on the diagnostics panel. The lights will be illuminated while the switch is pressed.
  • Page 40 Cause: One of the VRMs on the system board has failed. Action: Remove ac power from the server and then restart the server. Note: Wait 30 seconds before turning on the server. If the problem persists, have the system serviced.
  • Page 41 Table 3. Light Path Diagnostics (continued) Columns: Lit light on diagnostics panel, Cause, Action. Cause: An error occurred on a PCI bus. The system board caused the error. Action: Check the error log for additional information. If you cannot isolate the failing adapter from the information in the error log, try to determine the failing adapter by removing one adapter at a time from the failing PCI-X...
  • Page 42: Serverguide Problems

    Symptom: ...option is unavailable. Suggested action: Ensure that the NOS is supported on your server. If the NOS is supported, either there is no logical drive defined (ServeRAID systems) or the ServerGuide System Partition is not present. Run the ServerGuide setup and configuration program, and ensure that setup is complete.
  • Page 43: Small Computer System Interface (Scsi) Messages

    Small computer system interface (SCSI) messages If you receive a SCSI error message when running the SCSISelect Utility program, one or more of the following might be causing the problem: • A failing SCSI device (adapter, drive, controller) • An improper SCSI configuration •...
  • Page 44: Diagnostic Programs And Error Messages

    (fff) shown in the previous list. Result can be one of the following: Passed This result occurs when the diagnostic test is completed without any errors. Failed This result occurs when the diagnostic test discovers an error.
  • Page 45: Starting The Diagnostic Programs

    User Aborted This result occurs when you stop the diagnostic test before it is complete. Not Applicable This result occurs when you specify a diagnostic test for a device that is not present. Aborted This result occurs when the test could not proceed;...
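
The result values named in the text above map naturally onto an enumeration. This is a sketch under stated assumptions: the result names come from the guide, while the class name, the helper names, and the choice of which results warrant follow-up are hypothetical. The excerpt is truncated, so the guide may define further results that are not listed here.

```python
from enum import Enum


class DiagResult(Enum):
    """Result values quoted in the excerpt above (the list may be incomplete)."""

    PASSED = "Passed"
    FAILED = "Failed"
    USER_ABORTED = "User Aborted"
    NOT_APPLICABLE = "Not Applicable"
    ABORTED = "Aborted"


def parse_result(text):
    """Map the result text from a diagnostic message to a DiagResult.

    Raises ValueError for text that is not one of the quoted results.
    """
    return DiagResult(text.strip())


def needs_follow_up(result):
    # Assumption: only Failed and Aborted call for further action.
    return result in {DiagResult.FAILED, DiagResult.ABORTED}


if __name__ == "__main__":
    result = parse_result(" Failed ")
    print(result, needs_follow_up(result))
```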
  • Page 46: Viewing The Test Log

    Description (continued): test (where n is the slot number of the failing adapter). Action: Refer to the information provided with the adapter for instructions. If the problem persists, call for service. Description: SCSI controller on system board failed register/counter/power test. Action: Call for service.
  • Page 47 Columns: Code, Function, Result, Description, Action. Function: ServeRAID. Result: Aborted. Description: Test setup error: No ServeRAID adapter found on system board or PCI bus. Action: Make sure that the ServeRAID adapter is properly installed. If the problem remains, replace the ServeRAID adapter. If the problem persists, call for service.
  • Page 48 Description: VRM corresponding to microprocessor in socket id xyz is not installed (where xyz identifies the microprocessor whose VRM is causing the error message). Action: Install a VRM. If the problem persists, call for service.
  • Page 49 Columns: Code, Function, Result, Description, Action. Function: RIOG port (also called: RXE Expansion port). Result: Failed. Description: Ping rate failure; Receive threshold exceeded; Transmit threshold exceeded; Connection error. Action: Verify that cables are connected correctly and securely, and try again. If the problem persists, call for service. Function: Scalability port. Result: Failed. Description: Ping rate failure...
  • Page 50 Description: DIMMs in location DIMM n (where n is the number of the socket that contains the failing DIMM). Action: 1. Reseat the failing DIMM. 2. If the problem remains, replace the DIMM. If the problem persists, call for service.
  • Page 51 Columns: Code, Function, Result, Description, Action. Function: Processor cache. Result: Aborted. Description: Test setup error: BIOS cannot access VPD information; Test setup error: Corrupt DMI BIOS. Information in BIOS is not as expected. Action: If your server does not have the latest level BIOS code installed, update the BIOS code to the latest level and run the diagnostic...
  • Page 52 (where n is the number of the device and m is the adapter number) See the information that is provided with the tape drive. If the problem persists, call for service.
  • Page 53 Code Function Result Description Action Keyboard Failed On system board keyboard test failed. 1. Verify that the keyboard cable is connected. 2. If the problem remains, replace the keyboard. Note: After installing a USB keyboard, you might need to use the Configuration/Setup utility to Enable keyboardless operation...
  • Page 54: Recovering Bios Code

    Use the ServerGuide program to make a BIOS flash diskette. • Download a BIOS flash diskette from the World Wide Web. Go to http://www.ibm.com/pc/support/, click IBM Server Support, and make the selections for your server. • Contact your IBM service representative.
  • Page 55: Troubleshooting Charts

    1. Turn off the server and peripheral devices and disconnect all external cables and power cords; then, remove the cover. 2. Locate the BIOS code page jumper (J28) on the I/O board. Jumper (J28) 3. Move the jumper from pins 1 and 2 to pins 2 and 3 to enable the BIOS back-up image.
  • Page 56 Symptom: ...pointing-device problems. All or some keys on the keyboard do not work. Suggested action: Make sure that the keyboard cable is properly connected to the server. Make sure that the server and the monitor are turned on. Try using another keyboard. If the items above are correct, call for service.
  • Page 57 If the problem remains, call for service. Monitor problems. Symptom: Testing the monitor. Suggested action: Some IBM monitors have their own self-tests. If you suspect a problem with your monitor, see the information that comes with the monitor for adjusting and testing...
  • Page 58 To prevent diskette drive read/write errors, be sure the distance between monitors and diskette drives is at least 76 mm (3 in.). Non-IBM monitor cables might cause unpredictable problems. An enhanced monitor cable with additional shielding is available for the 9521 and 9527 monitors.
  • Page 59: Troubleshooting An Ethernet Controller

    Table 5. Troubleshooting charts (continued) Columns: Symptom, Suggested action. Power problems. Symptom: The server does not power on. Suggested action: Verify that: The power cables are properly connected to the server. The electrical outlet functions properly. The type of memory installed is correct. If you just installed an option, remove it, and restart the server. If the server now turns on, you might have installed more options than the power supply supports.
  • Page 60: Before You Call

    Although interrupt sharing is allowed for PCI devices, some devices do not function well when they share an interrupt with a dissimilar PCI device. Try changing the IRQ assigned to the Ethernet adapter or the other device. If the problem remains, call for service.
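
Before changing an IRQ, it helps to see which devices actually share one. The sketch below assumes you have already copied the device-to-IRQ assignments by hand, for example from the Configuration/Setup Utility program, into a dictionary. The function name, device names, and IRQ numbers are all hypothetical.

```python
from collections import defaultdict


def shared_irqs(assignments):
    """Group devices by IRQ and return only the IRQs used by two or more.

    assignments maps a device name to its IRQ number, as noted by hand.
    """
    by_irq = defaultdict(list)
    for device, irq in assignments.items():
        by_irq[irq].append(device)
    return {irq: sorted(devices) for irq, devices in by_irq.items() if len(devices) > 1}


if __name__ == "__main__":
    # Illustrative values only; they do not describe a real xSeries 440.
    example = {"Ethernet adapter": 11, "SCSI adapter": 11, "Video": 9}
    print(shared_irqs(example))
```

Any IRQ that appears in the result is a candidate for reassignment when the Ethernet adapter shares it with a dissimilar device.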
  • Page 61: Getting Help And Technical Assistance

    If you need help, service, or technical assistance or just want more information about IBM products, you will find a wide variety of sources available from IBM to assist you. This appendix contains information about where to go for additional information about IBM and IBM products, what to do if you experience a problem with your xSeries or ®...
  • Page 62 Hardware service and support You can receive hardware service through IBM Integrated Technology Services or through your IBM reseller, if your reseller is authorized by IBM to provide warranty service. Go to http://www.ibm.com/planetwide/ for support telephone numbers. In the U.S. and Canada, hardware service and support is available 24 hours a day, 7 days a week.
  • Page 63: Appendix A. Notices

    Web sites. The materials at those Web sites are not part of the materials for this IBM product, and use of those Web sites is at your own risk.
  • Page 64: Trademarks

    Lotus, Lotus Notes, SmartSuite, and Domino are trademarks of Lotus Development Corporation and/or IBM Corporation in the United States, other countries, or both. Intel, Celeron, LANDesk, MMX, NetBurst, Pentium, Pentium II Xeon, Pentium III Xeon, and Xeon are trademarks of Intel Corporation in the United States, other countries, or both.
  • Page 65: Important Notes

    IBM makes no representations or warranties with respect to non-IBM products. Support (if any) for the non-IBM products is provided by the third party, not IBM. Some software may differ from its retail version (if available), and may not include user manuals or all program functionality.
  • Page 66: Industry Canada Class A Emission Compliance Statement

    This product is in conformity with the protection requirements of EU Council Directive 89/336/EEC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a nonrecommended modification of the product, including the fitting of non-IBM option cards.
  • Page 67: Japanese Voluntary Control Council For Interference (Vcci) Statement

    Japanese Voluntary Control Council for Interference (VCCI) statement Power cords For your safety, IBM provides a power cord with a grounded attachment plug to use with this IBM product. To avoid electrical shock, always use the power cord and plug with a properly grounded outlet.
  • Page 68 IBM power cord part Used in these countries and regions number 14F0033 Antigua, Bahrain, Brunei, Channel Islands, China (Hong Kong S.A.R.), Cyprus, Dubai, Fiji, Ghana, India, Iraq, Ireland, Kenya, Kuwait, Malawi, Malaysia, Malta, Nepal, Nigeria, Polynesia, Qatar, Sierra Leone, Singapore, Tanzania, Uganda, United Kingdom,...
  • Page 69: Index

    32 log 26 messages 32 error log, viewing 26, 34 error messages diagnostic 32, 34 POST 20 SCSI 31 Ethernet troubleshooting information 46 expansion enclosure problems 43 FCC Class A notice 53 hardware
  • Page 70 Light Path Diagnostics table 27 memory problems 44 messages diagnostic error 32, 34 diagnostic text 32 POST error 20 SCSI error 31 microprocessor problem 44 monitor problems 44 mouse problems 44 network connection problems 46 notes, important 53 notices
  • Page 71 electronic emission 53 FCC, Class A 53 option problems 45 PCI-X internal connectors 13 pointing device problems 44 POST (Power-on self test) beep codes 15, 17 error log 26 error logs 15 error messages 15, 20 POST (power-on self-test) overview 17 power problems 46 power cords 55...
  • Page 72 52 troubleshooting charts 15, 42 Ethernet controller 46 tools 15 turning on the server 6 United States electronic emission Class A notice 53 United States FCC Class A notice 53 Universal Serial Bus (USB) problems 46
  • Page 74 Part Number: 59P6515 Printed in U.S.A.
