Page of 307
Download Table of ContentsContents Print This PagePrint Bookmark

HP 1850R - ProLiant - 128 MB RAM Troubleshooting Manual

Proliant server troubleshooting guide.
Hide thumbs
HP Servers Troubleshooting Guide
January 2003 (Seventh Edition)
Part Number 161759-007

Advertising

   Related Manuals for HP 1850R - ProLiant - 128 MB RAM

   Summary of Contents for HP 1850R - ProLiant - 128 MB RAM

  • Page 1

    HP Servers Troubleshooting Guide January 2003 (Seventh Edition) Part Number 161759-007...

  • Page 2

    Microsoft, Windows, and Windows NT are trademarks of Microsoft Corporation in the U.S. and other countries. Intel, Pentium, and Itanium are trademarks of Intel Corporation in the U.S. and other countries. UNIX is a trademark of The Open Group in the U.S. and other countries. Hewlett-Packard Company shall not be liable for technical or editorial errors or omissions contained herein.

  • Page 3: Table Of Contents

    Contents About This Guide Who Should Use This Guide..................... xi How to Use This Guide ....................xii Key Terms ........................xii Symbols in Text....................... xiii Reader’s Comments ......................xiii HP Resources ........................xiii Chapter 1 Diagnosing the Problem Developing a Troubleshooting Plan ................1-2 Preparing to Troubleshoot the Server................

  • Page 4: Table Of Contents

    Contents Chapter 2 Hardware Problems Important Safety Information ..................2-2 Symbols on Equipment .....................2-2 Rack Stability ......................2-4 Power Problems .......................2-5 Power Source......................2-5 Power Supply ......................2-6 System Power Module....................2-6 Uninterruptible Power Supply...................2-7 System Short Circuit ....................2-8 Redundant Line Cord Switch ..................2-9 General Hardware Problems ..................2-10 Loose Connections ....................2-10 Cabling Order......................2-10 New Hardware......................2-11...

  • Page 5: Table Of Contents

    Contents Network and Modem Problems..................2-41 Network Controllers....................2-41 Modems ........................2-43 Chapter 3 Software Problems Operating Systems......................3-2 Operating System Problems..................3-2 Operating System Updates..................3-5 Restoring to a Backed-Up Version ................3-5 When to Reconfigure or Reload Software ..............3-6 Application Software.......................

  • Page 6: Table Of Contents

    Contents Power-On Self-Test .......................4-18 BIOS Serial Console ....................4-18 Server Blade Health Driver....................4-18 Chapter 5 Error Recovery ROMPaq Disaster Recovery ....................5-2 Redundant ROM Image Recovery ................5-4 Automatic Server Recovery-2..................5-4 Unattended Recovery ....................5-5 Attended Recovery ....................5-6 Hardware Requirements ....................5-6 Health Driver ........................5-9 Initiating Diagnostics ....................5-10 Initiating the Operating System................5-14 ASR-2 Password Security....................5-15...

  • Page 7: Table Of Contents

    Contents Power Cord Considerations ..................6-9 Power Redundancy Considerations ................ 6-10 Preventing Grounding Problems ................... 6-10 Preventing Temperature Problems ................6-10 Preventing Damage to Removable Drives ..............6-11 Preventing Electrostatic Damage .................. 6-12 Preventing Cable Damage ..................... 6-12 Preventing Tape Drive Errors..................6-13 Appendix A HP Resources Online Resources......................

  • Page 8

    Contents 300 Series POST Error Messages ................. C-25 400 Series POST Error Messages ................. C-27 500 Series POST Error Messages ................. C-27 600 Series POST Error Messages ................. C-28 800 Series POST Error Messages ................. C-30 900 Series POST Error Messages ................. C-36 1100 Series POST Error Messages ................

  • Page 9

    Contents 2-15 Audio Problems ...................... 2-37 2-16 Printer Problems ..................... 2-37 2-17 Mouse/Keyboard Problems..................2-38 2-18 Network Controller Problems ................. 2-41 2-19 Modem Problems ....................2-43 Operating System Problems..................3-2 Recovery Software....................3-7 Application Software Problems ................3-8 Remote ROM Upgrade Errors ................3-10 Operating System Driver Delivery .................

  • Page 10

    Contents 300 Series POST Error Messages ................C-25 400 Series POST Error Messages ................C-27 500 Series POST Error Messages ................C-27 600 Series POST Error Messages ................C-28 800 Series POST Error Messages ................C-30 C-10 900 Series POST Error Messages ................C-36 C-11 1100 Series POST Error Messages ................

  • Page 11: About This Guide, Who Should Use This Guide

    About This Guide This guide provides troubleshooting information for ProLiant and TaskSmart servers. For convenience, this guide includes a complete list of Power-On Self-Test (POST) error messages, Diagnostics test error codes, Integrated Management Log (IML) event list error messages, and Array Diagnostic Utility (ADU) error messages. IMPORTANT: The chapters in this guide provide information for multiple servers.

  • Page 12: How To Use This Guide

    About This Guide How to Use This Guide To learn and use proper troubleshooting methods, follow the procedures described throughout Chapter 1, which helps you isolate the problem and refers you to the part of this guide containing the information necessary to solve the problem. To immediately find help for the specific problem you are troubleshooting, refer to “Locating Troubleshooting Information”...

  • Page 13: Symbols In Text

    About This Guide • Server setup utility—a utility designed to set up and configure your server, including ROM-Based Setup Utility (RBSU), System Configuration Utility (SCU), and BIOS Setup Utility. Symbols in Text These symbols may be found in the text of this guide. They have the following meanings.

  • Page 14

    Diagnosing the Problem This chapter covers the steps that you are recommended to take when an error occurs. Going through a structured set of tasks helps you to isolate the problem quickly. IMPORTANT: This guide provides information for multiple servers. Some of the hardware or software information may not apply to your specific server.

  • Page 15: Developing A Troubleshooting Plan

    — Gather the documentation that shipped with your server. Server-specific user documentation is also located on the following website: www.compaq.com/support/servers Select your server, and then look in the Manuals section. — Compile a master plan to be sure that you manipulate one variable at a time.

  • Page 16: Preparing To Troubleshoot The Server, Preparing The Server For Diagnosis, Using A Troubleshooting Methodology

    Diagnosing the Problem Preparing to Troubleshoot the Server Before troubleshooting, follow the steps to prepare the server for diagnosis. Also, read the proper troubleshooting procedures to increase troubleshooting effectiveness. Preparing the Server for Diagnosis Before troubleshooting the server: 1. Record any error messages displayed by the system. 2.

  • Page 17

    Diagnosing the Problem Table 1-1: Troubleshooting Methodology What to Check Troubleshooting Method What are the results of each Look for and record new troubleshooting step? symptoms, such as error messages or informational messages. Were the results logical, consistent, and expected? Did anything change? If so, what? Check system logs.

  • Page 18: Gathering Information, Preliminary Information

    Diagnosing the Problem Gathering Information If you encounter a problem with your server, follow the guidelines in this section and record your findings in a notebook. Having these details available reduces troubleshooting time. This information also helps the authorized service provider to diagnose and solve your problem, if their assistance is used.

  • Page 19: When The Server Does Not Start

    Diagnosing the Problem When the Server Does Not Start The following visual and audio clues indicate that the server is not starting: • The LEDs are off. • The fans are not spinning. • Something seems, looks, or sounds wrong or different. •...

  • Page 20

    Diagnosing the Problem Table 1-2: When the Server Does Not Start continued What to Check What to Do Check for incorrect system settings: Refer to your server-specific user documentation to verify switch • Are switches set correctly? settings. Check for faulty power delivery: Refer to “Power Source”...

  • Page 21

    Diagnosing the Problem Table 1-2: When the Server Does Not Start continued What to Check What to Do Check for Processor Power Module Refer to “Processor Power (PPM) problems: Modules” in Chapter 2. • Has a PPM failed and forced the server into a reset condition? Check for automatic server Be sure that the server is not...

  • Page 22

    Diagnosing the Problem When a ProLiant BL Server Does Not Start Use the information in Table 1-3 to troubleshoot problems with a ProLiant BL server. Table 1-3: When a ProLiant BL Server Does Not Start What to Check What to Do Check the enclosure(s): Check for connection problems: Refer to:...

  • Page 23

    Diagnosing the Problem Table 1-3: When a ProLiant BL Server Does Not Start continued What to Check What to Do Check for power supply problems: Refer to: • • Is each power supply fan “Power Supply” in spinning? Chapter 2. •...

  • Page 24

    Diagnosing the Problem Table 1-3: When a ProLiant BL Server Does Not Start continued What to Check What to Do Check each server blade: Check for power delivery problems: Refer to your server-specific user documentation for LED • Do all appropriate LEDs information.

  • Page 25: When The Self-tests Fail

    Diagnosing the Problem When the Self-Tests Fail This section provides steps to follow if the system starts, but fails to complete the self-tests without error. The following visual and audio clues indicate that the system is not completing the self-tests: •...

  • Page 26

    Diagnosing the Problem Table 1-4: When the Self-Tests Fail continued What to Check What to Do Check the system configuration: • Refer to your server-specific user Are all required switch documentation. settings set correctly? What is the system Run the Inspect Utility. Refer to: configuration for the: •...

  • Page 27

    Diagnosing the Problem Table 1-4: When the Self-Tests Fail continued What to Check What to Do Check for system failures: • Refer to “Loose Connections” in Be sure that all expansion Chapter 2. boards, drives, and processors are firmly seated and that all latches are firmly closed.

  • Page 28: When The Operating System Does Not Load

    Diagnosing the Problem When the Operating System Does Not Load This section provides steps to follow if the server starts and completes the self-tests without error, but encounters errors while loading the operating system. Make note of the following information before following the steps in this section: •...

  • Page 29

    Diagnosing the Problem Table 1-5: When the Operating System Does Not Load continued What to Check What to Do • Run Diagnostics. Were there any test errors? Refer to: • “Diagnostics” in Chapter 4 • Appendix B, “Test Error Codes” Check for any incorrect, conflicting, or out-of-date software versions: •...

  • Page 30

    Diagnosing the Problem Table 1-5: When the Operating System Does Not Load continued What to Check What to Do If your server uses EFI boot manager, check the EFI settings: • Refer to your server-specific user Is the operating system documentation for more configured as the default information.

  • Page 31: Locating Troubleshooting Information

    Diagnosing the Problem Locating Troubleshooting Information This section is a reference to help you locate detailed troubleshooting information in the remainder of this guide, and refers you to other guides if necessary. For other resources, refer to Appendix A, “HP Resources.” Table 1-6: Locating Troubleshooting Information Problem Location of information...

  • Page 32

    Diagnosing the Problem Table 1-6: Locating Troubleshooting Information continued Problem Location of information Mouse/keyboard does not work. “Mouse and Keyboard” in Chapter 2 • A peripheral device is not working. “General Hardware Problems” in Chapter 2 • “External Device Problems” in Chapter 2 Server LEDs are on, and you are Your server-specific user...

  • Page 33: Contacting Hp, Contacting Technical Support Or Authorized Reseller

    Diagnosing the Problem Contacting HP Contact HP only if you have followed the procedures described in this guide and still have not fixed the problem with your server. Contacting Technical Support or Authorized Reseller If you have exhausted the information in this guide, you can obtain further information and help from the following locations.

  • Page 34: Server Information You Need

    Diagnosing the Problem Server Information You Need Before contacting HP, obtain the following: • All information from the “Gathering Information” section in this chapter. • A printed copy of the system and operating environment information and a copy of any historical data that might be relevant. To obtain this information, run the Inspect utility (if supported by your server), described in “Inspect Utility”...

  • Page 35: Operating System Information You Need

    Diagnosing the Problem Operating System Information You Need Be sure that you have the following operating system information available before contacting your authorized service provider. If possible, gather this information about the last working version and the current version. IMPORTANT: This section provides general operating system information for several servers. Some information covered may not apply to your specific server.

  • Page 36

    An updated Emergency Repair Diskette • If HP drivers are installed: — Version of the Compaq Support Paq (CSP), ProLiant Support Pack (PSP), or Support Software Diskette (SSD) used — List of drivers from the CSP, PSP, or SSD •...

  • Page 37

    Diagnosing the Problem Linux Operating Systems Collect the following information: • Operating system distribution and version Look for a file named /etc/distribution-release (for example, /etc/redhat-release) • Kernel version in use • Output from the following commands (performed by root): lspci -v uname -a cat /proc/meminfo cat /proc/cpuinfo...

  • Page 38

    A list of the drivers and NLMs used on your server, including the names, versions, dates, and sizes • If HP drivers are installed: — Version of the Compaq Support Paq (CSP), ProLiant Support Pack (PSP), or Support Software Diskette (SSD) used — List of drivers from the CSP, PSP, or SSD •...

  • Page 39

    Diagnosing the Problem Caldera and SCO Operating Systems Collect the following information: • Installed system software versions (TCP/IP, VP/Ix) • Process status at time of failure, if possible • Printouts of: Output of command /etc/hwconfig Output of command /usr/bin/swconfig Output of command /etc/ifconfig /etc/conf/cf.d/sdevice...

  • Page 40

    Diagnosing the Problem IBM OS/2 Operating Systems Collect the following information: • Operating system version number and printouts of: IBMLAN.INI PROTOCOL.INI CONFIG.SYS STARTUP.CMD SYSLEVEL information in detail TRAPDUMP information (if a TRAP error occurs) • A directory listing of: C:\OS2 C:\OS2\BOOT HPFS386.INI (for Advanced or Advanced with SMP) •...

  • Page 41

    Diagnosing the Problem • A list of all third-party hardware and software installed, with versions • A detailed description of the problem and any associated error messages NOTE: If your configuration includes SQL server, refer to the “SQL Server” section in this chapter.

  • Page 42

    Diagnosing the Problem Banyan VINES Operating Systems Collect the following information: • Operating system versions, revisions, and installed patches • The Banyan list of installed boards. Software settings should match your hardware configurations. • Number of servers on the network •...

  • Page 43: Hardware Problems

    Hardware Problems This chapter provides steps for troubleshooting common hardware problems. IMPORTANT: This guide provides information for multiple servers. Some of the hardware or software information may not apply to your specific server. You may need to modify some of the examples or procedures in this guide for your work environment.

  • Page 44: Important Safety Information, Symbols On Equipment

    Hardware Problems Important Safety Information This section is a reference which explains the safety information placed on equipment and in your server-specific user documentation. Familiarize yourself with the warnings in this section to reduce the potential risks involved in handling equipment.

  • Page 45

    Hardware Problems This symbol indicates the presence of a hot surface or hot component. If this surface is contacted, the potential for injury exists. WARNING: To reduce the risk of injury from a hot component, allow the surface to cool before touching. These symbols, on power supplies or systems, indicate that the equipment is supplied by multiple sources of power.

  • Page 46: Rack Stability

    Hardware Problems Rack Stability Before working with racks, be sure that you understand the following warning and caution information, which may appear in your server-specific user documentation. WARNING: To reduce the risk of personal injury or damage to the equipment, be sure that: •...

  • Page 47: Power Problems, Power Source

    Hardware Problems Power Problems Power Source Use the information in Table 2-1 to troubleshoot power source problems. Table 2-1: Power Source Problems Problem Possible Cause Possible Solution Power is not working Power On/Standby Be sure that the switch is on. properly or power-related switch If your server has a Power On/Standby...

  • Page 48: Power Supply, System Power Module

    Hardware Problems Power Supply Use the information in Table 2-2 to troubleshoot power supply problems. Table 2-2: Power Supply Problems Problem Possible Cause Possible Solution Power is not working Power supplies are not Check for loose connections. Refer to properly or power-related connected properly.

  • Page 49: Uninterruptible Power Supply

    Hardware Problems Table 2-3: System Power Module Problems Problem Possible Cause Possible Solution SPM- or power SPMs are not installed Be sure that each SPM is installed with a supply-related errors are properly. corresponding power supply, as shown in occurring your server-specific user documentation.

  • Page 50: System Short Circuit

    Hardware Problems Table 2-4: UPS Problems continued Problem Possible Cause Possible Solution UPS will not start. Line cord is not Connect the line cord. connected. Circuit breaker or fuse is Push the circuit breaker button or replace open. the fuse. If this occurs repeatedly, contact an authorized service provider.

  • Page 51: Redundant Line Cord Switch

    Hardware Problems CAUTION: Never operate the server for an extended period of time if an access panel is removed. Doing so can cause thermal damage to drives and components and can void your system warranty. Perform the following steps: 1. Remove the appropriate access panel, and then check the system-interlock or system-interconnect status LEDs.

  • Page 52: General Hardware Problems, Loose Connections, Cabling Order

    Hardware Problems General Hardware Problems WARNING: To avoid potential problems, ALWAYS read the warnings and cautionary information in your server-specific user documentation before removing, replacing, reseating, or modifying system components. Loose Connections If a device does not work, it is often because the device is not properly connected or not properly seated.

  • Page 53: New Hardware

    Hardware Problems New Hardware To troubleshoot problems that occur after you add hardware to the server: IMPORTANT: Always obtain a printed copy of Inspect data before and after adding, removing, or repositioning any board or options. Refer to “Inspect Utility” in Chapter 4. 1.

  • Page 54: General Devices

    Hardware Problems General Devices Third-Party Devices If a third-party device does not work, it may be because the device is not supported by your server or lacks up-to-date drivers. • Refer to your device and server -specific user documentation to be sure that your server and operating system support the device.

  • Page 55: Rack Environment

    Hardware Problems 4. If you are testing a board (or a device that connects to a board): a. Test the board with all other boards removed. b. Test the server with only that board removed. 5. Move the device to a different slot on the same bus. If the board works, either the original slot is bad or the board was not properly seated.

  • Page 56: Internal System Problems, Processors

    Hardware Problems Internal System Problems Processors Use the information in Table 2-5 to troubleshoot processor problems. WARNING: To avoid potential problems, ALWAYS read the warnings and cautionary information in your server-specific user documentation before removing, replacing, reseating, or modifying system components. Table 2-5: Processor Problems Problem Possible Cause...

  • Page 57

    Hardware Problems Table 2-5: Processor Problems continued Problem Possible Cause Possible Solution Processor problems are Server has multiple processors Remove all processors occurring (continued). installed, and one or more but one from the server, processors are faulty. replacing them with processor terminator boards or blanks, if applicable to your server.

  • Page 58: Processor Power Modules

    Hardware Problems Table 2-5: Processor Problems continued Problem Possible Cause Possible Solution Processor problems are Processors are incorrectly Be sure that each processor is occurring (continued). installed in your server. correctly installed in your server, as detailed in your server-specific user documentation.

  • Page 59: Pci Or Pci-x Hot Plug, Memory

    Hardware Problems To check for failure if your server has redundant PPMs, briefly remove the appropriate access panel to inspect for the following: • If the failure LED is lit, then one of the voltage regulators on the redundant PPM has failed.

  • Page 60

    Hardware Problems Table 2-6: Memory Problems Problem Possible Cause Possible Solution Out of memory. Memory configuration Check the application documentation may not be set up to determine the memory correctly or you have run configuration requirements. out of memory to run the Check for operating system errors.

  • Page 61

    Hardware Problems Table 2-6: Memory Problems continued Problem Possible Cause Possible Solution Memory count error Be sure that the memory modules occurs. have been installed correctly. Refer to your server-specific user documentation. Be sure that the memory modules are properly seated. Run the server setup utility.

  • Page 62: Fans

    Hardware Problems Fans General Fan Problems Use the information in Table 2-7 to troubleshoot general fan problems. Table 2-7: General Fan Problems Problem Possible Cause Possible Solution Thermal and fan Fans are improperly Follow your server-specific user problems are occurring. seated.

  • Page 63

    Hardware Problems Table 2-7: General Fan Problems continued Problem Possible Cause Possible Solution Thermal and fan Fans are not functioning Be sure that all fans are properly seated problems are occurring properly. and working. (continued). Replace any required nonfunctioning fans and restart the server. Refer to your server-specific user documentation for specifications on fan requirements.

  • Page 64: Drive Problems, Diskette Drives

    Hardware Problems Drive Problems Diskette Drives Use Table 2-9 to troubleshoot diskette drive problems. Table 2-9: Diskette Drive Problems Problem Possible Cause Possible Solution Diskette drive light stays Diskette is damaged. Run your diskette utility on the diskette (CHKDSK on some systems). Diskette is incorrectly Remove the diskette and reinsert inserted.

  • Page 65: Cd-rom And Dvd Drives

    Hardware Problems Table 2-9: Diskette Drive Problems continued Problem Possible Cause Possible Solution Diskette drive cannot Diskette is not formatted. Format the diskette. write to a diskette. Diskette is Use another diskette or remove the write write-protected. protection. You are attempting to Check the drive letter in your path write to the wrong drive.

  • Page 66: Hard Drives

    Hardware Problems Table 2-10: CD-ROM and DVD Drive Problems continued Problem Possible Cause Possible Solution CD-ROM or DVD drive is CD-ROM or DVD drive is Refer to the documentation included with not detected. not properly connected. the drive. Connect cables correctly. Cable or cables are bad.

  • Page 67

    Hardware Problems Table 2-11: Hard Drive Problems continued Problem Possible Cause Possible Solution Hard drive is not Array is not configured Run your array configuration utility. recognized by server for the hard drive. (continued). There is a conflict with On non-hot plug hard drives, check for another hard drive.

  • Page 68

    Hardware Problems SCSI Non-Hot-Plug Hard Drives If the SCSI device is not recognized, be sure that it is configured correctly. Refer to the documentation provided with the SCSI device. HP ships non-hot-plug hard drives set to ID 0 and CD-ROM drives set to ID 5. •...

  • Page 69

    Hardware Problems CAUTION: Before removing a hot-plug drive, refer to “Unsafe Hot-Plug Drive Replacement Precautions” in this chapter to avoid potential data loss. IMPORTANT: It is recommended that disk drive replacement be performed during low-activity periods whenever possible. In addition, a current valid backup should be available for the logical drives in the array of the drive being replaced, even if drive replacement is being made during server downtime.

  • Page 70

    Hardware Problems Unsafe Hot-Plug Drive Replacement Precautions Be aware of the following guidelines cautioning unsafe hot-plug replacement: • Do not remove a drive if any other member of the array is offline (the online LED is off). No other drive in the array can be hot-plugged without data loss. The possible exception to this might be the utilization of RAID 0+1 as a fault-tolerant form.

  • Page 71

    Hardware Problems • Do not replace more than one drive at the same time (for example, when the system is off), since the fault tolerance can be compromised. CAUTION: Replacing more than one drive at the same time compromises the fault tolerance.

  • Page 72

    Hardware Problems Predictive Failure Alert The predictive failure alert is a powerful problem-prevention tool that warns you when the system has determined that a drive failure is imminent. This alert allows you to proactively schedule downtime for maintenance and not interrupt critical business operations that rely on your server.

  • Page 73: Tape Drives

    Hardware Problems • You must follow the server cabling guidelines when configuring your array to implement the best possible cabling solution for your server. Refer to your server-specific user documentation for step-by-step instructions. • Check for cabling configurations that are not supported. Signal integrity errors can be injected into the SCSI bus when an active drive is hot-plugged.

  • Page 74

    Hardware Problems Table 2-12: Tape Drive Problems continued Problem Possible Cause Possible Solution Server cannot write to A lost leader is dropped. Check every tape cartridge that has been tape (continued). used in the DLT drive. After you locate the bad cartridges, dispose of them. As soon as a good tape drive is installed with a bad cartridge, the tape drive will drop its leader, indicating that it needs...

  • Page 75

    Hardware Problems Table 2-12: Tape Drive Problems continued Problem Possible Cause Possible Solution DLT drive does not read Tape is not compatible Check your tape specifications to be sure tape (continued). with the drive. that the tape is designed to work with the drive.

  • Page 76

    Hardware Problems DAT Drives Use Table 2-13 to troubleshoot common DAT drive problems. Table 2-13: DAT Drive Problems Problem Possible Cause Possible Solution DAT drive sense error There is a drive error. DAT drive sense error codes are detailed codes are displayed. in the whitepaper Troubleshooting DAT Drives.

  • Page 77: External Device Problems, Video

    Hardware Problems External Device Problems Video When you first start the server, the monitor should display the server logo. Use Table 2-14 to troubleshoot video problems. Table 2-14: Video Problems Problem Possible Cause Possible Solution Screen is blank for more Monitor is not powered Power up the monitor and check if the than 60 seconds after...

  • Page 78

    Hardware Problems Table 2-14: Video Problems continued Problem Possible Cause Possible Solution Screen is blank for more The power-on password is Press any key or type your password, than 60 seconds after enabled. and then wait a few moments for the you power up the server screen to activate.

  • Page 79: Audio, Printers

    Refer to your server-specific connected. user documentation. You can access the documentation from the following website: www.compaq.com/support/servers Select your server, and then look in the Manuals section. Server does not have Continue booting the system. Some internal speakers.

  • Page 80: Mouse And Keyboard

    Hardware Problems Table 2-16: Printer Problems continued Problem Possible Cause Possible Solution Printer output is garbled. The correct printer Install the correct printer drivers for your drivers for your application. application are not installed. Mouse and Keyboard Use Table 2-17 to troubleshoot mouse and keyboard problems. Table 2-17: Mouse/Keyboard Problems Problem Possible Cause...

  • Page 81: Diagnostic Adapter

    Hardware Problems Table 2-17: Mouse/Keyboard Problems continued Problem Possible Cause Possible Solution Mouse does not respond System is not responding If the device or port is bad, there should to movement, or and must be restarted. be a POST error. If there is a POST keyboard does not work error, swap another PS/2 mouse or (continued).

  • Page 82: Integrated Management Display, Hot-plug Management Module

    Hardware Problems On some servers, operation of the Diagnostic Adapter with both USB and PS/2 devices simultaneously connected to the server can cause errors within the operating system including an inadvertent shutdown. If this occurs, switch the PS/2 devices on your system to USB devices.

  • Page 83: Network And Modem Problems, Network Controllers

    Use Table 2-18 to troubleshoot common problems with network controllers. To be sure that you are using the latest drivers and support files before installing your network controllers, access the support information on the product website on the following website: www.compaq.com/support Table 2-18: Network Controller Problems Problem Possible Cause...

  • Page 84

    Hardware Problems Table 2-18: Network Controller Problems continued Problem Possible Cause Possible Solution Network controller stops The cable is not securely Be sure that the cable is secured to the working when an connected. network connector and that the other end expansion board is of the cable is securely connected to the added.

  • Page 85: Modems

    Hardware Problems Modems Use Table 2-19 to troubleshoot modem problems. Table 2-19: Modem Problems Problem Possible Cause Possible Solution There is no dial tone. Cable is not plugged in or is Be sure that the cables are plugged not plugged in correctly. in as specified in the modem documentation.

  • Page 86

    Hardware Problems Table 2-19: Modem Problems continued Problem Possible Cause Possible Solution There is no response There may be a conflicting port Reconfigure the COM port address for when you type address. the modem. AT commands. Be sure that the communications software is set to the COM port to which the modem is connected.

  • Page 87

    Hardware Problems Table 2-19: Modem Problems continued Problem Possible Cause Possible Solution The modem does Auto-answer may not be Enable this option in the not answer an enabled. communications software. incoming call. An answering machine is Turn off the answering machine. connected and is answering -Or- before the modem can.

  • Page 88

    Hardware Problems Table 2-19: Modem Problems continued Problem Possible Cause Possible Solution You are receiving Your baud rate is too high for Check the maximum baud rate for the connection errors. the line you are trying to modem to which you are connecting, access.

  • Page 89: Software Problems

    Software Problems The best sources of information for software problems are your operating system and application software documentation, which may also point to fault detection tools that report errors and preserve your system configuration. Other useful resources include the Survey utility, available for servers running a Microsoft Windows, Linux, or Novell NetWare operating system, and Insight Manager 7 (and previous versions).

  • Page 90: Operating Systems, Operating System Problems

    Software Problems Operating Systems Operating System Problems Use Table 3-1 to troubleshoot operating system problems. Table 3-1: Operating System Problems Problem Possible Solution Operating system locks Scan for viruses with an updated virus scan utility. Errors are displayed in Follow the information provided in the error log, the error log.

  • Page 91

    Software Problems Table 3-1: Operating System Problems continued Problem Possible Solution The operating system For Microsoft Windows NT and does not load, and it Windows 2000 installations, note whether was factory installed. the factory installation halted after you (continued) pressed Ctrl+Alt+Del to display the logon screen and logged on as Administrator.

  • Page 92

    Software Problems Table 3-1: Operating System Problems continued Problem Possible Solution You have a factory- If you are unable to bind your NICs during the installed Novell Protocols Interview, your packet receive buffers NetWare 5 operating may not be set high enough. Toggle over to the system and you are console during the Protocols Interview and adjust unable to bind your...

  • Page 93: Operating System Updates, Restoring To A Backed-up Version

    4. Power up your server. If you apply the update and have problems, refer to the following website: www.compaq.com/support/files/server Select your server and operating system to find SoftPaq files to correct these problems. Restoring to a Backed-Up Version If you recently upgraded the operating system or software and cannot resolve the problem, you can try restoring a previously saved version of the system.

  • Page 94: When To Reconfigure Or Reload Software

    Software Problems When to Reconfigure or Reload Software If all other options have not resolved your problem, consider reconfiguring the system. Before you take this step: 1. Weigh the projected downtime against the time spent troubleshooting intermittent problems. It may be advantageous to start over by removing and reinstalling the problem software, or in some cases by using the System Erase Utility and reinstalling all system software.

  • Page 95

    Software Problems Table 3-2: Recovery Software Operating Systems Software Tool Microsoft Windows NT, Emergency Repair Diskette. Refer to your Windows 2000, and Windows NT, Windows 2000, or BackOffice BackOffice Small Small Business Server documentation for Business Server more information. HP servers shipped with factory-installed software include Start menu access to the Emergency Repair Disk Utility.

  • Page 96: Application Software

    Software Problems Application Software Use Table 3-3 to troubleshoot operating system or application software problems. Table 3-3: Application Software Problems Problem Possible Solution Software locks up. Power down the computer and peripheral devices. Reboot the computer. Scan for viruses with an updated virus scan utility.

  • Page 97: Clustering Software, Remote Rom Flash Problems

    Monitor that collects information on cluster configurations. Refer to the following website for a number of technical documents relating to clusters: www.compaq.com/highavailability Remote ROM Flash Problems During the remote ROM flash process, problems can occur, causing the ROM upgrade on the target system to fail. Some of these failures may be due to user error, while others may be due to an incorrect system configuration or a functional hardware failure.

  • Page 98

    Also be sure that you are following the instructions for the Remote ROM Flash procedure, located on the following website: www.compaq.com/support/files/server Select the family, model, and operating system of your server. If you continue to encounter errors, refer to Table 3-4, which describes the possible failures that can occur during this procedure, their symptoms, and possible corrective actions.

  • Page 99

    Flash utility. To see if your system is the program exits. supported: Access the following website: www.compaq.com/support/files/ server Select the family, model and operating system of your server. Look under the Utilities heading for Remote ROM Flash Utility.

  • Page 100: Maintaining Current Drivers

    Depending on the operating system, drivers are available through various methods: • Individual download • Driver Updates (DUs) • Compaq Support Paqs (CSPs) or ProLiant Support Packs (PSPs) • Support Software Diskettes (SSDs) • Extended Feature Supplements (EFSs) Check the SoftPaq download website to find these driver files: www.compaq.com/support/files/server...

  • Page 101

    SSDs are available for other versions of Microsoft Windows operating systems. Linux Linux drivers are available for individual download. Also, check the following website: www.compaq.com/products/servers/linux/linux- drivers.html Novell NetWare CSPs and PSPs are available for servers running the latest versions of Novell NetWare.

  • Page 102: Erasing The System

    Software Problems Erasing the System CAUTION: Perform a backup before running the System Erase Utility. All data and configuration information on your existing server is erased by the System Erase Utility. This utility sets the system to its original factory state, deleting the existing hardware configuration information (including array setup and disk partitioning) and erasing all connected hard drives completely.

  • Page 103

    Software Problems • Run from diskette a. Power up the server using the SmartStart CD. b. Click Create Support Software on the menu screen. c. The next screen displays a message asking if you want to create the software from CD only or from an integration server. Select Create Software from CD Only.

  • Page 104: Diagnostic Tools

    Diagnostic Tools The diagnostic tools for ProLiant servers were developed to assist you in diagnosing problems, as well as in testing, monitoring, and managing your server. To find out which diagnostic tools are provided with your server, refer to your server-specific user documentation.

  • Page 105: Array Diagnostic Utility

    Diagnostic Tools Array Diagnostic Utility Array Diagnostic Utility (ADU) is a Windows-based tool designed to run on all ProLiant servers that support HP array controllers and are running SmartStart 4.10 or later. The two main functions of ADU are to collect all possible information about the array controllers in the system and to generate a list of detected problems.

  • Page 106: Enterprise Diagnostics Lx32

    Diagnostic Tools Diagnostics Diagnostics is a diagnostic utility that tests the operation of your hardware. If problems are found, Diagnostics isolates the failure down to the replaceable part, whenever possible. Several versions of Diagnostics are available, including the latest version, Enterprise Diagnostics LX32, as well as previous versions for 32- and 64-bit systems.

  • Page 107

    Diagnostic Tools • Tests—provides the capability to test functionality of all the major pieces of hardware in the server, including the following tests: — Quick test—provides a predetermined script where each piece of hardware is exercised and requires no user intervention. This test script is designed to run quickly.

  • Page 108: Diagnostics On A 32-bit System

    Diagnostic Tools Accessing Enterprise Diagnostics LX32 To access Enterprise Diagnostics LX32: 1. Insert the SmartStart CD into the CD-ROM drive. 2. Restart the system using the SmartStart CD. 3. Select Server Diagnostics from the Maintenance tab on the SmartStart main menu.

  • Page 109: Diagnostics On A 64-bit System

    Diagnostics diskette in drive A. A Diagnostics diskette can be made by running the Server Diagnostics file available for download from the following website: www.compaq.com/support/files/server Select your server and operating system, and then select the Server Diagnostics file from the Utilities section on the website.

  • Page 110

    Diagnostic Tools Select your server and operating system, and then select the Server Diagnostics file from the Utilities section on the website. 2. Execute the downloaded file and it will unzip to the Diagnostics64 folder on your hard drive. 3. Insert an LS-120 diskette in your LS-240 drive. Copy all files in the Diagnostics64 directory to this diskette, but do not copy the Diagnostics64 folder itself.

  • Page 111: Server Management

    The server management tools are available on the Management CD, which ship with 32-bit servers, or the following website: www.compaq.com/support/files/server Select your server and operating system from the drop-down menus. Server Management on 64-bit Servers Neither Insight Manager 7 nor previous versions of Insight Manager can be run directly on a 64-bit server.

  • Page 112: Survey Utility, Running Survey Utility

    The Survey Utility can be installed from the SmartStart CD (if available for your server), the Management CD, Integration Maintenance Utility, or from a SoftPaq downloaded from the following website: www.compaq.com/support/files/server Refer to the Management CD for information on installing and running the Survey Utility.

  • Page 113: Diagnostic Station, Diagnostic Adapter, Inspect Utility, Running Inspect On A 32-bit System

    Diagnostic Tools Diagnostic Adapter Available for some ProLiant BL servers, the Diagnostic Adapter provides keyboard, mouse, monitor, and serial connection access to an individual server blade. This allows you to troubleshoot a specific server blade using the Integrated Administrator and other server diagnostic functionality. Refer to your server-specific user documentation for more information.

  • Page 114: Running Inspect On A 64-bit System

    Diagnostics diskette in drive A. A Diagnostics diskette can be made by running the Server Diagnostics file available for download from the following website: www.compaq.com/support/files/server Select your server and operating system, and then select the Server Diagnostics file from the Utilities section on the website.

  • Page 115: Integrated Administrator, Integrated Lights-out, Integrated Management Log

    Diagnostic Tools Integrated Administrator Available for ProLiant BL e-Series servers, the Integrated Administrator is a centralized management and monitoring system that acts as a combination terminal server and remote power controller, enabling out-of-band, secure, serial console connections to all server blades in the enclosure. The Integrated Administrator provides enclosure health, server blade health, and remote server manageability, and is a useful tool for diagnosing problems with your server.

  • Page 116: List Of Events

    Diagnostic Tools Events listed in the IML are categorized as one of four event-severity levels: • Status indicates that the message is informational only. • Repaired indicates that corrective action has been taken. • Caution indicates a nonfatal error condition. •...

  • Page 117

    Diagnostic Tools **001 of 010** ---caution--- 03/19/2002 12:54 PM FAN INSERTED Main System Location: System Board Fan ID: 03 **END OF EVENT** WARNING: To avoid potential problems, ALWAYS read the warnings and cautionary information in your server-specific user documentation before removing, replacing, reseating, or modifying system components.

  • Page 118

    Diagnostic Tools Table 4-1: Event Messages continued Event Message Event Type Action Uncorrectable Memory Uncorrectable error Replace the memory Error (Slot X, Memory module. If that does Module Y) not resolve the problem, replace the Uncorrectable Memory memory board. Error (System Memory) Uncorrectable Memory Error (Memory Module...

  • Page 119

    Diagnostic Tools Table 4-1: Event Messages continued Event Message Event Type Action PCI Bus Error PCI Bus Error (Slot X, Expansion bus error Replace the PCI Bus Y, Device Z, board. Function x) Automatic Server Recovery-2 (ASR-2) ASR Lockup Detected: System lockup Examine the Cause...

  • Page 120

    Diagnostic Tools Table 4-1: Event Messages continued Event Message Event Type Action System AC Power Power supply overload Switch the voltage Overload (Power from 110 V to 220 V or Supply X) add an additional power supply (if applicable to your system).

  • Page 121: Power-on Self-test, Bios Serial Console, Server Blade Health Driver

    Diagnostic Tools Power-On Self-Test Power-On Self-Test (POST) is a series of diagnostic tests that runs automatically on ProLiant and TaskSmart servers when the system is powered up. POST checks firmware and assemblies to ensure that the system is functioning properly. If POST finds an error in the system, an audible beep (if supported by your system), visual message, or both indicate an error condition.

  • Page 122

    Error Recovery Error recovery tools allow you to restore your server in case of error. This chapter describes the procedures for ROMPaq disaster recovery, which allows you to flash corrupt system ROM, and rapid error recovery tools, such as Automatic Server Recovery-2 (ASR-2), which are able to diagnose and quickly restore your system in the case of system error.

  • Page 123: Rompaq Disaster Recovery

    Error Recovery ROMPaq Disaster Recovery Any server that does not have a valid ROM image may utilize the following process. IMPORTANT: This operation should be performed on a server with redundant ROM only if both ROM images have been corrupted. If only one image is corrupted, read the “Redundant ROM Image Recovery”...

  • Page 124

    Error Recovery The server analyzes the diskette, searching for the latest ROM image. If the diskette is not in place, the system continues to beep until a valid ROMPaq diskette is inserted. NOTE: If your server has an IMD, it displays the Please insert ROMPaq message even after a valid ROMPaq diskette has been installed.

  • Page 125: Redundant Rom Image Recovery, Automatic Server Recovery-2

    Error Recovery Redundant ROM Image Recovery If power is interrupted during the flash process, a server with redundant ROM boots as usual, but the inactive (redundant) ROM image is corrupted. A Power-On Self-Test (POST) error message is displayed: 105 – Current System ROM is corrupt – now booting redundant System ROM In this instance, you are required to flash again for the invalid image to become valid.

  • Page 126: Unattended Recovery

    Error Recovery • On servers running the System Configuration Utility (SCU): a. Run SCU. b. Select the System Configuration menu, and then select Configure Hardware. c. Select View or Edit Details. d. Set the software error recovery status to Enabled. e.

  • Page 127: Attended Recovery, Hardware Requirements

    Error Recovery Attended Recovery For attended recovery, ASR-2 executes the following functions: NOTE: This feature is only available on servers that use the System Configuration Utility. Attended recovery is not available on systems that use RBSU. • Logs the error information to the IML •...

  • Page 128

    Error Recovery If you are notified that ASR-2 has restarted the server and initiated the Diagnostics and Utilities menu, use Insight Manager 7 (or previous version) to view the critical error in the IML. Run Diagnostics to diagnose and resolve the problem.

  • Page 129

    Error Recovery Table 5-1: Automatic Server Recovery-2 (ASR-2) Features Features Definition Software error When enabled, ASR-2 is activated if the recovery operating system stops responding, resulting in a lockup. Software error This feature determines how long the server recovery time-out waits before enabling ASR-2 after an operating system lockup.

  • Page 130: Health Driver

    For additional information about the ASR-2 flow of events, refer to your server maintenance and service guide on the on the following website: www.compaq.com/support/servers Select your server to display the reference library. HP Servers Troubleshooting Guide...

  • Page 131: Initiating Diagnostics

    Error Recovery Initiating Diagnostics Initiating Diagnostics is only supported on systems that use the System Configuration Utility. When a critical error occurs and you have enabled ASR-2 to initiate Diagnostics, the operating system-specific Health Driver logs critical error information in the IML, and ASR-2 restarts the server. When the system reinitializes, ASR-2 starts Diagnostics from the hard drive.

  • Page 132

    Error Recovery The System Configuration Utility settings should resemble the settings in Table 5-2 when you enable ASR-2 to initiate Diagnostics. Table 5-2: System Configuration Utility Settings for Booting into Diagnostics Option Setting Description Serial interface COM1, COM2 Select the communications port for the modem used by the pager and the remote ASR-2 functions.

  • Page 133

    Error Recovery Table 5-2: System Configuration Utility Settings for Booting into Diagnostics continued Option Setting Description Dial-out string 555-1234 Enter the dial string followed by the remote computer’s telephone number. Network status Enabled To allow network access to Diagnostics, set Network Status to Enabled and be sure that the Reset Boot option is set to Boot Diagnostics.

  • Page 134

    Error Recovery Table 5-2: System Configuration Utility Settings for Booting into Diagnostics continued Option Setting Description Network IP address xxx.xxx.xxx.xxx Enter the IP address for this server in standard dot notation. NOTE: The network IP address is not used if you select Custom for Network controller.

  • Page 135: Initiating The Operating System

    Error Recovery Initiating the Operating System Initiating the operating system is only supported on systems that use the System Configuration Utility. When you enable ASR-2 to restart the operating system and a critical error occurs, ASR-2 logs the error in the IML and restarts the server. The system then executes the normal restart process.

  • Page 136: Asr-2 Password Security, Asr-2 Integrated Management Log Messages

    Error Recovery Table 5-3: System Configuration Utility Settings for Initiating the Operating System continued Option Settings Network IP address xxx.xxx.xxx.xxx Network IP net mask xxx.xxx.xxx.xxx Network IP router xxx.xxx.xxx.xxx address ASR-2 Password Security The standard HP password features function differently during ASR-2 than during a typical system startup.

  • Page 137

    Error Recovery The IML identifies and records all of the errors described in Table 5-4. Table 5-4: Automatic Server Recovery-2 (ASR-2) IML Messages Message Description Abnormal Program The operating system encountered an abnormal Termination situation that caused a system failure. ASR-2 detected by An ASR-2 activity was detected and logged by the system ROM.

  • Page 138

    Error Recovery Table 5-4: Automatic Server Recovery-2 (ASR-2) IML Messages continued Message Description NMI – ASR-2 Timer The operating system received notice of an Expiration impending ASR-2 timer expiration. NMI – Expansion A board on the expansion bus indicated an error Board Error condition, resulting in a server failure.

  • Page 139: Automatic Revision Tracking

    Error Recovery Automatic Revision Tracking Reviewing the changes that have been made to the server configuration can resolve some errors. The server has an Automatic Revision Tracking (ART) feature that helps you review recent configuration changes. One ART feature is the Revision History Table, which contains the hardware version number of the system board and any other system boards providing ART-compatible revision information.

  • Page 140: Storage Fault Recovery Tracking, Storage Automatic Reconstruction, Network Interface Fault Recovery Tracking

    Error Recovery Storage Fault Recovery Tracking This feature tracks more than 12 failure-indication parameters for SCSI drives, such as time-out, spin-up, and self-test errors. You can use these parameters to pinpoint failed storage subsystem components and to recover from controller or hard drive failures.

  • Page 141

    Error Prevention This chapter provides information to help you avoid future problems. While many of the pointers provided are common-sense suggestions, these prevention tasks are too important to overlook. You will receive tips on: • Preparing for changes • Minimizing the impact of changes by using HP tools •...

  • Page 142: Preparing For Changes

    Error Prevention IMPORTANT: This guide provides information for multiple servers. Some of the hardware or software information may not apply to your specific server. You may need to modify some of the examples or procedures in this guide for your work environment. Refer to your server-specific user documentation for information on procedures, hardware options, software tools, and operating systems supported by, and specific to, your server.

  • Page 143: Minimizing The Impact Of Changes By Using Hp Tools, Managing Updates Through Integration Servers

    Error Prevention • Be sure that you allow enough time to make the changes. • Check for potential device conflicts before adding a device. • If you are adding a processor, keep the terminator board. If you have a processor problem, you can restore the server with the remaining processors.

  • Page 144: Installing Servers Consistently Through Replicated Install, Maintaining Server Software

    Error Prevention Installing Servers Consistently Through Replicated Install When initially setting up a server, SmartStart can access the integration server as the source of the system software instead of the system software CDs. SmartStart on the target server connects to the integration server during the Assisted Integration or Replicated Install interview process.

  • Page 145: Using A Methodology

    Error Prevention Minimizing Updates Stay aware of the latest software updates for your operating system and applications. Weigh the risks of updating versus the need for the changes. If you update the software, it may be harder to restore your system after a failure. On the other hand, you should update software for fixes that you require.

  • Page 146

    Error Prevention • Scan for viruses weekly. Use the latest virus-scanning utilities available to be sure that your data is not corrupted. • Keep historical data. You will not know that the CPU utilization has increased 50 percent if you do not know what it was initially. If you have problems, you can use the data to compare before and after scenarios.

  • Page 147: Visually Checking Your Server

    Error Prevention • Restock spare parts as they are used. • Do not clean card edge connectors with erasers; it removes the gold, causes static discharge, and leaves residue. If connectors have to be cleaned, use isopropyl alcohol or a special cleaning solution applied with a cotton-tipped swab. Visually Checking Your Server Periodically you should look at the following items on your server.

  • Page 148: Recognizing Power Problems Caused By Acts Of Nature, Preventing Power Management Problems

    Error Prevention Recognizing Power Problems Caused by Acts of Nature Some power problems are caused by acts of nature, which can range from lightning and excessive heat to ice, rain, and windstorms. Lightning can cause spikes and surges. A spike is a quick impulse of undesirable high voltage on a power line, typically lasting only a fraction of a second, whereas a surge is a sudden increase in line voltage of short duration.

  • Page 149: General Power Requirements, Power Consumption Considerations, Power Supply Considerations, Power Cord Considerations

    To obtain the most accurate power capacity and assessment of power margin, use the power calculator provided on the ActiveAnswers website at activeanswers.compaq.com/aaconfigurator Power Supply Considerations After you determine the appropriate amount of power for your server, install the power supplies needed for the level of redundancy you require.

  • Page 150: Power Redundancy Considerations, Preventing Grounding Problems, Preventing Temperature Problems

    Error Prevention Power Redundancy Considerations If available for your server, power redundancy protects your server from power failures caused by one of the following: • Power failure in one of the two AC circuits providing power to the server • Accidentally unplugging one of the power cords providing power to the server •...

  • Page 151: Preventing Damage To Removable Drives

    Error Prevention For more information on the temperature requirements of your server, refer to your server-specific user documentation. CAUTION: To reduce the risk of damage to the equipment when installing third-party options: • Be sure that the option equipment does not impede airflow to the rack-mountable ProLiant servers or increase the internal rack temperature beyond the specified maximum rating.

  • Page 152: Preventing Electrostatic Damage, Preventing Cable Damage

    Error Prevention Preventing Electrostatic Damage Many electronic components are sensitive to electrostatic discharge (ESD). Circuitry design and structure determine the degree of sensitivity. Networks built into many integrated circuits provide some protection, but in many cases the discharge contains enough power to alter device parameters or melt silicon junctions. A sudden discharge of static electricity from your finger or other conductor can destroy static-sensitive devices or micro-circuitry.

  • Page 153: Preventing Tape Drive Errors

    Error Prevention Preventing Tape Drive Errors Often tape drive problems are caused by defective or substandard media. The following are recommended cleaning procedures: • Clean the media every eight hours. This is much more frequent than the manufacturer’s recommendation of “once every 25 hours,” but this practice reduces the number of device failures.

  • Page 154

    HP Resources This appendix contains additional resources that may assist you in troubleshooting a particular problem. The resources covered in this chapter fall under the following categories: • Online resources • Software and option resources • General server resources • ProLiant BL e-Class system resources NOTE: For additional resources, contact your authorized service provider.

  • Page 155: Online Resources

    ROM images. Server-specific user The set of documents that ship Access the following website: documentation with a server, including the www.compaq.com/support/servers setup and installation guide, the maintenance and service guide, and select your server. and the hardware configuration and installation poster.

  • Page 156

    An online solutions service that Access the following website: solutions provides a set of tools, forums, www.compaq.com/ActiveAnswers and information for HP servers. Ask Compaq A search engine, powered by Access the following website: Ask Jeeves, that finds askq.compaq.com/askserver information on ProLiant servers.

  • Page 157

    Where to go Resource Paqs Tools, utilities, and information Access the following website: for HP servers. Available for www.compaq.com/partners systems running Microsoft Windows NT, Windows 2000, Click Microsoft, Novell, or Linux, Novell NetWare, or Linux depending on your operating system, operating systems.

  • Page 158

    HP Resources Table A-2: Software and Option Resources continued Resource What it is Where to go Management CD The latest tools available for Run the Management CD shipped with easily managing the server, your server. such as Insight Manager 7, For more information, refer to the Management Agents, and Management CD user documentation...

  • Page 159

    HP Resources Table A-2: Software and Option Resources continued Resource What it is Where to go BIOS Setup Utility Provided for some servers, this If BIOS Setup Utility is supported by utility allows you to modify your server, access the utility by server configuration settings, pressing the F10 key when prompted facilitating such functions as...

  • Page 160

    HP Resources Table A-2: Software and Option Resources continued Resource What it is Where to go Integrated Lights-Out Provides server health and Refer to the Integrated Lights-Out User (iLO) remote server manageability, Guide. If your server supports iLO, also accessible from a network client refer to your server-specific user using a supported Web browser.

  • Page 161

    HP Resources Table A-2: Software and Option Resources continued Resource What it is Where to go Remote Insight Allows remote monitoring of If your server supports this option, refer Board/PCI servers with features including to your server-specific user guide. dedicated LAN, with optional modem or serial connection, server management through a standard browser interface, on-...

  • Page 162

    General Server Resources Table A-3: General Server Resources Type of Information Document Where to go Additional product Product Information Access the following website: information www.compaq.com/products/servers Device driver Server Software Download Access the following website: information Center www.compaq.com/support/files External cabling Cabling information...

  • Page 163

    (for factory-installed operating systems) Operating system TechNotes Access the following website: integration with the www.compaq.com/support/ server platform techpubs/technotes Operating system Operating system support Access the following website: version support matrix ftp.compaq.com/pub/products/servers/ os-support-matrix-310.pdf Overview of server...

  • Page 164

    (for servers which support SmartStart) Specifications for Maintenance and service guide Access the following website: server and options, www.compaq.com/support/servers symbols, installation warnings and notices and select your server. Switch settings; LED Hood labels, Server-specific Hood labels inside the access panels functions;...

  • Page 165

    HP Resources ProLiant BL e-Class System Resources Table A-4: ProLiant BL e-Class System Resources Type of Information Document Where to go Configuration and ProLiant BL e-Class Integrated Documentation CD in the server management Administrator User Guide documentation box (where available) information for enclosure software Configuration and...

  • Page 166: Test Error Codes

    Test Error Codes This appendix contains a list of generated Diagnostics test error codes for every version of Diagnostics except Enterprise Diagnostics LX32. For more information on Diagnostics, refer to “Diagnostics” in Chapter 4. The following table shows the organization of the test error code series within this chapter: Table B-1: Test Error Codes 100 Series...

  • Page 167: Primary Processor Test Error Codes

    Test Error Codes 100-199, Primary Processor Test Error Codes The 100 series of diagnostic error codes identifies failures with processor and system board functions. Corrective action may require replacing system boards or processor assemblies. Table B-2: Primary Processor Test Error Codes Error Code Description Recommended Action...

  • Page 168: Memory Test Error Codes

    Test Error Codes Table B-2: Primary Processor Test Error Codes continued Error Code Description Recommended Action 122-XX Multiprocessor Dispatch The following steps apply to error codes 122-XX and test failed 123-XX: Check the system configuration and retest. 123-XX Interprocessor Communication test failed Replace the processor and retest.

  • Page 169

    Test Error Codes Table B-3: Memory Test Error Codes continued Error Code Description Recommended Action 203-XX Memory write/read test The following steps apply to error codes 203-XX failed through 206-XX: Replace the memory module and retest. 204-XX Memory address test failed Replace the processor and retest.

  • Page 170: Keyboard Test Error Codes

    Test Error Codes Table B-3: Memory Test Error Codes continued Error Code Description Recommended Action 0212-03 Error while restoring test block 0212-89 ECC error during testing 0214-01 Data error during test 0214-89 ECC error during testing Non-functioning DC-DC Replace the Processor Power Module (PPM). converter for processor X 300-399, Keyboard Test Error Codes The 300 series of diagnostic error codes identifies failures with keyboard and system...

  • Page 171: Parallel Printer Test Error Codes

    Test Error Codes 400-499, Parallel Printer Test Error Codes The 400 series of diagnostic error codes identifies failures with a parallel printer interface board or system board functions. Corrective action may require replacement of the serial/parallel interface board or the system board assembly. Table B-5: Parallel Printer Test Error Codes Error Code Description...

  • Page 172

    Test Error Codes 500-599, Graphics Controller Unit Test Error Codes The 500 series of diagnostic error codes identifies failures with graphics or system board functions. Corrective action may require replacement of the graphics board or the system board assembly. Table B-6: Graphics Controller Unit Test Error Codes Error Code Description Recommended Action...

  • Page 173: Diskette Drive Test Error Codes

    Test Error Codes Table B-6: Graphics Controller Unit Test Error Codes continued Error Code Description Recommended Action 514-XX Graphics white screen test failed 516-XX Graphics noise pattern test failed 600-699, Diskette Drive Test Error Codes The 600 series of diagnostic error codes identifies failures with diskette, diskette drive, or system board functions.

  • Page 174: Serial Test Error Codes

    Test Error Codes Table B-7: Diskette Drive Test Error Codes continued Error Code Description Recommended Action 609-XX Diskette reset controller test failed 610-XX Diskette change line test failed 699-XX Diskette drive/media ID Replace the media and retest. error Run the server setup utility and retest. 1100-1199, Serial Test Error Codes The 1100 series of diagnostic error codes identifies failures with serial interface board or system board functions.

  • Page 175: Modem Communications Test Error Codes

    Test Error Codes 1200-1299, Modem Communications Test Error Codes The 1200 series of diagnostic error codes identifies failures with the modem. Corrective action may require replacement of the modem. Table B-9: Modem Communications Test Error Codes Error Description Recommended Action Code 1201-XX Modem internal loopback...

  • Page 176

    Test Error Codes 6000-6099, Network Interface Controllers Test Error Codes The 6000 series of diagnostic error codes identifies failures with various network interface controllers. Corrective action may require replacement of the controller. Table B-10: Network Interface Controllers Test Error Codes Error Description Recommended Action...

  • Page 177

    Test Error Codes 6100-6199, Array Controller Test Error Codes The 6100 series of diagnostic error codes identifies failures with an array controller. Refer to “Array Diagnostic Utility” in Chapter 4 for more information on diagnosing problems with the drive array. Table B-11: Array Controller Test Error Codes Error Description...

  • Page 178: Scsi Hard Drive Test Error Codes

    Test Error Codes 6500-6599, SCSI Hard Drive Test Error Codes The 6500 series of diagnostic error codes identifies failures with SCSI hard drives, SCSI hard drive controller boards, SCSI hard drive cabling, and system board functions. If the system uses a drive array controller, refer to “Array Diagnostic Utility”...

  • Page 179

    Test Error Codes 6600-6699, SCSI/IDE CD-ROM Drive Test Error Codes The 6600 series of diagnostic error codes identifies failures with the CD-ROM cabling, CD-ROM drive, adapter board, or system board assembly. Corrective action may require replacement of the CD-ROM cabling, CD-ROM drive, adapter board, or system board assembly.

  • Page 180: Scsi Tape Drive Test Error Codes, Pointing Device Interface Test Error Codes

    Test Error Codes 6700-6799, SCSI Tape Drive Test Error Codes The 6700 series of diagnostic error codes identifies failures with the tape cartridge, tape drive, media changer, tape drive cabling, SCSI adapter, or system board assembly. Corrective action may require replacement of any of these parts. Table B-14: SCSI Tape Drive Test Error Codes Error Description...

  • Page 181: Post Error Messages

    POST Error Messages This appendix contains a complete list of all Power-On Self-Test (POST) error messages. For more information, refer to “Power-On Self-Test” in Chapter 4. The error messages and codes listed include all codes generated by ProLiant and TaskSmart products. Your system generates only the codes that are applicable to your configuration and options.

  • Page 182

    POST Error Messages WARNING: To avoid potential problems, ALWAYS read the warnings and cautionary information in your server-specific user documentation before removing, replacing, reseating, or modifying system components. Non-Numeric POST Error Messages or Beeps Only Table C-2: Non-Numeric Error Messages or Beeps Only Error Code Audible Probable Source of...

  • Page 183

    POST Error Messages Table C-2: Non-Numeric Error Messages or Beeps Only continued Error Code Audible Probable Source of Action Beeps Problem • Beeps only 2 long No valid memory is If no memory modules present in the system. are present, install at least one memory module to conform to minimum hardware...

  • Page 184

    POST Error Messages Table C-2: Non-Numeric Error Messages or Beeps Only continued Error Code Audible Probable Source of Action Beeps Problem ECC Multiple Bit Error None A memory module failure Run Diagnostics and replace Detected in DIMM X generated a multiple bit failed memory module as error that could not be indicated.

  • Page 185

    POST Error Messages Table C-2: Non-Numeric Error Messages or Beeps Only continued Error Code Audible Probable Source of Action Beeps Problem No Keyboard Present None Keyboard failure Turn off the computer, occurred, or keyboard not and then reconnect the connected to server. keyboard.

  • Page 186

    REDUNDANT ROM None ROM bootblock is Contact authorized service ERROR: Bootblock corrupt. provider. invalid. - contact COMPAQ Representative. RESUME – F1 key None As indicated to continue. Press the F1 key. (Run System None A configuration error Press the F10 key to run Configuration Utility –...

  • Page 187

    POST Error Messages Table C-2: Non-Numeric Error Messages or Beeps Only continued Error Code Audible Probable Source of Action Beeps Problem WARNING: A Type 2 2 short Only Type 0 and Type 1 Refer to the operating system Header PCI Device Header PCI Devices are documentation or the device has been detected.

  • Page 188

    POST Error Messages Table C-3: 100 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 101-Option ROM 1 long, An expansion board in If you have recently added Checksum Error 1 short your system is not new hardware, remove it and working correctly.

  • Page 189

    POST Error Messages Table C-3: 100 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 104-ASR Timer None System board failure. Run Diagnostics. Replace Failure failed assembly as indicated. 105-Current System 2 long Nonbooted ROM image Flash the ROM utilizing ROM is corrupt –...

  • Page 190

    POST Error Messages Table C-3: 100 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 163-Time & Date Not 2 short Invalid time or date in Run the server setup utility configuration memory. and correct. The system time is invalid.

  • Page 191

    POST Error Messages Table C-3: 100 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 171-2-IRC None One or more of the Run the server setup utility Configuration Invalid following messages may and correct. also be displayed: •...

  • Page 192

    POST Error Messages Table C-3: 100 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 173-Slot ID Mismatch None Board replaced, Run the server setup utility configuration not and correct. updated. 174-Configuration/ None EISA or PCI board not Run the server setup utility Slot Mismatch Device found.

  • Page 193

    POST Error Messages Table C-3: 100 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 178-Processor None Processor type or step Run the server setup utility Configuration Invalid does not match and correct. configuration memory. 179- System revision None Incomplete system...

  • Page 194

    POST Error Messages 200 Series POST Error Messages Table C-4: 200 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 201-Memory Error None RAM failure detected. Run Diagnostics. Replace failed assembly as indicated. 201-Memory Error None RAM failure detected.

  • Page 195

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 206-cache controller 2 short A memory module is Run Diagnostics. Replace or error malfunctioning. remove malfunctioning memory module as indicated. 207-ECC Corrected 2 short A memory module is...

  • Page 196

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 207-Invalid Memory 1 long, 1 Memory within 2-DIMM Install correctly matched Configuration – short Group X is not utilized. DIMMs in Group X. Mismatched DIMMs within DIMM Group X 207-Invalid Memory...

  • Page 197

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 207-Invalid Memory 1 long, 1 Installed DIMMs are too Replace DIMMs in indicated Configuration - short slow. slots with supported type. Unsupported DIMM In DIMM Socket X Insufficient Timings...

  • Page 198

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 207-Invalid Memory None A memory bank has been Check the location of the Configuration, Bank X incorrectly populated. bank specified. This bank number will correspond to the bank number shown on the memory expansion board...

  • Page 199

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 209-Online Spare 1 long, 1 DIMM group mismatch Check the size and speed of Memory Configuration short between Bank B and the memory modules - Bank B does not Bank C.

  • Page 200

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 209-Online Spare 1 long, Installed DIMMs for Install or reinstall DIMMs to Memory 1 short online spare bank are of support online spare Configuration-Spare a different primary width...

  • Page 201

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 213-Processor None The PPM and voltage Check whether the PPM and Disabled, Processor X regulator module (VRM) VRM is installed and correctly for processor X is not seated.

  • Page 202

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 216-Voltage None Indicated PPM has lost Run Diagnostics. Replace Regulator Module for redundancy. failed assembly as indicated. Processor X no longer redundant.

  • Page 203

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 220-cache accelerator None Cache accelerator in slot Check cache accelerator Slot X Initialization X improperly installed or installation and if properly Failed.

  • Page 204

    POST Error Messages Table C-4: 200 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 258-Invalid Memory None Failed or incompatible Be sure that the specified TRCD reading DIMM. DIMM meets the requirements for supported DIMMs.

  • Page 205

    POST Error Messages 300 Series POST Error Messages Table C-5: 300 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 301-Keyboard Error None Keyboard failure Turn off the computer, occurred. and then reconnect the keyboard. Be sure that no keys are depressed or stuck.

  • Page 206

    POST Error Messages Table C-5: 300 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 303-keyboard None System board, keyboard, Be sure that the controller error or mouse controller keyboard and mouse are failure occurred. connected.

  • Page 207

    POST Error Messages 400 Series POST Error Messages Table C-6: 400 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 40X-Parallel Port X 2 short Both external and internal Run the server setup utility Address Assignment ports are assigned to and correct.

  • Page 208

    POST Error Messages 600 Series POST Error Messages Table C-8: 600 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 601-Diskette None Diskette-controller Be sure the diskette drive Controller Failed circuitry failure occurred. cables are connected. Replace the diskette drive, the cable, or both.

  • Page 209

    POST Error Messages Table C-8: 600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 602-Diskette Boot None The boot sector on the Remove diskette from the Record Error boot disk is corrupt. diskette drive. Replace diskette in drive.

  • Page 210

    POST Error Messages 800 Series POST Error Messages Table C-9: 800 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 800-Server Feature None The system has detected Install the Server Feature Board must be the absence of the Board in PCI slot 1.

  • Page 211

    POST Error Messages Table C-9: 800 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 802-Processor X is None The system detected that Be sure that a processor missing or terminator the indicated processor or processor terminator board is not present.

  • Page 212

    POST Error Messages Table C-9: 800 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 804-100MHz memory None A memory module other Reseat all memory is incompatible. than 133-MHz ECC modules in the system. System Halted Registered SDRAM Remove incompatible DIMM has been...

  • Page 213

    POST Error Messages Table C-9: 800 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 806-CMOS has been None The system switch Remove power from the reset SW2.2 has been turned system. on and the system Please power off and Locate the system switch configuration data has...

  • Page 214

    POST Error Messages Table C-9: 800 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 806-NVRAM has None The system configuration Remove power from the been reset switch (SW1), position 2 system. has been turned on and Please power off the Locate the system the system configuration...

  • Page 215

    POST Error Messages Table C-9: 800 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 807-The password None The system switch Remove power from the has been cleared. SW2.1 has been turned system. Please power off and on and the system Locate the system switch restore the clear...

  • Page 216

    POST Error Messages 900 Series POST Error Messages Table C-10: 900 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 912-The computer None The server cover has Be sure that any system cover has been been removed since the access was authorized.

  • Page 217

    POST Error Messages 1100 Series POST Error Messages Table C-11: 1100 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 1151-Com Port 1 2 short Both external and internal Run the server setup utility Address Assignment serial ports are assigned and correct.

  • Page 218

    POST Error Messages 1600 Series POST Error Messages Table C-12: 1600 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 1610- Temperature None The ambient system Room temperature is too violation detected. - temperature has high; lower the room Waiting 5 Minutes for exceeded acceptable temperature.

  • Page 219

    POST Error Messages Table C-12: 1600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1611-CPU Fan not None Required fan not Check fans. present installed. Reseat fan cable. If this does not solve the problem, replace the fan.

  • Page 220

    POST Error Messages Table C-12: 1600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1611-Front CPU None Required fan not Check fans. Fan X not present installed. Reseat fan cable. If this does not solve the problem, replace the fan.

  • Page 221

    POST Error Messages Table C-12: 1600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1611-Power Supply None Required fan not Check fans. Fan X not present installed. Reseat fan cable. If this does not solve the problem, replace the fan.

  • Page 222

    POST Error Messages Table C-12: 1600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1611-Slot Fan Failure None Required fan not Check fans. Detected spinning. Reseat fan cable. If this does not solve the problem, replace the fan.

  • Page 223

    POST Error Messages Table C-12: 1600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1615- Power Supply None Power supply has failed. Reseat power supply firmly. Failure -Or- Replace power supply as soon as possible. 1615-Power Supply None Power supply has failed.

  • Page 224

    Decrease the options in the system. Refer to the power calculator on the Web for power considerations: activeanswers.compaq.com/ aaconfigurator 1619- Server None The server management Install the server management board is board is not in the proper...

  • Page 225

    POST Error Messages Table C-12: 1600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1621-Error - Power None There are more SPMs Power supplies and System Supply required with than power supplies. Power Modules (SPMs) must System Power be installed in pairs and in line Module...

  • Page 226

    POST Error Messages Table C-12: 1600 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1627-System Power None The System Power Note the details of the Module Reported Module has returned an error status and act Error Status error status.

  • Page 227

    POST Error Messages 1700 Series POST Error Messages Table C-13: 1700 Series POST Error Messages Error Code Audible Probable Source of Action Beeps Problem 1701-SCSI Controller None Embedded SCSI Check SCSI cables and Failure controller has failed or a be sure that they are fully connected device is seated.

  • Page 228

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1705-Locked SCSI None A SCSI controller cannot Check the SCSI cable Bus Detected communicate with installation. devices connected to a Be sure that the SCSI SCSI bus.

  • Page 229

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1713-Slot z Drive None Flash ROM is going bad. Replace controller when Array Controller - Controller detected a possible, or try upgrading Redundant ROM checksum failure but is firmware using Options...

  • Page 230

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem • 1720-S.M.A.R.T. Hard None This is a prefailure If configured as a Drive Detects notification of a SCSI non-RAID 0 array, Imminent Failure hard drive that will fail replace the failing/failed...

  • Page 231

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1722-Slot x Drive None Controllers are not The Smart Array 3100ES or Array - Redundant operating redundantly. 4250ES Controllers are not Controller Pair Not operating properly.

  • Page 232

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1724-Slot X Drive None Indicates that logical No action is needed. Array – Physical Drive drive configuration has Position Change(s) been updated Detected –...

  • Page 233

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1728-Drive Array- None There is no array Restore data from backup. Abnormal Shut-Down accelerator battery Detected With Write- backup on this array Cache controller but caching was enabled.

  • Page 234

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1753-Slot z Drive None Controller is overheating. Be sure that there is adequate Array – Array system cooling and sufficient Controller Maximum airflow across controller.

  • Page 235

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1757 - Array None 4-MB array accelerator Replace 4-MB array Accelerator board installed on Smart accelerator board with a Daughterboard Array 42XX Controller.

  • Page 236

    System ROM upgrade is Run the latest system Array requires System required ROMPaq utility to upgrade the ROM Upgrade. Run system ROM as indicated. Systems ROMPaq Download it from the following Utility. website: www.compaq.com/support. continued C-56 HP Servers Troubleshooting Guide...

  • Page 237

    Array Option ROM is problem. Smart Array Controllers using not programmed Options ROMPaq. correctly. Download it from on the following website: www.compaq.com/support 1768-Slot X Drive None Power was lost while a No action required. This Array – Resuming logical expansion message is displayed logical drive operation was performed.

  • Page 238

    Options Download it from the following ROMPaq (available website: from www.compaq.com): www.compaq.com/support SCSI Port Y SCSI ID Z 1771-Primary Disk None Internal and external hard Run the server setup utility...

  • Page 239

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1775-Slot X Drive None Storage system problem Turn off power to system. Array – ProLiant detected. A SCSI Check external ProLiant Storage System Not enclosure seems to be power switch—external...

  • Page 240

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1777-Slot X Drive None Environment threshold Check cooling fan operation Array – ProLiant Drive was violated on the drive by placing hand over fan. Be Storage Enclosure enclosure.

  • Page 241

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1779-Slot X Drive None More drives were failed If this message is displayed Array – Replacement (or were replaced) than and drives have not been drive(s) detected OR the fault-tolerance level...

  • Page 242

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1783-Slot X Drive None Bad controller. If this Reseat the array accelerator Array Controller message is displayed module if detached. Try Failure.

  • Page 243

    Array not Configured not detected. Configuration Utility. (followed by one of Turn off system and the following): check SCSI cable (1) Run Compaq connections to be sure Array Configuration that drives are connected Utility properly. (2) No drives detected Run Array Diagnostic...

  • Page 244

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1786-Slot 1 Drive None System in Interim Data Press the F1 key to allow Array Recovery Recovery mode. Automatic Data Recovery to Needed.

  • Page 245

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1787-Drive Array None Hard drive X failed or Replace drive X as soon Operating in Interim cable is loose or as possible (depending Recovery Mode.

  • Page 246

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1789-Slot X Drive None Drives that were working Power down the system, Array SCSI Drive(s) when the system was last check cable connections, Not Responding.

  • Page 247

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1790-Disk X None Hard drive error or wrong Run the server setup utility Configuration Error drive type detected. and Diagnostics and correct. Replace failed assembly as indicated.

  • Page 248

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1794-Drive Array – None This is a warning that the Replace the array accelerator Array Accelerator battery charge is below board if batteries do not Battery Charge Low.

  • Page 249

    POST Error Messages Table C-13: 1700 Series POST Error Messages continued Error Code Audible Probable Source of Action Beeps Problem 1797-Drive Array – None Hard parity error detected Replace array accelerator Array Accelerator while reading data from daughter board. Read Error Occurred. posted-writes memory.

  • Page 250

    Missing or Invalid system ROM. Processor Microcode Patch. Please contact Compaq Computer Corporation for a new ROM BIOS to support the new Processor Stepping C-70 HP Servers Troubleshooting Guide...

  • Page 251: Adu Error Messages

    ADU Error Messages This appendix contains a complete alphabetical list of all Array Diagnostic Utility (ADU) error messages. For more information, refer to “Array Diagnostic Utility” in Chapter 4. IMPORTANT: This guide provides information for multiple servers. Some of the hardware or software information may not apply to your specific server.

  • Page 252

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages Message Description Recommended Action Accelerator board not detected Array controller did not detect Install an array accelerator a configured array accelerator board on an array controller. If board. an array accelerator board is installed, check for proper seating on the array controller board.

  • Page 253

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Accelerator status: Dirty data Number of cache lines Resolve the problem with the detected has reached limit. containing dirty data that drive or drives. The controller Cache still enabled, but writes cannot be flushed (written) to can then write the dirty data to...

  • Page 254

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Accelerator status: Obsolete During reset initialization, Normal operations should data was discarded obsolete data was found in the continue. cache, and was discarded (not written to the drives).

  • Page 255

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Adapter/NVRAM ID mismatch EISA nonvolatile RAM Run the server setup utility. (NVRAM) has an ID for a different controller from the one physically present in the slot.

  • Page 256

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Configuration signature is zero ADU detected that nonvolatile Run the latest version of RAM (NVRAM) contains a server setup utility to configure configuration signature of zero. the controller and NVRAM.

  • Page 257

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Controller reported POST The controller returned an Replace the controller. error. error from its internal Power-On Self-Test (POST). Error Code: X Controller restarted with a ADU did not find a valid Run the server setup utility to signature of zero...

  • Page 258

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Drive (bay) X is a replacement This drive has been replaced. If the replacement was drive This message is displayed if a intentional, allow the drive to drive is replaced in a fault rebuild.

  • Page 259

    RIS. If there are multiple this drive. errors, this drive may need to be replaced. FYI: Drive (bay) X is Compaq did not supply the If problems exist with this non-Compaq supplied. installed drive. drive, replace it with a supported drive.

  • Page 260

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Identify controller data did not The controller identification Run the Inspect Utility to check match with NVRAM. data from the array controller the controller identification does not match with the data.

  • Page 261

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Less than 75% of batteries at Battery pack on the array Allow enough time for batteries sufficient voltage Battery pack accelerator is below the to recharge (36 hours).

  • Page 262

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Logical Drive X status = loose A physical drive or an external Power the system down and cable detected storage unit may have a attempt to reconnect any loose cabling or connection problem.

  • Page 263

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Loose cable detected – logical Controller is unable to Check all controller and drive drives may be marked FAILED communicate with one or more cable connections.

  • Page 264

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Other controller indicates The other controller in the Be sure that both controllers different cache size redundant controller are using the same capacity configuration has a different array accelerator.

  • Page 265

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action SCSI port X, Drive ID Y has The monitor and performance Check and resolve the exceeded the following threshold for this drive has threshold that has been threshold(s) been violated.

  • Page 266

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action SCSI Port X, Drive A predictive failure warning for Replace this drive at the ID Y…S.M.A.R.T. predictive this hard drive has been earliest opportunity. failure errors Have been generated, indicating that a Refer to “SCSI Hot-Plug Hard...

  • Page 267

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Storage enclosure on SCSI The current cabling Refer to your server-specific bus X has a cabling error (bus configuration is not supported. user documentation for cabling disabled).

  • Page 268

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Storage enclosure on SCSI The external storage unit is Be sure that all fans are bus X indicated an overheated generating a temperature alert. connected and operating condition.

  • Page 269

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Storage enclosure on SCSI One or more fans in the Replace the failed fans. bus X indicated that the fan is external storage unit have degraded.

  • Page 270

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Swapped cables or One or more drive locations Power down the server. Configuration error detected. A were changed while an Place the drives in their drive rearrangement was expand operation was in original locations.

  • Page 271

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Swapped cables or The current firmware version Upgrade the firmware. Configuration error detected. on the controller cannot -Or- The configuration information interpret the configuration on the attached drives is not information on the connected If that does not solve the...

  • Page 272

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action System board is unable to Slot indicator on system board Be sure that both identify which slots the is not working correctly. controllers are fully seated controllers are in Firmware recognizes both in their slots.

  • Page 273

    If If this does not solve the this doesn’t help, contact your problem, contact an authorized COMPAQ service provider. service provider. continued HP Servers Troubleshooting Guide D-23...

  • Page 274

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Unknown disable code A code was returned from the Obtain the latest version of array accelerator board that ADU. ADU does not recognize. Unrecoverable read error Read parity errors were Replace the array accelerator detected when an attempt to...

  • Page 275

    ADU Error Messages Table D-1: Array Diagnostic Utility (ADU) Error Messages continued Message Description Recommended Action Write memory error Data can not be written to the With the system powered cache memory. This typically down, be sure that the cache means that a parity error was board is fully connected to the detected while writing data to...

  • Page 276

    Index accelerator parity read errors D-2 accelerator parity write .SCI files, backup 6-2 errors D-2 accelerator status:cache was automatically configured D-2 100MHz memory is accelerator status:data in the incompatible C-32 cache was lost D-2 accelerator status:dirty data detected has reached limit D-3 a correctable memory error accelerator status:dirty data...

  • Page 277

    POST error ECC (error checking and messages C-45 correcting) D-3 ART (Automatic Revision Tracking) 5-18 fans D-18, D-19 Ask Compaq A-3 firmware upgrading D-6, D-7, D-16, ASR timer failure C-9 D-18, D-19, D-21 ASR-2 (Automatic Server Recovery-2) identify controller data D-23...

  • Page 278

    Index event list messages 4-16 thermal shutdown 5-8 features 5-8 timer 5-9 hardware requirements 5-6 timer failure C-9 Health Driver 5-9 unattended recovery 5-5 IML log messages 5-16 UPS shutdown 5-8 lockup 4-16 UPS shutdown threshold 5-8 network AT commands protocol, restart setting 5-12 baud rate, settings 2-46 network access 5-10...

  • Page 279

    Index UPS, warning 2-8 ADU error messages D-2, D-3, D-4, batteries less than 75% at D-11, D-14, D-25 sufficient charge D-10 drive write D-24 battery pack X below reference cache accelerator Slot X voltage D-11 initialization failed C-23 battery pack X below reference cache accelerators not voltage, array installed C-22...

  • Page 280

    4-2 conflict C-37 controller is located in COMM port, POST error messages C-37 special video slot D-6 Compaq Support Paq See CSP controller is not comport invalid C-11 configured D-6 computer cover has been controller order not set...

  • Page 281

    ADU 4-2 detected C-3, C-39 Diagnostic Adapter 4-10 critical fan not present C-39 Diagnostic Station 4-10 CSP (Compaq Support Paq) 3-12, 3-13 Diagnostics 4-3, 4-6, 4-7 current SCSI bus cable Enterprise Diagnostics LX32 4-3 configuration is not iLO 4-12...

  • Page 282

    Index diskette ID drive types test DIMMs (dual inline memory modules), failed B-8 POST error messages C-4, C-15, C-16, diskette ID media failed B-8 C-17, C-18, C-19, C-20, C-21, C-23, diskette random seek test C-24, C-32 failed B-8 direct memory access channels See DMA diskette read test failed B-8 channels diskette reset controller test...

  • Page 283

    OK D-8 drive array, POST error messages C-52, drive (bay) X is failed D-8 C-56, C-57, C-58, C-60, C-62, C-65, drive (bay) X is non-Compaq C-67, C-69 supplied D-9 drive monitoring features are drive (bay) X is undergoing...

  • Page 284

    Index communication problems D-23 EFS (Extended Feature Supplement) 3-12, damage, preventing 6-11 3-13 data guarding D-11 EISA handling, precautions 6-11 boards, not found C-12 low voltage differential (LVD) D-24 event list messages 4-15 magnetic exposure 6-11 expansion device, POST error overheating D-12 messages C-10 replacement, caution 2-26...

  • Page 285

    Index error occurred reading RIS EISA Expansion Bus Master copy from SCSI Port X Drive Timeout 4-15 ID D-9 fatal exception 4-17 error while restoring test kernel panic 4-17 block B-4, B-5 PCI Bus Error 4-16 error while saving test processor correctable error threshold block B-4 passed 4-15...

  • Page 286

    4-14, 4-17 fuse problems 2-8 hot-plug, problems 2-21 FYI, Drive (bay) X is non- not redundant 4-14 Compaq supplied D-9 periodic spin-up 2-21 POST error messages C-2, C-3, C-38, C-39, C-40, C-41, C-42, C-43, C-44 precautions 6-7 graphics 320 x 200 mode color...

  • Page 287

    Index hot-plug fans See fans Hot-Plug Management Module, hard drives problems 2-40 adding 6-3 cannot access data 2-25 contacting 1-20 caution 2-29 website address A-2 error C-53 failure 2-24 fault tolerance 2-28 hot-plug guidelines 2-29 I/O address, determining 1-13 hot-plug installation 2-27 I/O board temperature hot-plug replacement, caution 2-27, violation detected C-38...

  • Page 288

    Index described A-8 required system fan failure 5-17 problems 2-40 self-test failure 1-12 IML (Integrated Management Log) server manager failure 5-17 accessing 4-12 UPS A/C line failure shutdown or battery ASR-2 errors 5-15 low 5-17 ASR-2 messages 5-16 IML Viewer A-8 Caution security level, described 4-13 Important Safety Information document 2-2 corruption C-13...

  • Page 289

    Index intelligent drive array invalid memory configuration controller failure C-62 mismatched DIMMS within DIMM group X C-16 inter-controller link connection could not be invalid memory configuration only registered SDRAM DIMMs established D-10 are supported C-16 interface board, serial/parallel, test error invalid memory configuration codes B-9 unsupported DIMM in board...

  • Page 290

    Index invalid processor PPM Linux installed C-20 event list messages 4-17 invalid voltage regulator kernel panic 4-17 module installed C-20 recovery software 3-7 IP, accessing 5-10 technical support, information IP/IPX, using network features 5-6 required 1-24 IRC (Integrated Remote Console) A-8 locating information IRC configuration invalid C-11 HP resources A-1...

  • Page 291

    Index memory configurtion warning C-18 magnetized items, precaution 6-7 memory detection failure C-18 maintenance and service guides, accessing memory device failure C-21 through website A-2 memory error C-14 Management Agents memory installation error C-18 accessing A-5 memory machine ID test described 5-1 failed B-3 Management Agents, enabling 5-1...

  • Page 292

    Index mirror memory configuration interface fault recovery tracking, board 2 present in single defined 5-19 board mirror C-20 topology map 6-6 mirror memory configuration network controller, troubleshooting 2-42 DIMMs on both boards do not network interface card match C-20 configuration failed B-11 mirror memory configuration network interface card single board mirror DIMMs...

  • Page 293

    Index non-functioning voltage online spare memory regulator module for configuration - bank C insufficient for bank processor X C-21 A C-19 nonsystem disk message 2-22 online spare memory non-system disk or disk configuration - bank C error C-5 insufficient for bank nonvolatile RAM See NVRAM B C-19 Novell NetWare...

  • Page 294

    Index option ROM checksum error C-8 PCI boards Option ROM Configuration for Arrays not found C-12 Utility See ORCA Utility POST error messages C-12 ORCA (Option ROM Configuration for PCI comport invalid C-11 Arrays) Utility A-6 PCI config slot mismatch C-11 PCI config slot mismatch - not OS/2 See IBM OS/2 found C-12...

  • Page 295

    Index 500 series C-27 port address conflict error C-27 600 series C-28 power supply C-42, C-43, C-45, C-46 800 series C-30 PPM C-4, C-20, C-21, C-22, C-23 900 series C-36 processors C-2, C-4, C-6, C-13, C-20, array enabler board C-45 C-21, C-22, C-23, C-31, C-32 ASR-2 C-9 recording 1-12...

  • Page 296

    Index server, steps 1-3 lost redundancy C-21, C-22 warning xii no redundancy 2-17 power fault detected in hot- not installed C-21 plug PCI slot X C-6 POST error messages C-4, C-20, C-21, power fault on processor C-22, C-23 bus C-23 problems 1-8, 1-11, 1-14, 2-16, 2-17 power line, precautions 6-7 test error codes B-5...

  • Page 297

    Index printer failed or not switches and jumpers 2-16 connected B-6 terminator boards 6-3, 6-6 printer pattern test test error codes B-2 failed B-6 testing 2-14, 2-15, 2-16 problem resolution notebook 6-6 unsupported, system halted C-6 problems programmable timer load data diagnosing 1-1 test failed B-2 preventing 6-1...

  • Page 298

    Index rear CPU fan X failure Remote Insight Lights-Out Edition A-7 detected C-41 remote management A-7 rear CPU fan X not Remote ROM Flash Utility present C-41 errors 3-10 reboot, defined xii requirements 3-9 reconfiguring, software 3-6 reserve information sector See RIS recovery Resource Paqs A-4 automatic data reconstruction 5-19...

  • Page 299

    Index RIS copies between drives do firmware, unsupported D-18 not match D-14 overheating D-18 RJ-45 receptacle, warning 2-2 power supply failure D-17 SCSI cable error detected C-47 accessing latest A-2 SCSI cable, POST error messages C-44, disaster recovery 5-2 C-45, C-47, C-48 possible failures 3-10 SCSI controller failure C-47 POST error messages C-4, C-6, C-7,...

  • Page 300

    Index SCSI port X, Drive ID Y RIS maintenance, suggestions 6-5 copies within this drive do management, website 4-8 not match D-15 naming conventions 6-5 SCSI port X, Drive ID Y periodic visual check 6-7 S.M.A.R.T. predictive powering down 1-3 failure errors have been shutdown procedure 1-3 detected D-16...

  • Page 301

    Index slot X drive array optional setup and installation guides, accessing SIMM failure detected C-52 through website A-2 slot X drive array physical short circuits 1-7, 1-10, 2-8 drive position change(s) shut down, defined xiii detected C-52 shutting down the server 1-3 slot X drive array ProLiant SIMMs (single inline memory modules) drive storage enclosure...

  • Page 302

    Index slot z drive array controller error recovery, ASR-2 5-5, 5-8 - redundant ROM checksum NetWare, installation 3-4 error C-49 operating system recovery 3-7 slot z drive array controller problems 3-8 - redundant ROM reconfiguring 3-6 reprogramming C-49 reloading 3-6 slot z drive array RAID 5 restoring 3-15 logical drive(s) configured...

  • Page 303

    Index storage enclosure on SCSI bus swapped cables or X indicated that the fan configuration error detected an unsupported module is unplugged D-19 drive arrangement was storage enclosure on SCSI bus attempted D-20 X is unsupported with its current firmware swapped cables or configuration error version D-18...

  • Page 304

    Index incomplete C-12 damage, preventing 6-13 items to record 1-13 DAT drives See DAT drives options not set C-9 DLT drives See DLT drives POST error messages C-9, C-10, C-11, drivers 2-34 C-12 head clog error 6-13 time & date not set C-9, C-10 label placement 6-6 system configuration problems 2-31, 2-32, 2-33, 2-34...

  • Page 305

    Index precautions 6-10 system board B-2, B-9 rack 6-10 tape drives B-15 SCSI bus, overheating D-18 video B-7 UPS batteries 2-8 testing a device 2-12 temperature alert C-70 the bootstrap processor is not the lowest stepping temperature violation processor in the detected C-38 system C-32 terminator board...

  • Page 306

    Index methodology 1-3 battery 2-8 online spare memory C-19 circuit breaker or fuse 2-8 operating system information connection problems 2-8 needed 1-22 failure 2-7 plan, developing 1-2 problems 1-7, 1-10, 2-5, 2-7 preparing 1-3 sleep mode, disabling 2-8 previously released servers 1-6, 1-7, 1-8 specifications 2-7 problems, diagnosing 1-1 USB (universal serial bus) devices,...

  • Page 307

    D-4 Windows NT See Microsoft Windows NT warranty information, obtaining A-3 worldwide technical support, telephone websites numbers 1-20 ActiveAnswers A-3 write memory error D-25 ActiveUpdate A-2 wrong accelerator D-25 Ask Compaq A-3 clusters 3-9 Index-32 HP Servers Troubleshooting Guide...

Comments to this Manuals

Symbols: 0
Latest comments: