HP ProLiant series Troubleshooting Manual
HP ProLiant series Troubleshooting Manual

HP ProLiant series Troubleshooting Manual

Hide thumbs Also See for ProLiant series:
Table of Contents

Advertisement

Quick Links

HP ProLiant Servers

Troubleshooting Guide

September 2005 (Third Edition)
Part Number 375445-003

Advertisement

Table of Contents
loading

Summary of Contents for HP ProLiant series

  • Page 1: Troubleshooting Guide

    HP ProLiant Servers Troubleshooting Guide September 2005 (Third Edition) Part Number 375445-003...
  • Page 2 © Copyright 2004, 2005 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein.
  • Page 3: Table Of Contents

    Contents Introduction ..........................10 Getting started ............................10 Common problem resolution ......................11 Loose connections ........................... 11 Service notifications..........................11 Updating firmware ..........................11 Hard drive guidelines ..........................12 SAS and SATA hard drive guidelines ....................12 SCSI hard drive guidelines ......................12 Hot-plug SCSI hard drive LED combinations ....................
  • Page 4 SmartStart Scripting Toolkit ......................50 HP ROM-Based Setup Utility......................50 Option ROM Configuration for Arrays ..................... 52 HP ProLiant Essentials Rapid Deployment Pack................... 52 Re-entering the server serial number and product ID ................52 Management CD........................... 53 Management tools........................... 53 Automatic Server Recovery ......................
  • Page 5 Types of ROM..........................59 Methods for updating firmware ....................... 60 Current firmware versions ....................... 61 Updating firmware ........................61 HP resources for troubleshooting....................62 Online resources ............................. 62 HP website ........................... 62 Server documentation ........................62 Service notifications ........................62 Subscriber's choice........................
  • Page 6 Accelerator Error Log ........................66 Accelerator Parity Read Errors: X ..................... 66 Accelerator Parity Write Errors: X ....................67 Accelerator Status: Cache was Automatically Configured During Last Controller Reset ......67 Accelerator Status: Data in the Cache was Lost................. 67 Accelerator Status: Dirty Data Detected has Reached Limit...
  • Page 7 Invalid memory types were found on the same node. Please check DIMM compatibility. - Some DIMMs may not be used ..........................74 Inter-Controller Link Connection Could Not Be Established ..............74 Less Than 75% Batteries at Sufficient Voltage ..................74 Less Than 75% of Batteries at Sufficient Voltage Battery Pack X Below Reference Voltage......
  • Page 8 Processor-related port 85 codes..................... 121 Memory-related port 85 codes ...................... 122 Expansion board-related port 85 codes ..................123 Miscellaneous port 85 codes ......................123 Contacting HP .......................... 125 Contacting HP technical support or an authorized reseller ................125 Customer self repair..........................125 Contents 8...
  • Page 9 Server information you need ........................126 Operating system information you need ....................126 Microsoft operating systems ......................126 Linux operating systems ........................ 127 Novell NetWare operating systems ....................128 SCO operating systems ........................ 128 IBM OS/2 operating systems ......................129 Sun Solaris operating systems .......................
  • Page 10: Introduction

    Use this section to locate a complete list of ADU error messages (on page 66), POST error messages and beep codes (on page 84), event list error messages (on page 114), HP BladeSystem infrastructure error codes (on page 117), and Port 85 codes and iLO messages (on page 121).
  • Page 11: Common Problem Resolution

    Updating firmware To update the system ROM or option firmware, use HP Smart Components. These components are available on the Firmware Maintenance CD and the HP website (http://www.hp.com/support). The most recent version of a particular server or option firmware is available on the following: •...
  • Page 12: Hard Drive Guidelines

    View additional documentation on updating firmware, such as the Regular Firmware Updates Essential for Optimal Performance and Functionality of HP ProLiant Servers white paper, on the HP ROM- BIOS/Firmware Updates website (http://h18023.www1.hp.com/support/files/server/us/romflash.html). Hard drive guidelines...
  • Page 13 (1) The drive is part of an array being selected by an array configuration utility; (2) Drive Identification has been selected in HP SIM; or (3) drive firmware is being updated. The drive has been placed offline due to hard disk drive failure or subsystem communication failure.
  • Page 14: Diagnostic Flowcharts

    Diagnostic flowcharts In this section Pre-diagnostic steps ..........................14 Troubleshooting flowcharts ........................17 Pre-diagnostic steps WARNING: To avoid potential problems, ALWAYS read the warnings and cautionary information in the server documentation before removing, replacing, reseating, or modifying system components. IMPORTANT: This guide provides information for multiple servers. Some information may not apply to the server you are troubleshooting.
  • Page 15 Warnings and cautions WARNING: Only authorized technicians trained by HP should attempt to repair this equipment. All troubleshooting and repair procedures are detailed to allow only subassembly/module-level repair. Because of the complexity of the individual boards and subassemblies, no one should attempt to make repairs at the component level or to make modifications to any printed wiring board.
  • Page 16: Symptom Information

    If the problem occurs randomly, what is the duration or frequency? To answer these questions, the following information may be useful: • Run HP Insight Diagnostics (on page 55) and use the survey page to view the current configuration or to compare it to previous configurations. •...
  • Page 17: Troubleshooting Flowcharts

    NOTE: To verify the server configuration, connect to the System Management homepage (on page 54) and select Version Control Agent. The VCA gives you a list of names and versions of all installed HP drivers, Management Agents, and utilities, and whether they are up to date.
  • Page 18: Start Diagnosis Flowchart

    Start diagnosis flowchart Use the following flowchart to start the diagnostic process. General diagnosis flowchart Diagnostic flowcharts 18...
  • Page 19 The General diagnosis flowchart provides a generic approach to troubleshooting. If you are unsure of the problem, or if the other flowcharts do not fix the problem, use the following flowchart. Diagnostic flowcharts 19...
  • Page 20: Power On Problems Flowchart

    Power on problems flowchart Server power-on problems flowchart Symptoms: • The server does not power on. • The system power LED is off or amber. • The external health LED is red or amber. • The internal health LED is red or amber. NOTE: For the location of server LEDs and information on their statuses, refer to the server documentation.
  • Page 21 Server blade power-on problems flowchart Symptoms: • The server does not power on. • The power on/standby LED is off or amber. • The health LED is red or amber. NOTE: For the location of server LEDs and information on their statuses, refer to the server documentation. Possible causes: •...
  • Page 22 • Loose or faulty power cord • Power source problem • Power on circuit problem • Improperly seated component or interlock problem • Faulty internal component Diagnostic flowcharts 22...
  • Page 23: Post Problems Flowchart

    POST problems flowchart Symptoms: • Server does not complete POST NOTE: The server has completed POST when the system attempts to access the boot device. • Server completes POST with errors Possible Problems: • Improperly seated or faulty internal component •...
  • Page 24: Operating System Boot Problems Flowchart

    Operating system boot problems flowchart Symptoms: • Server does not boot a previously installed OS • Server does not boot SmartStart Possible Causes: • Corrupted OS • Hard drive subsystem problem Diagnostic flowcharts 24...
  • Page 25: Server Fault Indications Flowchart

    Server fault indications flowchart Symptoms: • Server boots, but a fault event is reported by Insight Management Agents (on page 54) • Server boots, but the internal health LED or external health LED is red or amber NOTE: For the location of server LEDs and information on their statuses, refer to the server documentation. Possible causes: •...
  • Page 26 • Unsupported component installed • Redundancy failure • System overtemperature condition Diagnostic flowcharts 26...
  • Page 27: Hardware Problems

    Hardware problems In this section Procedures for all ProLiant servers ......................27 Power problems ............................. 27 General hardware problems........................28 Internal system problems ......................... 30 System open circuits and short circuits ...................... 38 External device problems ........................38 Procedures for all ProLiant servers The procedures in this section are comprehensive and include steps about or references to hardware features that may not be supported by the server you are troubleshooting.
  • Page 28: Ups Problems

    UPS problems UPS is not working properly Action: Be sure the UPS batteries are charged to the proper level for operation. Refer to the UPS documentation for details. Be sure the UPS power switch is in the On position. Refer to the UPS documentation for the location of the switch.
  • Page 29: Unknown Problem

    After you check the settings in RBSU, save and exit the utility, and then restart the server. For more information on RBSU, refer to the HP ROM-Based Setup Utility User Guide on the Documentation CD or the HP website (http://www.hp.com/servers/smartstart).
  • Page 30: Third-Party Device Problems

    • If the system boots and video is working, add each component back to the server one at a time, restarting the server after each component is added to determine if that component is the cause of the problem. When adding each component back to the server, be sure to disconnect power to the server and follow the guidelines and cautionary information in the server documentation.
  • Page 31: Dat Drive Problems

    DAT drive problems Sense error codes are displayed Action: Refer to the Troubleshooting DAT Drives white paper for information on DAT drive sense error codes. Search for it on the HP website (http://www.hp.com). DAT drive error or failure occurs Action: Be sure drivers, software, and firmware are upgraded to the latest revisions.
  • Page 32: Diskette Drive Problems

    Latest firmware indicates a defective tape, or head clogs occur regularly Action: Replace the tape. Other errors are occurring Action: Replace the drive. Diskette drive problems Diskette drive light stays on Action: Be sure no loose connections (on page 11) exist. Be sure the diskette is not damaged.
  • Page 33: Dlt Drive Problems

    DLT drive problems Server cannot write to tape Action: • If the drive cleaning light is on, clean the drive. NOTE: DLT cleaning cartridges are good for only 20 uses. If the cleaning cartridge is near that limit and the drive cleaning light is still on after running the cleaning cartridge, use a new cleaning tape to clean the drive.
  • Page 34: Fan Problems

    NOTE: It is recommended that no more than two DLT drives per bus exist. • If using an external DLT drive that requires a SCSI terminator to be secured to the unused SCSI IN connector on the back of the drive, be sure the SCSI terminator is connected. DLT drives can be daisy chained, but do not connect more than three units per SCSI controller.
  • Page 35: Hard Drive Problems

    Action: Check the LEDs on the hard drive to be sure they indicate normal function. Refer to the server documentation or the HP website (http://www.hp.com) for information on hard drive LEDs. Be sure no loose connections (on page 11) exist.
  • Page 36: Memory Problems

    DIMM. • Remove any third-party memory. • Run HP Insight Diagnostics (on page 55) to test the memory. Server is out of memory Action: Be sure the memory is configured properly. Refer to the application documentation to determine the memory configuration requirements.
  • Page 37: Ppm Problems

    Be sure the memory is the correct type for the server and is installed according to the server requirements. Refer to the server documentation or HP website (http://www.hp.com). Be sure you have not exceeded the memory limits of the server or operating system. Refer to the server documentation.
  • Page 38: System Open Circuits And Short Circuits

    If the server has only one processor installed, replace it with a known functional processor. If the problem is resolved after you restart the server, the original processor failed. If the server has multiple processors installed, test each processor: Remove all but one processor from the server. Replace each with a processor terminator board or blank, if applicable to the server.
  • Page 39: Mouse And Keyboard Problems

    • For rack-mounted servers, check the cables to the KVM switch and be sure the switch is correctly set for the server. You may need to connect the monitor directly to the server to be sure the KVM switch has not failed. •...
  • Page 40: Audio Problems

    Action: Be sure the correct printer drivers are installed. Local I/O cable problems NOTE: The local I/O cable is used only with HP ProLiant p-Class server blades. Action: If the local I/O cable does not have hot-plug functionality, be sure you are not using a PS/2 keyboard or mouse.
  • Page 41 Type AT&F at the command prompt to reset the modem to factory-default settings. Be sure you are in terminal mode and not MS-DOS mode. Refer to the HP website (http://www.hp.com) for a complete list of AT commands. AT commands are not visible Action: Set the echo command to On using the AT command ATE.
  • Page 42: Network Controller Problems

    AT command initialization string is not working Action: Use the most basic string possible to perform the task. The default initialization string is AT&F&C1&D2&K3. Connection errors are occurring Action: Check the maximum baud rate for the modem to which you are connecting, and then change the baud rate to match.
  • Page 43 Be sure the controller drivers are up to date. Be sure a valid IP address is assigned to the controller and that the configuration settings are correct. Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated.
  • Page 44: Software Problems

    Other useful resources include HP Insight Diagnostics (on page 55) and HP SIM ("HP Systems Insight Manager" on page 54). Use either utility to gather critical system hardware and software information and to help with problem diagnosis.
  • Page 45: Operating System Updates

    Install the current drivers. If you apply the update and have problems, refer to the Software and Drivers Download website (http://h18007.www1.hp.com/support/files/server) to find files to correct the problems. Restoring to a backed-up version If you recently upgraded the operating system or software and cannot resolve the problem, you can try restoring a previously saved version of the system.
  • Page 46: Linux Operating Systems

    • Linux—Refer to the operating system documentation for information. Linux operating systems For troubleshooting information specific to Linux operating systems, refer to the Linux for ProLiant website (http://h18000.www1.hp.com/products/servers/linux). Application software problems Software locks up Action: Check the application log and operating system log for entries indicating why the software failed.
  • Page 47: Errors Occur After A Software Setting Is Changed

    Errors occur after a software setting is changed Action: Check the system logs to determine what changes were made, and then change settings to the original configuration. Errors occur after the system software is changed Action: Change settings to the original configuration. If more than one setting was changed, change the settings one at a time to isolate the cause of the problem.
  • Page 48: Invalid Or Incorrect Command-Line Parameters

    If the target system is not listed in the supported servers list, an error message is displayed and the program exits. Only supported systems can be upgraded using the Remote ROM Flash utility. To see if the server is supported, refer to the Software and Drivers Download website (http://h18007.www1.hp.com/support/files/server). Software problems 48...
  • Page 49: Software Tools And Solutions

    Servers running Microsoft® operating systems require Internet Explorer 5.5 (with Service Pack 1) or later. For Linux servers, refer to the README.TXT file for additional browser and support information. For more information, refer to the HP Array Configuration Utility User Guide on the Documentation CD or the HP website (http://www.hp.com).
  • Page 50: Smartstart Scripting Toolkit

    Configuring memory options • Language selection For more information on RBSU, refer to the HP ROM-Based Setup Utility User Guide on the Documentation CD or the HP website (http://www.hp.com/servers/smartstart). Using RBSU The first time you power up the server, the system prompts you to enter RBSU and select a language.
  • Page 51: Configuring Online Spare Memory

    RBSU by pressing the F9 key when prompted. After the settings are selected, exit RBSU and allow the server to reboot automatically. For more information, refer to the HP ROM-Based Setup Utility User Guide on the Documentation CD or the HP website (http://www.hp.com/servers/smartstart).
  • Page 52: Option Rom Configuration For Arrays

    It enables you to perform imaging or scripting functions and maintain software images. For more information about the RDP, refer to the HP ProLiant Essentials Rapid Deployment Pack CD or refer to the HP website (http://www.hp.com/servers/rdp).
  • Page 53: Management Cd

    ASR increases server availability by restarting the server within a specified time after a system hang or shutdown. At the same time, the HP SIM console notifies you by sending a message to a designated pager number that ASR has restarted the system. You can disable ASR from the HP SIM console or through RBSU.
  • Page 54: Erase Utility

    HP SIM provides device management capabilities that consolidate and integrate management data from HP and third-party devices. IMPORTANT: You must install and use HP SIM to benefit from the Pre-Failure Warranty for processors, SCSI hard drives, and memory modules.
  • Page 55: Usb Support

    USB support HP provides both standard USB support and legacy USB support. Standard support is provided by the operating system through the appropriate USB device drivers. HP provides support for USB devices before the operating system loading through legacy USB support, which is enabled by default in the system ROM.
  • Page 56: Survey Utility

    If a significant change occurs between data-gathering intervals, the Survey Utility marks the previous information and overwrites the Survey text files to reflect the latest changes in the configuration. Survey Utility is installed with every SmartStart-assisted installation or can be installed through the HP PSP ("ProLiant Support Packs"...
  • Page 57: Remote Support And Analysis Tools

    ISEE is a proactive remote monitoring and diagnostic tool to help manage your systems and devices, a feature of HP support. ISEE provides continuous hardware event monitoring and automated notification to identify and prevent potential critical problems. Through remote diagnostic scripts and vital system configuration information collected about your systems, ISEE enables fast restoration of your systems.
  • Page 58: Version Control

    Software and Drivers Download website (http://h18007.www1.hp.com/support/files/server). Change control and proactive notification HP offers Change Control and Proactive Notification to notify customers 30 to 60 days in advance of upcoming hardware and software changes on HP commercial products. For more information, refer to the HP website (http://h18023.www1.hp.com/solutions/pcsolutions/pcn.html).
  • Page 59: Firmware Maintenance

    A backup copy of the ROM image existing on the target server is made in the ROM image backup subdirectory: \CPQSYSTEM\FWBACKUP\SYSTEM For additional information, refer to the HP Online ROM Flash User Guide on the HP website (http://h18023.www1.hp.com/support/files/server/us/romflash.html). Software tools and solutions 59...
  • Page 60: Methods For Updating Firmware

    ROM upgrades required by each target server To download the tool and for more information, refer to the HP website (http://h18000.www1.hp.com/support/files/index.html). For OS-specific procedures, refer to the HP Online ROM Flash User Guide on the HP website (http://h18023.www1.hp.com/support/files/server/us/romflash.html). Offline ROM flash technology...
  • Page 61: Current Firmware Versions

    NOTE: Option ROMPaqs have been retired as an upgrade delivery method for storage options. Firmware upgrades for storage options are now delivered using Smart Components and Smart Component deployment utilities. For additional information about the ROMPaq utility, refer to the server documentation or the HP website (http://www.hp.com/support/files). ROM Update Utility The ROM Update Utility is offline ROM flash technology.
  • Page 62: Hp Resources For Troubleshooting

    Support on commercial online networks Refer to online forums to post questions to technical support or other HP users by using the Message Base Feature, which is a standard on support forums found on all three online networks. You can access HP utility files, drivers, software, and other information related to HP.
  • Page 63: Natural Language Search Assistant

    • HP Technical Documentation website (http://www.docs.hp.com) Installation and configuration information for the server management system Refer to the HP Systems Insight Manager Installation and User Guide on the Management CD or the HP website (http://www.hp.com/go/hpsim). HP resources for troubleshooting 63...
  • Page 64: Installation And Configuration Information For The Server Setup Software

    Refer to the power calculator on the HP Enterprise Configurator website (http://h30099.www3.hp.com/configurator/). Registering the server To register a server, refer to the registration card in the HP ProLiant Essentials Foundation Pack or the HP Registration website (http://register.hp.com). Server configuration information...
  • Page 65: Software Installation And Configuration Of The Server

    Software installation and configuration of the server If the server supports SmartStart, refer to the SmartStart installation poster in the HP ProLiant Essentials Foundation Pack. Switch settings, LED functions, drive, memory, expansion board and processor installation instructions, and board layouts Refer to the hood labels and the server user guide.
  • Page 66: Error Messages

    In this section ADU error messages..........................66 POST error messages and beep codes...................... 84 Event list error messages ........................114 HP BladeSystem infrastructure error codes....................117 Port 85 codes and iLO messages ......................121 ADU error messages Introduction to ADU error messages This section contains a complete alphabetical list of all ADU ("Array Diagnostic...
  • Page 67: Accelerator Parity Write Errors: X

    Accelerator Parity Write Errors: X Description: Number of times that write memory parity errors were detected during transfers to memory on the array accelerator board. Action: If many parity errors occurred, you may need to replace the array accelerator board. Accelerator Status: Cache was Automatically Configured During Last Controller Reset Description: Cache board was replaced with one of a different size.
  • Page 68: Accelerator Status: Obsolete Data Detected

    Description: The number of cache lines experiencing excessive ECC errors has reached a preset limit. Therefore, the cache has been shut down. Action: Reseat the cache to the controller. If the problem persists, replace the cache. Accelerator Status: Obsolete Data Detected Description: During reset initialization, obsolete data was found in the cache due to the drives being moved and written to by another controller.
  • Page 69: Accelerator Status: Valid Data Found At Reset

    Accelerator Status: Valid Data Found at Reset Description: Valid data was found in posted-write memory at reinitialization. Data will be flushed to disk. Action: No error or data loss condition exists. No action is required. Accelerator Status: Warranty Alert Description: Catastrophic problem exists with array accelerator board. Refer to other messages on Diagnostics screen for exact meaning of this message.
  • Page 70: Cache Has Been Disabled; Likely Caused By A Loose Pin On One Of The Ram Chips

    Cache Has Been Disabled; Likely Caused By a Loose Pin on One of the RAM Chips Description: Cache has been disabled due to a large number of ECC errors detected while testing the cache during POST. This is probably caused by a loose pin on one of the RAM chips. Action: Try reseating the cache to the controller.
  • Page 71: Controller Reported Post Error. Error Code: X

    page 56) examines each physical drive and looks for drives that have been moved to a different drive bay. Action: Look for messages indicating which drives have been moved. If no messages are displayed and drive swapping did not occur, run ACU ("Array Configuration Utility"...
  • Page 72: Drive (Bay) X Is A Replacement Drive

    If the problem persists, power down the system and replace the cable. If the problem persists, power down the system and replace the drive. Drive (Bay) X is a Replacement Drive Description: This drive has been replaced. This message is displayed if a drive is replaced in a fault- tolerant logical volume.
  • Page 73: Drive Monitoring Features Are Unobtainable

    Utility" on page 56) was trying to read the RIS from this drive. Action: HP stores the hard drive configuration information in the RIS. If multiple errors occur, the drive may need to be replaced. FYI: Drive (Bay) X is Third-Party Supplied Description: Third-party supplied the installed drive.
  • Page 74: Invalid Memory Types Were Found On The Same Node. Please Check Dimm Compatibility. - Some Dimms May Not Be Used

    Identify Logical Drive Data did not Match with NVRAM Description: The identify unit data from the array controller does not match with the information stored in NVRAM. This can occur if new, previously configured drives have been placed in a system that has also been previously configured.
  • Page 75: Logical Drive X Status = Failed

    Logical Drive X Status = Failed Description: This status could be issued for several reasons: • Logical drive is configured for No Fault Tolerance, and one or more drives failed. • Mirroring is enabled, and any two mirrored drives failed. •...
  • Page 76: Logical Drive X Status = Overheating

    Logical Drive X Status = Overheating Description: The temperature of the Intelligent Array Expansion System drives is beyond safe operating levels. Action: Check the fans and the operating environment. Logical Drive X Status = Recovering (rebuilding data on a replaced drive) Description: A physical drive in this logical drive has failed and has now been replaced.
  • Page 77: One Or More Drives Is Unable To Support Redundant Controller Operation

    If drives or arrays were moved, be sure the movement follows the guidelines listed in the documentation for the array controller. If the error persists after completing steps 1 through 4, contact an HP authorized service provider ("Contacting HP technical support or an authorized reseller"...
  • Page 78: Scsi Port X, Drive Id Y Has Exceeded The Following Threshold(S)

    SCSI Port X, Drive ID Y Has Exceeded the Following Threshold(s) Description: The monitor and performance threshold for this drive has been violated. Action: Check and resolve the threshold that has been violated. SCSI Port X, Drive ID Y is not Stamped for Monitoring Description: The drive has not been stamped with monitor and performance features.
  • Page 79: Scsi Port X, Drive Id Y Was Replaced On A Good Volume: (Failure Message)

    Description: A predictive failure warning for this hard drive has been generated, indicating a drive failure is imminent. Action: Replace this drive at the earliest opportunity. Refer to the server documentation for drive replacement information before performing this operation. SCSI Port X, Drive ID Y Was Replaced On a Good Volume: (failure message) Description: ADU ("Array Diagnostic Utility"...
  • Page 80: Storage Enclosure On Scsi Bus X Is Unsupported With Its Current Firmware Version

    Description: The external storage unit is generating a temperature alert. Action: Be sure all fans are connected and operating properly. Be sure the operating environment of the storage unit is within specifications. For better airflow, remove any dust buildup from fans or other areas. Check the server documentation for allowable temperature specifications and additional tips.
  • Page 81: Swapped Cables Or Configuration Error Detected. A Drive Rearrangement

    Description: You have exceeded the maximum number of drives supported for this controller, and the connected controller was not part of the original array configuration. Action: Update the controller to the latest firmware version. If the problem persists: Replace this controller with the original controller. Replace this controller with a new controller that supports the number of drives in the array.
  • Page 82: Swapped Cables Or Configuration Error Detected. The Maximum Logical Volume Count X

    If the problem persists, this might indicate a controller problem or a system board problem. CAUTION: Only authorized technicians trained by HP should attempt to remove the system board. If you believe the system board requires replacement, contact HP Technical Support before proceeding.
  • Page 83: This Controller Can See The Drives But The Other Controller Can't

    Unable to Retrieve Identify Controller Data. Controller May be Disabled or Failed ...SOLUTION: Power down the system. Verify that the controller is fully seated. Then power the system on and look for helpful error messages displayed by the controller. If this doesn’t help, contact your HP service provider. Description: ADU ("Array Diagnostic...
  • Page 84: Warning - Drive Write Cache Is Enabled On X

    A server generates only the codes that are applicable to its configuration and options. HP ProLiant p-Class server blades do not have speakers and thus do not support audio output. Disregard the audible beeps information if the server falls into this category.
  • Page 85: Non-Numeric Messages Or Beeps Only

    Non-numeric messages or beeps only Advanced Memory Protection mode: Advanced ECC Audible Beeps: None Possible Cause: Advanced ECC support is enabled. Action: None. Advanced Memory Protection mode: Advanced ECC with hot-add support Audible Beeps: None Possible Cause: Advanced ECC with Hot-Add support is enabled. Action: None.
  • Page 86 Critical Error Occurred Prior to this Power-Up Audible Beeps: None Possible Cause: A catastrophic system error, which caused the server to crash, has been logged. Action: Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated. Fan Solution Not Fully Redundant...
  • Page 87: Ilo Generated Nmi

    Audible Beeps: None Possible Cause: The hub link interface has experienced a critical failure that caused an NMI. Action: Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated. FATAL ROM ERROR: The System ROM is not Properly Programmed.
  • Page 88: Network Server Mode Active And No Keyboard Attached

    Invalid Password - System Halted! Audible Beeps: None Possible Cause: An invalid password was entered. Action: Enter a valid password to access the system. Invalid Password - System Restricted! Audible Beeps: None Possible Cause: A valid password that does not have permissions to access the system has been entered.
  • Page 89 Audible Beeps: None Possible Cause: An uncorrectable error memory event occurred in a memory DIMM. Action: Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) to identify failed DIMMs. Then, use the DIMM LEDs to identify failed DIMMs and replace the DIMMs.
  • Page 90 Action: Run ROMPaq Utility ("SoftPaqs" on page 58) to flash the system so that the primary and backup ROMs are valid. REDUNDANT ROM ERROR: Bootblock Invalid. - ..contact HP Representative. Audible Beeps: None Possible Cause: ROM bootblock is corrupt.
  • Page 91: 100 Series

    The system will run in Full Performance mode. Audible Beeps: None Possible Cause: The system is configured for HP Static Low mode and the current processor cannot support this mode. Action: For more information about the Power Regulator for ProLiant option, refer to the HP ROM-Based Setup Utility User Guide on the Documentation CD or the HP website (http://www.hp.com/servers/smartstart).
  • Page 92 Audible Beeps: None Possible Cause: 8237 DMA controllers, 8254 timers, and similar devices. CAUTION: Only authorized technicians trained by HP should attempt to remove the system board. If you believe the system board requires replacement, contact HP Technical Support before proceeding.
  • Page 93: Series

    Action: Event message, no action is required. 200 Series 201-Memory Error Audible Beeps: None Possible Cause: Memory failure detected. Action: Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated. 203-Memory Address Error Audible Beeps: None Possible Cause: Memory failure detected.
  • Page 94 207-Invalid Memory Configuration - Insufficient Timings on DIMM Audible Beeps: 1 long, 1 short Possible Cause: The installed memory module is not supported. Action: Install a memory module of a supported type. 207-Invalid Memory Configuration - Mismatched DIMMs within DIMM Bank Audible Beeps: 1 long, 1 short Possible Cause: Installed DIMMs in the same bank are of different sizes.
  • Page 95 207-Memory Configuration Warning - DIMM In Socket x does not have Primary Width of 4 and only supports standard ECC Advanced ECC does not function when mixing DIMMs with Primary Widths of x4 and x8. Audible Beeps: 1 long, 1 short, or none Possible Cause: Installed DIMMs have a primary width of x8.
  • Page 96: 300 Series

    Action: Exchange DIMMs and retest. Replace the memory board if problem persists. 212-Processor Failed, Processor X Audible Beeps: 1 short Possible Cause: Processor in slot X failed. Action: Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated. 214-Processor PPM Failed, Module X Audible Beeps: None Possible Cause: Indicated PPM failed.
  • Page 97: 400 Series

    CAUTION: Only authorized technicians trained by HP should attempt to remove the system board. If you believe the system board requires replacement, contact HP Technical Support before proceeding. Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated.
  • Page 98: 600 Series

    Possible Cause: Diskette controller circuitry failure occurred. Action: Be sure the diskette drive cables are connected. Replace the diskette drive, the cable, or both. Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated. 602-Diskette Boot Record Error Audible Beeps: None Possible Cause: The boot sector on the boot disk is corrupt.
  • Page 99: 1100 Series

    Run Insight Diagnostics ("HP Insight Diagnostics" on page 55) and replace failed components as indicated. 1100 Series 1151-Com Port 1 Address Assignment Conflict Audible Beeps: 2 short Possible Cause: Both external and internal serial ports are assigned to COM X.
  • Page 100 1611-CPU Zone Fan Assembly Failure Detected. Single fan..failure. Assembly will provide adequate cooling. Audible Beeps: None Possible Cause: Required fan is not spinning. Action: Replace the failed fan to provide redundancy, if applicable. 1611-Fan Failure Detected Audible Beeps: 2 short Possible Cause: Required fan is not installed or spinning.
  • Page 101 1611-Fan x Not Present (Fan Zone I/O) Audible Beeps: 2 short Possible Cause: Required fan is not installed or spinning. Action: Check the fans to be sure they are working. Be sure each fan cable is properly connected, if applicable, and each fan is properly seated. If the problem persists, replace the failed fans.
  • Page 102: 1700 Series

    1615-Power Supply Configuration Error Audible Beeps: None Possible Cause: The server configuration requires an additional power supply. A moving bar is displayed, indicating that the system is waiting for another power supply to be installed. Action: Install the additional power supply. 1615-Power Supply Configuration Error - A working power supply must be installed in Bay 1 for proper cooling.
  • Page 103 Audible Beeps: None Possible Cause: Upgrade the Array Accelerator module to a larger size. Action: Migrate logical drives to RAID 0 or 1, reduce the number of drives in the array, or upgrade to a larger-size array accelerator module. 1713-Slot Z Drive Array Controller - Redundant ROM Reprogramming Failure..Replace the controller if this error persists after restarting system.
  • Page 104 1720-S.M.A.R.T. Hard Drive Detects Imminent Failure Audible Beeps: None Possible Cause: A hard drive SMART predictive failure condition is detected. It may fail at some time in the future. Action: • If configured as a non-RAID 0 array, replace the failing or failed drive. Refer to the server documentation.
  • Page 105 1727-Slot X Drive Array - New Logical Drive(s) Attachment Detected..If more than 32 logical drives, this message will be followed by: “Auto-configuration failed: Too many logical drives.” Audible Beeps: None Possible Cause: The controller has detected an additional array of drives that was connected when the power was off.
  • Page 106 1770-Slot X Drive Array - SCSI Drive Firmware Update Recommended - ..Please upgrade firmware on the following drive(s) using ROM Flash Components (download from www.hp.com/support/proliantstorage): Model XYZ (minimum version = ####) Audible Beeps: None Possible Cause: Drive firmware update needed.
  • Page 107 1774-Slot X Drive Array - Obsolete Data Found in Array Accelerator Audible Beeps: None Possible Cause: Drives were used on another controller and reconnected to the original controller while data was in the original controller cache. Data found in the array accelerator is older than data found on the drives and has been automatically discarded.
  • Page 108 Replace the integrated array enabler board. Update the integrated Smart Array option to the latest firmware version. CAUTION: Only authorized technicians trained by HP should attempt to remove the I/O board. If you believe the I/O board requires replacement, contact HP Technical Support before proceeding.
  • Page 109 1779-Slot X Drive Array - Replacement drive(s) detected OR previously failed drive(s) now operational:..Port Y: SCSI ID Z: Restore data from backup if replacement drive X has been installed. Audible Beeps: None Possible Cause: More drives failed (or were replaced) than the fault-tolerance level allows. Unable to rebuild array.
  • Page 110 Be sure all drives are fully seated. Replace defective cables, drive X, or both. 1785-Slot X Drive Array Not Configured... (followed by one of the following): ...(1) Run Array Configuration Utility (2) No drives detected (3) Drive positions appear to have changed – Run Drive Array Advanced Diagnostics if previous positions are unknown.
  • Page 111: Drive Array Operating In Interim Recovery Mode

    1786-Slot 1 Drive Array Recovery Needed. Automatic Data Recovery Previously Aborted!..The following SCSI drive(s) need Automatic Data Recovery: SCSI Port Y: SCSI ID Z Select F1 to retry Automatic Data Recovery to drive. Select F2 to continue without starting Automatic Data Recovery. Audible Beeps: None Possible Cause: System is in Interim Data Recovery Mode and a failed or replacement drive has not yet been rebuilt.
  • Page 112 Repair the connection and press the F2 key. If the problem persists, run ADU ("Array Diagnostic Utility" on page 56) to resolve. • Be sure the cable is routed properly. 1789-Slot X Drive Array SCSI Drive(s) Not Responding..Check cables or replace the following SCSI drives: SCSI Port Y: SCSI ID Z Select F1 to continue –...
  • Page 113: 1794-1799 Drive Array

    1794-Drive Array - Array Accelerator Battery Charge Low..Array Accelerator is temporarily disabled. Array Accelerator will be re-enabled when battery reaches full charge. Audible Beeps: None Possible Cause: The battery charge is below 75 percent. Posted writes are disabled. Action: Replace the array accelerator board if the batteries do not recharge within 36 powered-on hours.
  • Page 114: Event List Error Messages

    ("Integrated Management Log" on page 56) to determine the cause of the lockup, and then refer to the HP ROM-Based Setup Utility User Guide, on the server Documentation CD or at the SmartStart website (http://www.hp.com/servers/smartstart), for more information. Error messages 114...
  • Page 115: Automatic Operating System Shutdown Initiated Due To Fan Failure

    Automatic operating system shutdown initiated due to fan failure Event Type: Fan failure Action: Replace the fan. Automatic Operating System Shutdown Initiated Due to Overheat Condition..Fatal Exception (Number X, Cause) Event Type: Overheating condition Action: Check fans. Also, be sure the server is properly ventilated and the room temperature is set within the required range.
  • Page 116: Real-Time Clock Battery Failing

    Real-Time Clock Battery Failing Event Type: System configuration battery low Action: Replace the system configuration battery. System AC Power Overload (Power Supply X) Event Type: Power supply overload Action: Switch the voltage from 110 V to 220 V or add an additional power supply (if applicable to the system).
  • Page 117: Uncorrectable Memory Error (Slot X, Memory Module Y)

    Event Type: Host bus error CAUTION: Only authorized technicians trained by HP should attempt to remove the system board. If you believe the system board requires replacement, contact HP Technical Support before proceeding. Action: Replace the board on which the processor is installed.
  • Page 118 Action: Perform the following steps to resolve the problem. Stop when the problem is resolved. Press the server blade management module reset button. Replace the signal backplane. For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/servers/proliant-bl/p-class/info). Server blade management module power backplane A error codes...
  • Page 119 Action: Perform the following steps to resolve the problem. Stop when the problem is resolved. Press the server blade management module reset button. Reseat the interconnect device. For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/servers/proliant-bl/p-class/info). Replace the interconnect device.
  • Page 120: Power Management Module Error Codes

    For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/servers/proliant-bl/p-class/info). Replace the interconnect module. For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/servers/proliant-bl/p-class/info). Interconnect Module B (6-Connector) Error Code...
  • Page 121: Port 85 Codes And Ilo Messages

    Power management module board error codes LED code: 7-1, 7-2, 7-3, 7-4, 7-5, 7-6, 7-7, 7-8, 7-9, 7-10, 7-11, 7-12, or 7-13 Location: Power management board Action: Perform the following steps to resolve the problem. Stop when the problem is resolved. Reseat the power management module.
  • Page 122: Memory-Related Port 85 Codes

    IMPORTANT: Reboot the server after completing each numbered step. If the error condition continues, proceed with the next step. To troubleshoot processor-related error codes: Bring the server to base configuration by removing all components that are not required by the server to complete POST.
  • Page 123: Expansion Board-Related Port 85 Codes

    Reseat the remaining memory boards, rebooting after each installation to isolate any failed memory boards. Replace the DIMMs with a remaining bank of memory. Replace the memory board. Replace the system board. IMPORTANT: If replacing the system board or clearing NVRAM, you must re-enter the server serial number through RBSU ("Re-entering the server serial number and product ID"...
  • Page 124 IMPORTANT: Processor socket 1 and PPM slot 1 must be populated at all times or the server will not function properly. • PPMs, except the PPM installed in slot 1 • DIMMS, except the first bank from one memory board •...
  • Page 125: Contacting Hp

    HP's customer self-repair program offers you the fastest service under either warranty or contract. It enables HP to ship replacement parts directly to you so that you can replace them. Using this program, you can replace parts at your own convenience.
  • Page 126: Server Information You Need

    Server information you need Before contacting HP technical support, collect the following information: • Explanation of the issue, the first occurrence, and frequency • Any changes in hardware or software configuration before the issue surfaced • Third-party hardware information: •...
  • Page 127: Linux Operating Systems

    • An updated Emergency Repair Diskette • If HP drivers are installed: • Version of the PSP used • List of drivers from the PSP • The drive subsystem and file system information: • Number and size of partitions and logical drives •...
  • Page 128: Novell Netware Operating Systems

    A list of the drivers and NLM files used on the server, including the names, versions, dates, and sizes (can be taken directly from the CONFIG.TXT or SURVEY.TXT files) • If HP drivers are installed: • Version of the PSP used •...
  • Page 129: Sun Solaris Operating Systems

    Type of installation selected: Interactive, WebStart, or Customer JumpStart • Which software group selected for installation: End User Support, Entire Distribution, Developer System Support, or Core System Support • If HP drivers are installed with a DU: Contacting HP 129...
  • Page 130 A detailed description of the problem and any associated error messages • Printouts or electronic copies (to e-mail to a support technician) of: • /usr/sbin/crash (accesses the crash dump image at /var/crash/$hostname) • /var/adm/messages • /etc/vfstab • /usr/sbin/prtconf Contacting HP 130...
  • Page 131: Acronyms And Abbreviations

    Acronyms and abbreviations ACPI Advanced Configuration and Power Interface Array Configuration Utility Advanced Data Guarding (also known as RAID 6) Array Diagnostics Utility CCITT International Telegraph and Telephone Consultative Committee cable select direct memory access driver update Extended Feature Supplement EULA end user license agreement Fibre Channel...
  • Page 132 integrated device electronics Integrated Lights-Out Integrated Management Display Integrated Management Log Internet Protocol ISEE Instant Support Enterprise Edition Internet service provider keyboard, video, and mouse light-emitting diode low-voltage differential non-maskable interrupt NVRAM non-volatile memory OBDR One Button Disaster Recovery ORCA Option ROM Configuration for Arrays Acronyms and abbreviations 132...
  • Page 133 operating system POST Power-On Self Test Processor Power Module ProLiant Support Pack RBSU ROM-Based Setup Utility RILOE Remote Insight Lights-Out Edition RILOE II Remote Insight Lights-Out Edition II reserve information sector read-only memory serial attached SCSI SATA serial ATA Systems Insight Manager SIMM single inline memory module SMART...
  • Page 134 SNMP Simple Network Management Protocol support software diskette uninterruptible power system universal serial bus Version Control Agent VCRM Version Control Repository Manager Acronyms and abbreviations 134...
  • Page 135: Index

    84 data recovery 31, 35 BIOS upgrade 53 deployment software 52 blank screen 38 diagnose tab, HP Insight Diagnostics 55 blue screen event 115 diagnosing problems 14, 16, 55 booting problems 30 diagnostic steps 14 booting the server 30...
  • Page 136 HP Enterprise Configurator 64 drivers 57, 63 HP Insight Diagnostics 55, 114 DVD-ROM drive 30 HP ProLiant Essentials Foundation Pack 54, 64, 65 HP ProLiant Essentials Rapid Deployment Pack 52 HP Systems Insight Manager, overview 54, 63, 64 ECC errors 67...
  • Page 137 logical drive, failure of 75, 76, 105 ORCA (Option ROM Configuration for Arrays) 52 logical drive, migrating 102 OS boot problems flowchart 24 loose connections 11 panic error 45 maintenance and service guide 65 parameters 48 Management Agents 54 parity errors 66, 67, 73, 89 Management CD 53, 63, 64 part numbers 64, 65 management tools 53...
  • Page 138 redundant ROM 59, 89, 90, 103 software troubleshooting 44, 46, 47 registering the server 64 specifications, option 65 reloading software 45 specifications, server 65 Remote Insight Lights-Out Edition II (RILOE II) 45, 53 start diagnosis flowchart 18 remote ROM flash 47, 48 storage enclosure 79, 80 remote ROM flash problems 47 storage, external 63...
  • Page 139 15, 65 Web-Based Enterprise Service 57 website, HP 62, 63, 125 white papers 63, 65 Index 139...

Table of Contents