Compaq 234664-002 - ProLiant - ML330T02 Troubleshooting Manual

Hp proliant servers troubleshooting guide
Hide thumbs Also See for 234664-002 - ProLiant - ML330T02:
Table of Contents

Advertisement

HP ProLiant Servers

Troubleshooting Guide

Part Number 375445-008
July 2009 (Eighth Edition)

Advertisement

Table of Contents
loading

Summary of Contents for Compaq 234664-002 - ProLiant - ML330T02

  • Page 1: Troubleshooting Guide

    HP ProLiant Servers Troubleshooting Guide Part Number 375445-008 July 2009 (Eighth Edition)
  • Page 2 © Copyright 2004, 2009 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warranties for HP products and services are set forth in the express warranty statements accompanying such products and services. Nothing herein should be construed as constituting an additional warranty. HP shall not be liable for technical or editorial errors or omissions contained herein.
  • Page 3: Table Of Contents

    Contents Introduction ..........................9 What's new.............................. 9 Revision history ............................9 375445-xx7 (November 2008) ......................9 375445-xx6 (September 2007) ...................... 10 375445-xx5 (June 2006) ....................... 10 375445-xx4 (May 2006)....................... 11 375445-xx3 (September 2005) ...................... 11 Getting started..........................12 HP ProLiant 100 Series Server troubleshooting information ................12 How to use this guide ..........................
  • Page 4 Problems with new hardware ......................38 Unknown problem ......................... 39 Third-party device problems ......................39 Internal system problems .......................... 40 CD-ROM and DVD drive problems....................40 Diskette drive problems ........................41 Drive problems (hard drives and solid state drives) ................42 SD card problems..........................
  • Page 5 Option ROM Configuration for Arrays ..................... 68 HP ProLiant Essentials Rapid Deployment Pack .................. 68 Re-entering the server serial number and product ID ................68 Management tools........................... 69 Automatic Server Recovery ......................69 ROMPaq utility..........................69 Remote Insight Lights-Out Edition II ....................69 Integrated Lights-Out technology ......................
  • Page 6 Product information resources ........................86 Additional product information......................86 Registering the server ........................86 Overview of server features and installation instructions..............86 Key features, option part numbers ....................86 Server and option specifications, symbols, installation warnings, and notices ........86 Teardown procedures, part numbers, specifications ................
  • Page 7 System AC Power Overload (Power Supply X)................. 158 System AC Power Problem (Power Supply X)................... 158 System Fan Failure (Fan X, Location) ....................158 System Fans Not Redundant......................158 System Overheating (Zone X, Location) ..................158 System Power Supplies Not Redundant................... 158 System Power Supply Failure (Power Supply X)................
  • Page 8 Acronyms and abbreviations...................... 177 Index............................183 Contents 8...
  • Page 9: Introduction

    Introduction What's new The eighth edition of the HP ProLiant Servers Troubleshooting Guide, part number 375445-xx8, includes the following additions and updates: • Added information to the introduction for the server power-on problems flowchart (on page 26). • Added new steps to Power problems (on page 36). •...
  • Page 10: 375445-Xx6 (September 2007)

    • Added information about HP BladeSystem c-Class enclosure fan problems to Fan problems (on page • Added HP Trusted Platform Module problems (on page 46) to Hardware problems (on page 36) • Added SD card problems (on page 44) to Hardware problems (on page 36) •...
  • Page 11: 375445-Xx4 (May 2006)

    375445-xx4 (May 2006) The fourth edition of the HP ProLiant Servers Troubleshooting Guide, part number 375445-xx4, included the following additions: • Hot-plug SAS and SATA hard drive LED combinations (on page 19) • Operating system issues with Intel® dual-core processors (Hyper-Threading enabled) (on page 59) •...
  • Page 12: Getting Started

    Getting started HP ProLiant 100 Series Server troubleshooting information For HP ProLiant 100 Series Server troubleshooting information, see the respective server user guides. How to use this guide NOTE: For common troubleshooting procedures, the term "server" is used to mean servers and server blades.
  • Page 13: Pre-Diagnostic Steps

    Use this section for a complete list of the following messages: ADU error messages (on page 90) POST error messages and beep codes (o n page 114) Event list error messages (on page 155) HP BladeSystem infrastructure error codes ("HP BladeSystem p-Class infrastructure error codes"...
  • Page 14: Warnings And Cautions

    This symbol on an RJ-45 receptacle indicates a network interface connection. WARNING: To reduce the risk of electric shock, fire, or damage to the equipment, do not plug telephone or telecommunications connectors into this receptacle. This symbol indicates the presence of a hot surface or hot component. If this surface is contacted, the potential for injury exists.
  • Page 15: Electrostatic Discharge

    WARNING: To reduce the risk of personal injury or damage to the equipment: Observe local occupation health and safety requirements and guidelines for • weight in kg manual handling. weight in lb • Obtain adequate assistance to lift and stabilize the chassis during installation or removal.
  • Page 16: Symptom Information

    For more information on static electricity or assistance with product installation, contact an authorized reseller. Symptom information Before troubleshooting a server problem, collect the following information: • What events preceded the failure? After which steps does the problem occur? • What has been changed since the time the server was working? •...
  • Page 17: Common Problem Resolution

    Common problem resolution Loose connections Action: • Be sure all power cords are securely connected. • Be sure all cables are properly aligned and securely connected for all external and internal components. • Remove and check all data and power cables for damage. Be sure no cables have bent pins or damaged connectors.
  • Page 18: Dimm Handling Guidelines

    DIMM a h ndling guidelines CAUTION: Failure to properly handle DIMMs can c ause damage to DIMM components and the system board connector. When handling a DIMM, observe the following guidelines: • Avoid electrostatic discharge (on page 15). • Always hold DIMMs by the side edges only •...
  • Page 19: Hard Drive Led Combinations

    Hard drive LED combinations Hot-plug SCSI hard drive LED combinations Activity Online Fault LED Interpretation LED (1) LED (2) On, off, or On or off Flashing A predictive failure alert has been received for this drive. flashing Replace the drive as soon as possible. On, off, or The drive is online and is configured as part of an array.
  • Page 20: Server Updates With An Hp Trusted Platform Module And Bitlocker™ Enabled

    Online/activity Fault/UID LED Interpretation LED (green) (amber/blue) On, off, or flashing Alternating amber The drive has failed, or a predictive failure alert has been and blue received for this drive; it also has been selected by a management application. On, off, or flashing Steadily blue The drive is operating normally, and it has been selected by a management application.
  • Page 21 • Moving a BitLocker™-protected drive to another server • Adding an optional PCI device, such as a storage controller or network adapter Common problem resolution 21...
  • Page 22: Diagnostic Flowcharts

    Diagnostic flowcharts Troubleshooting flowcharts To effectively troubleshoot a problem, HP recommends that you start with the first flowchart in this section, "Start diagnosis flowchart (on page 24)," and follow the appropriate diagnostic path. If the other flowcharts do not provide a troubleshooting solution, follow the diagnostic steps in "General diagnosis flowchart (on page 25)."...
  • Page 23 Under Product support, select the product. Select Manuals (guides, supplements, addendums, etc). Under Service and maintenance information, locate the link for the maintenance and service guide. HP BladeSystem c-Class Technical Documentation (http://www.hp.com/go/bladesystem/documentation) Select Support, Drivers and Manuals, and then select the product. Select Manuals, and then locate the link for the maintenance and service guide.
  • Page 24: Start Diagnosis Flowchart

    Start diagnosis flowchart Use the following flowchart to start the diagnostic process. Diagnostic flowcharts 24...
  • Page 25: General Diagnosis Flowchart

    Gene ral diagnosis flowchart The Genera l diagnosis flowchart provides a generic approach to troubleshooting. If you are unsure of the lem, or if the other flowcharts do not fix the problem, use the following flowchart. Diagnostic flowcharts 25...
  • Page 26: Power-On Problems Flowchart

    Power-on problems flowchart Server power-on problems flowchart Some servers have an internal health LED and an external health LED, while other servers have a single system health LED. The system health LED provides the same functionality as the two separate internal and external health LEDs.
  • Page 27 Diagnostic flowcharts 27...
  • Page 28 p-Class se er blade power-on problems flowchart c-Class server blade power-on problems flowchart For the location of server LEDs and information on their statuses, see the server documentation on the HP website (http://www.hp.com/support). Symptoms: • The server does not power on. Diagnostic flowcharts 28...
  • Page 29: Post Problems Flowchart

    • The system power LED is off or amber. • The health LED is red or amber. Possible causes: • Improperly seated or faulty power supply • Loose or faulty power cord • Power source problem • Improperly seated component or interlock problem POST problems flowchart Symptoms: •...
  • Page 30 NOTE: The server has completed POST when the system attempts to access the boot device. • Server completes POST with errors Possible problems: • Improperly seated or faulty internal componen • Faulty KVM device • Faulty video d evice Server and p-Class server blade POST problems flowchart Diagnostic flowcharts 30...
  • Page 31: Operating System Boot Problems Flowchart

    c-Class server blade POST problems flowchart Operating system boot problems flowchart Symptoms: • Server does not boot a previously installed OS • Server does not boot SmartStart Possible causes: • Corrupted OS • Hard drive subsystem problem • Incorrect boot order setting in RBSU Diagnostic flowcharts 31...
  • Page 32: Server Fault Indications Flowchart

    There are two ways to use SmartStart when diagnosing OS boot problems on a server blade: • Use iLO to remotely attach virtual d evices to mount the SmartStart CD onto the server blade. • Use a local I/O cable and drive to connect to the server blade, and then restart the server blade.
  • Page 33 • Server boots, but the internal health LED, external health LED, or component health LED is red or amber NOTE: For the location of server LEDs and inform ation on their statuses, refer to the server docum entation. Possible causes: •...
  • Page 34 Server and p-Class server blade fault indications flowchart Diagnostic flowcharts 34...
  • Page 35 c-Class server blade fault indications flowchart Diagnostic flowcharts 35...
  • Page 36: Hardware Problems

    Hardware problems Proced ures for all ProLiant serv The proce dures in this section are comprehensive and include steps about or references to hardwa featu s th at may not be supported by the server you are troubleshooting. Power problems Power source problems Action: Press the Power On/Standby button to be sure it is on.
  • Page 37: Ups Problems

    If running a red undant configuration, be sure that all of the power supplies in the system are the same. For a list of su pported power supplies, see the server documentation on the HP website (htt p://www.hp.com/support UPS p roblems UPS is not working properly...
  • Page 38: General Hardware Problems

    General hardware problems Problems with new hardware Action: Be sure the hardware being installed is a supported option on the server. For information on supported hardware, see the server documentation. If necessary, rem ove unsupported hardware. To be su re the problem is not caused by a change to the hardware release, see the release notes included with the hardware.
  • Page 39: Unknown Problem

    Be sure all boards are properly installed in the server. To s ee if the utility recognizes and tests the device, run HP Insight Diagnostics (on page 74). Uninstall the new hardware. Unkno wn problem Action: Disconnect power to the server. Following the guidelines and cautionary i nformation in the server documentation, strip the server to its most basic configuration by removing every card or device that is not necessary to start the...
  • Page 40: Internal System Problems

    To a different slot on the same bus (not applicable for PCI Express) To a PCI, PCI-X, or PCI Express slot on a different bus To the same slot in another working server of the same or similar design If the board w orks in any of these slots, either the original slot is bad or the board was not properly seated.
  • Page 41: Diskette Drive Problems

    Drive is not detected Action Be sure no loose connections (on page 17) exist. Refer to the drive documentation to be sure cables are connected as r equired. Be sure the cables are working properly. Replace with known functional cables to test whether th original cables were faulty.
  • Page 42: Drive Problems (Hard Drives And Solid State Drives)

    Be sure you are attempting to write to the proper drive by checking the drive letter in the path statement. Be sure enough space is available on the diskette. Drive problems (hard drives and solid state drives) HP SIM report s a drive fault on a parallel SCSI hard drive managed by an HP Smart Array co ntroller...
  • Page 43 System completes POST but drive fails Action: Be sure no loose connections (on page 17) exist. Be sure no device conflict exists. Be sure the hard drive is cabled properly and term inated if necessary. Be sure the hard drive data cable is working by replacing it with a known functional cable. Be sure the access panel is installed properly when th e server is operating.
  • Page 44: Sd Card Problems

    Run HP Insight Diagnostics (on page 74). Then, replace failed components as indicated. When the drive is a replacement drive on an array controller, be sure that the drive is the sam e type and of th e same or larger capacity than the original drive. Data is inaccessible Action:...
  • Page 45 Be sure the fans are properly seated and working. Follow the procedures and warnings in the server documentation for removing the access panels and accessing and replacing fa Unseat, and then reseat, each fan according to the proper procedures. Replace the access panels, and the n attempt to restart the server.
  • Page 46: Hp Trusted Platform Module Problems

    HP Trusted Platform Module problems Action: If the TPM fails and is no longer detected by RBSU, request a new syste m board and TPM board from an HP authorized service provider ("Contacting HP technical support or an authorized reseller" on page 171).
  • Page 47 Server is o ut f memory Action Be sure the memory is configured properly. R efer to the application documentation to determine the memory configuration requirements. Be sure no operating system errors are indicated. Be sure a memory count error ("Memory count error exists"...
  • Page 48: Ppm Problems

    Server fails to boot, all DIMM LEDs illumi nate amber, ..the health LED is blinking red, the system emits continuo us beeps, and an entry is logged to the Integrated Management Log (IML) Possible cause: The s erver is an HP ProLiant G6 server with 5500-series Xeon processors installed. The DIMMs are not installed according to the server requirements.
  • Page 49: Tape Drive Problems

    Be sure each processor is supported by the server and is installed properly. For processor requirements, see the server documentation. Be sure the server ROM is current. If an "unsupported processor detected" message is displayed, see "Unsupported processor stepping with Intel® process ors (on page 83)."...
  • Page 50 Check for conflicts in backup soft ware services. Check the SCSI/HBA/Driver con figuration of the drive. nspe ct media and cables, and discard any that are faulty or damaged. Contact HP support ("Contacting HP technical support or an authorized reseller" on page 171). Read/w rite issue Action:...
  • Page 51: Graphics And Video Adapter Problems

    Broken, missing, or loose leader pin Damaged cartridge seam Usage in incorrect environment Check if the Tape Error LED is flashing: Reload the suspect tape. If the Tape Error LED stops fla shing, the problem has cleared. Load a new or k nown good tape.
  • Page 52: External Device Problems

    External device problems Video r p oblems Screen is lank for more than 60 seconds after you power up the server Action: Be sure the monitor power cord is plugged into a working grounded (earthed) AC outlet. Power up the monitor and be sure the m onitor light is on, indicating that the monitor is receiving power.
  • Page 53: Mouse And Keyboard Problems

    • Be sure the monitor and any KVM switch are compatible with the VGA output of the server. Slow-m in ov g horizontal lines are displayed Action: Be sure magnetic field interference is not occurring. Move the monitor away from o ther monitors or power transformers.
  • Page 54: Local I/O Cable Problems

    Local I/O cable problems NOTE: The local I/O cabl e is used only with HP ProLiant p-Class server blades. Action: If the local I/O cable does not have hot-plug functionality, be sure you are not using a PS/2 keyboard or mouse. With a PS/2 keyboard or mouse, the local I/O cable cannot be connected as a hot- plug device.
  • Page 55 Modem does not answer an incoming call Action: Enable the auto-answer option in the communications software. Be sure an answering machine is not answering the line before the modem is able to answ Turn off the answering machine. Reconfigure the auto-answer option to respond in fewer ri ngs than the answering machine.
  • Page 56: Network Controller Problems

    You ar e unable to connect to an online subscription service Action: If the line you are accessing requir es error control to be turned off, do so using the AT command AT&Q6%C0. If the ISP you are accessing requires access at a decreased baud rat e, reconfigure the communications software to correct the connection baud rate to match the ISP.
  • Page 57: Expansion Board Problems

    Be sure the correct network driver is installed for the controller and that the driver file is not rupted. Reinstall the driver. sure no loose connections (on page 17) exist. Be sure the n etwork cable is working by replacing it with a known functional cable.
  • Page 58: Software Problems

    Softwa re problem The best sources of information for software problems are the operating system and application software documentation, which may also point to fault detection tools that report errors and preserve the system configuration. Other useful resources include HP Insight Diagnostics (on page 74) and HP SIM ("HP Systems Insight Manager"...
  • Page 59: Operating System Updates

    or specific GPF messages, refer to the Microsoft website (http://www.microsoft.com/whdc/devtools/debugging/default.mspx). Errors are displayed in the error log Action: Follow the information provided in the error log, and then refer to the operating system documentation. Problem s occur after the installation of a service pack Action: Follow the instructions for updating the operating system ("Operating system...
  • Page 60: Restoring To A Backed-Up Version

    from the update, it is recommended that you do not apply the updates. Some updates overwrite files specific to HP. If you decide to apply an operating system update: Perform a full system backup. Apply the operating system update, using the instructions provided. Install the current drivers ("Drivers"...
  • Page 61: Linux Operating Systems

    Novell NetWare—Repair traditio nal volumes with VREPAIR. On NetWare 5.X systems, repair NSS volumes with the NSS menu command, and on NetWare 6 systems, repair NSS volumes using the NSS/PoolVerify command followed by the NSS/PoolRebuild command, if necessary. Refer to the NetWare documentation for more information.
  • Page 62: Remote Rom Flash Problems

    • Check for overwritten files. Refer to the application documentation to find out which files are added by the applicatio • Reinstall the appl ication. • Be sure y ou have the most current dr ivers (" Drivers" on page 77). Remote ROM flash problems Gene ral remote ROM flash problems are occurring...
  • Page 63: Failure Occurs During Rom Flash

    proc edure fails during the ROM flash online preparation, the ROM flash does not occur for the target syste m. An error message describing the broken connection displays and the program exits. Attempt to as rtain and correct the cause of connection failure, and then restart the process. Failure occurs during ROM flash After the online flash preparation has b een successfully completed, the system ROM is flashed offline.
  • Page 64: Software Tools And Solutions

    Software tools and solutions Conf iguration tools Smart ta S rt software SmartStart is a collection of software that optimizes single-server setup, providing a simple and consistent way to deploy server configuration. SmartStart has been tested on many ProLi ant server products, resulting in proven, reliable configurations.
  • Page 65: Hp Rom-Based Setup Utility

    HP ROM-Based Setup Utility RBSU is a configuration utility embedded in ProLiant servers that performs a wide range of configuration activities that can include the following: • Configuring system devices and installed options • Enabling and disabling system features • Displaying system information •...
  • Page 66: Boot Options

    Drives installed Drives used RAID level RAID 0 RAID 1 3, 4, 5, or 6 3, 4, 5, or 6 RAID 5 More than 6 None To change any ORCA default settings and override the auto-configuration process, press the F8 key when prompted By default, th e auto-configuration process confi...
  • Page 67: Array Configuration Utility

    • Mirrored Memory Mode—Provides protection against uncorrectable memory errors that wou otherwise result in system failure. In this mode, the system maintains two copies of all data. If uncorrectable memory error occurs, the system automatically retrieves the good data from the mirrored copy.
  • Page 68: Option Rom Configuration For Arrays

    ACU Diagnostics never collects information about the data content of logical drives. The diagnostic report does not collect or include the following: • File system types, contents, or status • Partition types, sizes, or layout • Software RAID information • Operating system device names or mount points Optio n ROM Configuration for Arrays...
  • Page 69: Management Tools

    Enter the serial number and press the Enter key. Select Product ID. Enter the product ID and press the Enter key. Press the Esc key to close the menu. Press the Esc key to exit RBSU. Press the F10 key to confirm exiting RB SU.
  • Page 70: Erase Utility

    independent of the host se rver and its operating system. The iLO and iLO 2 subsystem provides remote access to any authorized n etwork client, sends alerts, and provides other server management functions. Using iLO and iLO 2, you can: •...
  • Page 71: Management Agents

    Management Agents Management Agents provide the information to enable fault, performance, and configuration management. The agents allow easy manageability of the server through HP SIM software, and third- party SNMP management platforms. Management Agents are installed with every SmartStart assisted installation or can be installed through the HP PSP.
  • Page 72: Hp Insight Server Migration Software For Proliant

    The Vulnerability and Patch Management Pack is included on the In sight Control Data Center Edition DVD. For more information, see the HP ProLiant Essentials Vulnerab ility and Patch Management Pack User Guide, or the HP website (http://www.hp.com). HP Insight Server Migration software for ProLiant The HP Insight Server Migration software for ProLiant (SMP) enables administrators to upgrade or replace an existing server through server migration.
  • Page 73: Hp Insight Control Linux Edition

    • Rapid server deployment • Health and performance monitoring • Comprehensive remote control • Vulnerability scanning and patch management • Power and thermal measurement, reporting, capping, and regulation • Integrated management of v irtual and physical infrastructure • Third-party device management For more informati on about Insight Control Environment suites, see the HP website (http://www.hp.co...
  • Page 74: Usb Support

    USB s p u port HP p rovides both standard USB support and legacy USB support. Standard support is provided by the OS throu gh the appropriate USB device drivers. Before the OS loads, HP provides support for USB devices throug le gacy USB support, which is enabled by default in the system ROM.
  • Page 75: Hp Insight Diagnostics Survey Functionality

    The HP Insight Dia gnostics Online Edition ("HP Insight Diagnostics" on page 74) provides the capability to use non-intrusiv e system level checks to diagnose Smart Array SCSI hard drives. Diagnosis supports SCSI, SATA, and SAS ha rd drives that are attached to a Smart Array controller and configured as part of a logical volume.
  • Page 76: Array Diagnostic Software

    For more in formation, refer to the Management CD in the HP ProLiant Essentials Foundation Pack. Array diagnostic software Determine the version of SmartStart supporte d by the server and choose one of the following array diagnostic methods: • For servers that support SmartStart 8.25 or later, array diagnostic capability is part of the ACU software.
  • Page 77: Keeping The System Current

    Keeping the system current Drivers IMPORTANT: Always perform a backup before installing or updating device drivers. The server includes new hardware that may not have driver support on all OS installation media. If you are installing a SmartStart-sup ported OS, use the SmartStart software (on page 64) and its Assisted Path feature to install the OS and latest driver support.
  • Page 78: System Online Rom Flash Component Utility

    • Works offline and online • Supports Microsoft® Windows® and Linux operating systems • Integrates with other software maintenance, deployment, and operating system tools • Automatically checks for hardware, firmware, and operating system dependencies, and installs only the correct ROM upgrades required by each target server For more information, se e the HP Smart Update Manager User Guide.
  • Page 79: Firmware Maintenance

    Firmware maintenance HP has developed technologies to help ensure that HP servers provide maximum uptime with minimal maintenance. Many of these technologies also reduce server management efforts, enabling administrators to work on issues and resolve problems without taking servers offline. The process of updating system or option firmware is referred to as a flash process or flashing the ROM.
  • Page 80: Verifying Firmware Versions

    Disaster recovery support The Disaster Recovery feature is supported on servers that do not sup port Redundant ROM. When a ROM flash fails or the system ROM becomes corrupted, disaster recovery en ables administrators to update the system ROM with a Systems ROMPaq Diskette or USB key. To use this feature on supported servers, boot to the appropriate media (ROM flash diskette or USB key) for the affected platform, and then allow the flash recovery process to complete.
  • Page 81: Updating Firmware

    Updating firmware To update the firmware: Check the firmware version on the device ("Verifying firmware versions" on page 80). Determine the latest firmware version available. If a TPM is installed and enabled on the server, disa ble BitLocker™ before updating the firmware. For more information, see the operating system docu mentation.
  • Page 82 System ROMPaq Firmware Upgrade Utility The Systems ROMPaq Firmware Upgrade Utility for ProL iant servers is available as a SoftPaq download from the HP website (http://www.hp.com/supp ). The Enhanced SoftPaq download contains utilities to res re or upgrade the System ROM on ProLiant servers: •...
  • Page 83: Unsupported Processor Stepping With Intel® Processors

    If you use a USB drive key, you must start the interface manually. Open a command-line interface and enter one of the following commands to access the Firmware Maintenance CD: In Windows®: \_autorun\autorun_win In Linux: /autorun Read the End-User License Agreement. If you agree to the terms of the license agreement, click Agree to continue.
  • Page 84: Unsupported Processor Stepping With Amd Processors

    Updatin s g ystem ROM without removing the processor If the "Unsupported Processor Detected" message is displayed, and you choose to leave the processor installed, the system will only boot the following devices: • Systems ROMPaq Diskette installed in a legacy diskette drive •...
  • Page 85: Hp Resources For Troubleshooting

    HP resources for troubleshooting Online resources HP Technical Support website Troubleshooting tools and information, as well as the latest drivers and flash ROM images, are available on the HP website (http://www.hp.com/support). HP Guided Troubleshooting website HP Guide d Trouble shooting is available for many products and components on the HP website http://www.hp.com/support/gts Server docu mentation...
  • Page 86: Care Pack

    Care Pack HP Care Pack Services offer upgraded service levels to extend and expand standard product warranty with easy-to-buy, easy-to-use support packages that help you make the most of yo ur server investments. Refer to the Care Pack website ( http://www.hp.com/hps/carepack/servers/cp_proliant.html Product information resources Addit...
  • Page 87: Technical Topics

    Technical topics Refer to white papers on one of the following: • HP Business Support Center (http://www.hp.com/go/bizsupport) • HP Industry Standard Server Technology Papers (http://h18004.www1.hp.com/products/servers/technology/whitepapers/index.html) Prod uct installation resources Switch settings, LED functions, drive, memory, expansion board and processor installation instructions, and board la youts Refer to the hood labels and the server user guide.
  • Page 88: Operating System Installation And Configuration Information (For Factory-Installed Operating Systems)

    Operating system installation and configuration information (for factor y-installed operating systems) Refer to the factory-installed operating system installation documentation that ships with the server. Server configuration infor mation Refer to the server user guide on the Documentation CD, the server installation poster shipped with the server, and the SmartStart installatio n poster (if the server supports SmartStart) in the HP ProLiant Essentials Foundation Pack.
  • Page 89 • HP Business Support Center website (http://www.hp.com/go/bizsupport) • HP Technical Documentation website (http://www.docs.hp.com) HP resources for troubleshooting 89...
  • Page 90: Error Messages

    Error messages error messages Introduction to ADU error messages This section contains a com plete alphabetical list of all ADU error messages for ADU version 7.85.16.0 and earlier. IMPORTANT: This guide provides information for multiple servers. Some information may not apply to the server you are troubleshooting.
  • Page 91 Accelerator Status: Cache was Automatically Configured During Last Controller Reset Description: Cache board was replaced with one of a different size. Action: No action is required. Accelerator Status: Data in the Cache was Lost..due to s ome reason other than the battery being discharged. Description: Data in cache was lost, but not because of the battery being discharged.
  • Page 92 Accelerator Status: Obsolete Data Detected Description: During reset initialization, obsolete data was found in the cache due to the drives being moved and written to by another controller. Action: No action is required. The controller either writes the data to the drives or discards the data completely.
  • Page 93 Accelerator Status: Warranty Alert Description: Catastrophic problem exists with array accelerator board. Refer to other messages on Diagnostics screen for exact meaning of this message. Action: Replace the array accelerator board. Adapter/NVRAM ID Mismatch Description: EISA NVRAM has a n ID for a different controller from the one physically present in the slot. Action: Run the server setup utility.
  • Page 94 Config uration Signature is Zero Description: ADU detected that NVRAM contains a configuration signature of zero. Old versions of the server setup utility could cause this. Acti on: Run the latest version of server setup utility to configure the controller and NVRAM. Config uration Signature Mismatch Description: The array accelerator board is configured...
  • Page 95 Controller Reported POST Error. Error Code: X Description: The controller returned an error from its internal POST. Action: Replace the controller. Contro ller Restarted with a Signature of Zero Description: ADU did not find a valid configuration signature to use to get the data. NVRAM may not be prese nt (un configured) or the signature present in NVRAM may not match the signature on the contro...
  • Page 96 Drive (Bay) X is a Replacement Drive Description: This drive has been replaced. This message is displayed if a drive is replaced in a fault- tolerant logical volume. Action: If the replacement was intentional, allow the drive to rebuild. Drive (Bay) X is a Replacement Drive Marked OK Description: The drive has been replaced and marked OK by the firmware, in one of three possible scenarios : the drive was replaced in a non-fault-tolerant configuration;...
  • Page 97 Drive Monitoring Features Are Unobtainable Description: ADU is unable to get monitor and performance data due to a fatal command problem (such as drive time-out), or is unable to get data due to these featu res not being supported on the controller. Action: C heck for other errors such as time-outs.
  • Page 98 Identify Logical Drive Data did not Match with NVRAM Description : The identify unit data from the array controller does not match with the information stored in NVRAM . This can occur if new, previously configured drives have been placed in a system that has also been previously configured.
  • Page 99 Action: Check for drive failures, wro ng drive replaced, or loose cable messages. If a drive failure occurred, replace the failed drive or drives, and then restore the data for this logical drive from the tape backup. Otherwise, follow the procedures for correcting problems when an incorrect drive is replaced or a loose cable is detected.
  • Page 100 Logical Drive X Status = Recovering (rebuilding data on a rep laced drive) Description: A physical d rive in this logical drive has failed and has now been replaced. The replaced drive is rebuilding from the mirror drive or the parity data. Action: No action is required.
  • Page 101 One or M re Drives is Unable to Support Redundant Controller Operation Description: At least one drive in use does not support redundant c ontroller operation. Action: Replace the drive that does not support redundant controller operation. Other Controller Indicates Different Hardware Model Description: The other controller in the redundant controller configuration i s a different hardware model.
  • Page 102 RIS Co pies Between Drives Do Not Match Description: The drives on this controller contain copies of the RIS that do not match. The hard drives in the array do not have matching configuration information. Action solv e all other errors encountered. Obtain the latest version of ADU, and then rerun ADU.
  • Page 103 Description: SMART is unable to communicate with the drive, because the cab le is not securely connected, or the drive cage connection has failed. Action: Power down the system. Reconnect the cable securely. Restart the system. If the problem persists, replace the cables and connectors as needed. SCSI P ort X, Drive ID Y RIS Copies Within This Drive Do Not Match Descripti...
  • Page 104 Soft firmware upgrade required Description: ADU has determined that the controller is running firmware that has been soft upgraded the Upgrade Utility. However, the firmware running is not present on all drives. This could be caused by the addition of new drives in the system.
  • Page 105 Description: The cooling fan located in the external storage unit has failed. Action: Replace the fan. Storage Enclosure on SCSI Bus X Indicated that the Fan is Degraded..SOLUTI ON: this condition usually occurs on enclosures with multiple fans and one of those fans has failed. eplace any fans not opera ting properly.
  • Page 106 Place the drives in their original locations. Restart the server, and then complete the expand operation. Move the drives to their new locat ions after the expand operation is completed. Swapped Cables or Configuration Error Detected. An Unsupported Drive Arrangement Was Attempted..SOLUTION: Power down system then move drives back to their original location.
  • Page 107 Description: More logical drives were created than are supported on this controller, causing lost logical drive volumes. Action: Identify the drives containing lost volumes, and then move them to another controller so the lost volumes can be recreated. CAUTION: Removing a drive that contains valid volume data causes all valid data to be lost. System Board is Unable to Identify which Slots the Controllers are in Description: The slot indicator on the system board is not working correctly.
  • Page 108: Unknown Disable Code

    Unable to Communicate with Drive on SCSI Port X, Drive ID Y Description: The array controller cannot communicate with the drive. Action: If the hard drive amber LED is on, replace the drive. Unable to Retrieve Identify Controller Data. Controller May be Dis abled or Failed ...SOLUTION: Power down the system.
  • Page 109 WARNING - Drive Write Cache is Enabled on X Description: Drive has its internal write cache enabled. The drive may be a third-party drive, or the operating parameters of the drive may have been altered. Condition can cause data corruption if power to the drive is interrupted.
  • Page 110: Adu Version 8.0 Through 8.28 Error Messages

    Write M emory Error Description: Data cannot be written to the cache memory. This typically means that a parity error was detected while writing data to the cache. This can be caused by an incomplete connection between the cache and the c ontroller.
  • Page 111 Action: Adjust the memory in stalled in the array accelerators to matching sizes. Array status: The array has failed. Action: Check for failed physical or logical drives in the array and correct the fa ilure cause. Controller State: The HBA does not have an access ID..External SCSI array co ntrollers that support SSP require HBAs to have an access ID.
  • Page 112 Logical drive state: Background parity initialization is currently queued..or in pr ogress on this logical drive. If b ackground parity initialization is queued, it will start when I/O is performed on the drive. When bac kground parity initializa tion completes, the performance of the logical drive will improve.
  • Page 113 NVRAM Error: Board ID could not be read..(Read-Only Table failed checksum test). Unrecoverable error. Action: Contact HP support ("Contacting HP" on page 171). NVRAM Error: Bootstrap NVRAM image failed checksum test,..bu t a bac kup image was found and successfu lly restored.
  • Page 114: Post Error Messages And Beep Codes

    Redundant Path Failure: The physical drive is degraded due to the loss of a redund ant path. Action: Check the storage box I/O module and cables to restore redundant paths to the physical drive. Redund ant Path Failure: Warn ing: Redundant I/0 modules of this storage box..are not cabled in a recommended configuration.
  • Page 115 Audible Beeps: None Possible Cause: This m essage indicates Online Spare Memory is enabled and indicates the amount of memory reserved for this feature. Action: None. Advanc ed Memory Protection mode: Multi-board mirro red memory with Advanced ...Xxxx MB System memory and xxxx MB memory res erved for Mirroring.
  • Page 116 Fan Solution Not Sufficient Audible Beeps: Possible Cause: The minimum number of required fans is missing or failed. Action: Install fans or replace any failed fans. Fatal DMA Error Audible Beeps: None Possible Cause: The DMA controller has experienced a cr itical error that has caused an NMI.
  • Page 117 FATAL ROM ERROR: The System ROM is not Properly Programmed. Audible Beeps: 1 long, 1 short Possible Cause: The System ROM is not properly programmed. Action: Replace the physical RO M part. Fibre Channel Mezzanine/Balcony Not Supported. Audible Beeps: 2 short Description: The Fibre Channel adapter is not supported on the server.
  • Page 118 Invalid memory types were found on the same node. Please check DIMM compatibility . - Some DIMMs may not be used Description: Invalid or mixed memory types were detected during POST. Action: Use only supported DIMM pairs when populating memory sockets. Refer to the applicable server user guid e memory requirements.
  • Page 119 Network Server Mode Active and No Keyboard Attached Audible Beeps: None Possible Cause: A keyboard is not connected. An error has not occurred, but a message is displayed to indicate the keyboard status. Action: No action is required. NMI - Button Pressed! Audible Beeps: None Possible Cause: The NMI button was pressed, initiating a m emory dump for crash dump analysis.
  • Page 120: Processor X Unsupported Wattage

    Parity C heck 2 - System DIMM Memory Audible Beeps: None Possible Cause: An uncorrectable err or memory event occurred in a memory DIMM. Action: Run Insight Diagnostics ("HP Insight Diagnostics" on page 74) to identify fa iled DIMMs. Then, use the DIMM LEDs to identify failed DIMMs and replace the DIMMs.
  • Page 121 Audible Beeps: None Possible Cause: The backup system ROM is corrupted. The primary ROM is valid. Action: Run ROMPaq Utility to flash the system so that the primary and backup ROMs are valid. REDUN DANT ROM ERROR: Bootblock Invalid. - ..contact HP Representative.
  • Page 122 This system only supports 667 MHz Front Side Bus Speed Processors. One or more 800 MHz Front Side Bus Speed Proce ssors have been initialized at 667 MHz. System Halted! Audible beeps: 1 long, 1 short Possible cause: One or more 800-MHz front side bus speed processors have been initialized at 667- MHz.
  • Page 123 Select a configuration option: 1. Enable OBDR 2. Exit Audible Beeps: None Possible Cause: A USB tape device that su pports One Button Disaster Recovery (OBDR) is installed in the system. Action: Press 1 or 2. Pressing 2 exits the con figuration.
  • Page 124: 100 Series

    WARNING: ProLiant Demand Based Power Management cannot be supported with th e following processor configuration. The syste m will run in Full Performance mode. Audible Beeps: None Possible Cause: The system is con figured for HP Static Low mode and the current processor cannot support this mode.
  • Page 125 Action: Replace the system board. Run the server setu p utility. 102-System Board Failure, CMOS Test Failed. Audible Beeps: None Possible Cause: 8237 DMA controllers, 8254 timers, and similar devices. CAUTION: Only authorized technicians trained by HP should attempt to remove the system board.
  • Page 126: Series

    162-Sy Options Not Set Audible Beeps: 2 long Possible Cause: Config uration is incorrect. The system configuration has changed since the last boot (addition of a hard driv e, for example) or a loss of power to the real-time clock has occurred. The real- time clock loses power if the onboard battery is not functioning correctly.
  • Page 127 207 - Invalid Memory Configuration Detected. DIMMs installed when no corresp onding processor is detected. Descripti on: Processor is required to be installed for memory to be used. Action: Populate the processor socket or remove the DIMM. 207-Inv id Memory Configuration - DIMMs must be installed in pairs or sequen tia y Audi...
  • Page 128 207-Invalid Memory Configuration - Mismatched DIMMs within DIMM Bank Audible Beeps: 1 long, 1 short Possible Cause: Installed DIMMs in the same bank are of different sizes. Action: I nstall correctly matched DIMMs. 207-Invalid Memory Configuration - Mismatched DIMMs within DIMM Bank..Memory in ank X Not Utilized.
  • Page 129 Audible Beeps: 1 long, 1 short, or none Possible Cause: Installed DIMMs have a primary width of x8. Action: Install DIMMs that have a primary width of x4 if Advanced ECC memory support is required. 209-O nline Spare Memory Configuration - No Valid Banks for Online Spare Audible Beeps: 1 long, 1 short Possible Cau...
  • Page 130 210-Memory Board Power Fault on board X Audible Beeps: 1 long, 1 short Possible Cause: A problem exists with a memory board po wering up properly. Action: E xchange DIMMs and retest. Replace the memory board if problem persists. 210-M em ry Board Failure on board X Audible Beeps: 1 long, 1 short Possible Cause: A problem exists with a memory board powering...
  • Page 131: 400 Series

    If the failure reoccurs, replace the keyboard. 303-Ke yb ard Controller Error Audible Beeps: None Possible Cause: System board, keyboard, or mouse controller failure occurred. Action: Be sure the keyboard and mouse are connected. CAUTION: Only authorized technicians trained by HP should attempt to remove the system board.
  • Page 132: 600 Series

    Audible Beeps: 2 short Possible Cause: A hardware conflict in the system is preventing the parallel port from working correctly. Action: If you have recently added new hardware, remove it to see if the hardware is the cause of the conflict.
  • Page 133: 1100 Series

    611-Primary Floppy Port Address Assignment Conflict Audible Beeps: 2 short Possible Cause: A hardware conflict in the system is preventing the diskette drive from operating properly. Action: Run the server setup utility to configure the diskette driv e port address and manually resolve the conflict.
  • Page 134 Audible Beeps: None Possible Cause: The temperature measured by one of the system temperature sensors has exceeded acceptable levels. In m any cases, this is due to the ambient inlet air temperature exceeding acceptable levels. Action: Be sure to follow all of the environmental requ irements for the server.
  • Page 135 1611-Fan x Failure Detected (Fan Zone CPU) Audible Beeps: 2 short Possible Cause: Requir ed fan is not installed or spinning. Action: Check the fans to be sure they are working. Be s ure each fan cable is properly connected, if applicable, and each fan is properly seated. If the problem persists, replace the failed fans.
  • Page 136 Be sure the assembly is properly connected and each fan is properly seated. If the problem persists, replac e the failed fans. If a known working replacement fan is not spinning, replace the assembly. 1611-Power Supply Zone Fan Assembly Failure Detected. Single fan..failure.
  • Page 137: 1700 Series

    1615-Power Supply Failure, Power Supply Unplugged, or Power Supply F Failure in Ba Audible Beeps: None Possible Cause: The power su pply has failed, or it is installed but not connected to the system board or AC power source. Action: Reseat the power supply firmly and check the power cable or replace power supply. 1616-Power Supply Configuration Failure -A workin er supply must be installed in Bay 1 for proper cooling.
  • Page 138 1711-S t lo Z Drive Array - RAID ADG Logical Drive(s) Configured but Array Accelerator Size < = 32 MB..This configuration is not recommended. Consider migrating logical drive(s) to RAID 5 or upgrading the Array Accelerator module. Audible Beeps: None Possible Cause: This con figuration is not recommended.
  • Page 139 1715-Slot X Drive Array Controller - Memory Error(s) Occurred ...Warnin g: Corrected Memory Error(s) were detected during controller memory self-test... Audible Beeps: None Possible Cause: The memory is be ginning to fail. Action: If this error persists, replace the controller. 1716-Slot z Drive Array Controller - Unrecoverable Media Errors Detected on Drives ...Errors will be fixed automatically when these...
  • Page 140 1720-Slot X Drive Array - S.M.A.R.T. Hard Drive(s) Detect Imminent Failure SCSI: Port Y: SCSI ID Z. Audible Beeps: None Possible Ca use: A hard drive SMART predictive failure condition is detected. It may fail at some time in the future. Action: •...
  • Page 141 1725-Slot X Drive Array-Optional SIMM Failure Detected Audible Beeps: None Possible Cause: SIMM has been automatically disabled due to memory errors or unsupported SIMM type installed. Action: Replace the SIMM memory module on the indicated controller. 1726-S lot X Drive Array - Array Accelerator Memory Size C hange Detected.
  • Page 142 1733-Slot z Drive Array - Storage Enclosure Firmware Upgrade Problem Detecte d..Port x b ox y: Enclosure firmware upgrade needed - run Flash Components. Port x box y: Unable to read firmware version of one or more components Audible Beeps: None Possible Cause: An incorrect enclosure firmware version is installed or an enclosure firmware upgrade is needed.
  • Page 143 • HP is not liable for blocked data access caused by improper TPM use. For operating instructions, see the encryption technology feature documentation provided by the operating system. 1737-Slot z Drive Array - Redundant Cabling Configuration has excess Device Paths..Redundant I/O paths to some devic es attached to the controller are exceeding per device limit by firmware.
  • Page 144 Audible Beeps: None Possible Cause: The system detects that the drives a re still present during POST and that the user previously requested them to be removed from the controllers configuration information. Actio n: Do one o f the following: •...
  • Page 145 Audible Beeps: None Possible Cause: The controller or firmware version does not support the attached drive enclosure. Action: Upg rade the controller or detach the expander-based storage connections. 1747-Slot z Drive Array - Unsupported Array Configuration Detected..Followe d by one of many possible reasons, including the following: More logical drives are configured than this controller can support when the memory module is detached.
  • Page 146 1754-Slot X Drive Array - RAID ADG configured but ADG is not supported on this control ler model. Audible Beeps: None Possible Cause: RAID ADG configured by ADG is not suppo rted on this controller model. n: Replace the controller with a model that supports RAID ADG. 1754-Slot z Drive Array - One or more RAID levels are configured...
  • Page 147 Expansion will resume when Arra y Accelerator battery reaches full charge. Expansion will resume when automatic data recovery has been completed. Audible Beeps: None Possible Cause: The capacity expansion process has been temporarily disabled. Action: Fo llow the action that is displayed onscreen to resume the capacity expansion process. 1768-Slot X rive Array - Resuming Logical Drive Expansion Process Audible...
  • Page 148 1774-Slot X Drive Array - Obsolete Data Foun d in Array Accelerator Audible Beeps: None Possible Cause: Drives were used on another controller and reconnecte d to the original controller while data was in the original controller cache. Data found in the array a lerator is older than data found on the drives and has been automatically discarde Action: Check the file system to determine whether any data has been lost.
  • Page 149 1776-Slot X Drive Array - Shared SAS Port Connection Conflict Detected - Ports 1I, 1E: Storage connections detected on both shared internal and external ports..Control ler selects internal port until connection is removed from one of the ports. Audible Beeps: None Action: For cable configuration information, refer to the controller documentati 1776-D ve Array Reports Improper SCSI Port 1 Cabling...
  • Page 150 • Check the LEDs. If the ProLiant Storage System power LED is amber instead of green, this indicates a redundant power s upply failure. • If the message ind icates to check SCSI cables: Compare the cabling against the diagrams in the HP Smart Array Controller User Guide. If the routing is correct, replace cables on the specified port until the POS T error message is eliminated.
  • Page 151 1784-Slot X rive Array Drive Failure. The Following SCSI Drive(s) Should Be Replaced: SCS I Port Y: SC SI ID Z Audible Beeps: None Possible Cause: Defective drive or SCSI cables detected. Action: Be sure all cables are connec ted properly and securely. Be sure all drives are fully seated.
  • Page 152 Audible Beeps: None Possible Cause: A failed or replacement drive has not yet been rebuilt. Action: • Perform one of the following actions: Press the F1 key to continue with recovery of data to the drive. Data will be automatically restored to drive X when a failed drive has been replaced, or to the original drive if it is working again without error Press the F2 key to contin...
  • Page 153 Replace drive X. (depending on the fault-tolerance l evel, all data may be lost if another drive fails). 1788-Slot X Drive Array Reports Incorrect Drive Replacement..The foll owing SCSI drive(s) should have been replaced: SCSI Port Y: SCSI ID Z. The following SCSI drive(s) were incorrectly replaced: SCSI Port y: SCSI ID z.
  • Page 154 • When powering down the system, the server must be powered down before external storage systems. 1792-Drive Array Reports Valid Data Found in Array Accelerator..Data will automatically be written to drive array. Audible Beeps: None Possible Cause: Power was interrupted while data was in the array accelerator memory. Power was then restored within several days, and the data in the array accelerator was flushed to the drive array.
  • Page 155: Event List Error Messages

    Audible Beeps: None Possible Cause: Array accelerator is defective or is missing. Depending on the array controller model, the cache may be disabled or the controller might not be usable until this problem is corrected. Action: Reseat the array accelerator daughter board if the connector is loose. If the problem persists, replace the board.
  • Page 156: A Cpu Power Module (System Board, Socket X)

    03/19/2002 12:54 PM FAN INSERTED Main System Location: System Board Fan ID: 03 **END OF EVENT** WARNING: To avoid potential problems, ALWAYS read the warnings and cautionary information in the server documentation before removing, replacing, reseating, or modifying system components. IMPORTANT: This guide provides information for multiple servers.
  • Page 157: Blue Screen Trap: Cause [Nt]

    Blue Screen Trap: Cause [NT]..Kernel Panic: Cause [UNIX] Abnormal Program Termination: Cause [NetWare] Event Type: System lockup Action: Refer to the operating system documentation. Corrected Memory Error Threshold Passed (Slot X, Memory Module Y)..Corrected Memory Error Threshold Passed (System Memory) Corrected Memory Error Threshold Passed (Memory Module Unknown) Event Type: Correctable error threshold exceeded Action: Continue normal operation, and then replace the memory module during the next scheduled...
  • Page 158: System Ac Power Overload (Power Supply X)

    System AC Power Overload (Power Supply X) Event Type: Power supply overload Action: Switch the voltage from 110 V to 220 V or add an additional power supply (if applicable to the system). If the problem persists, remove some of the install ed options.
  • Page 159: Uncorrectable Memory Error (Slot X, Memory Module Y)

    CAUTION: Only authorized technicians trained by HP should attempt to remove the system board. If you believe the system board requires replacement, contact HP Technical Support ("Contacting HP" on page 171) before proceeding. Action: Replace the board on which the processor is installed. Uncorrectable Memory Error (Slot X, Memory Module Y)...
  • Page 160 Location: Server blade management module Action: Perform the following steps to resolve the problem. Stop when the problem is resolved. Press the server blade management module reset button. Replace the server blade management module. Server blade management module signal backplane error codes LED code: 10-1, 10-2, or 10-3 Location: Server blade management backplane Action: Perform the following steps to resolve the problem.
  • Page 161 For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/servers/proliant-bl/p-class/info). Replace the interconnect device. For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/servers/proliant-bl/p-class/info). Interconnect B Error Code LED code: 14-1, 14-2, 14-3, or 14-4 Location: Interconnect device - side B Action: Perform the following steps to resolve the problem.
  • Page 162: Power Management Module Error Codes

    For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/servers/proliant-bl/p-class/info). Interconnect Module B (10-Connector) Error Code LED code: 16-1 or 16-2 Location: Interconnect module - side B (10-connector) Action: Perform the following steps to resolve the problem. Stop when the problem is resolved. Press the server blade management module reset button.
  • Page 163: Port 85 Codes And Ilo Messages

    Location LED codes Power Supply - Slot 2 2-1 or 2-2 Power Supply - Slot 3 3-1 or 3-2 Power Supply - Slot 4 4-1 or 4-2 Power Supply - Slot 5 5-1 or 5-2 Power Supply - Slot 6 6-1 or 6-2 Action: Perform the following steps to resolve the problem.
  • Page 164 IMPORTANT: Be sure the port 84/85 switch is set to display port 85 codes. Locate the code in the following table. For example, if the port 85 code displays "31h," see "Processor-related port 85 codes (on page 164)" for more information. Port 85 code Description Port 85 codes in this format indicate processor-related errors.
  • Page 165: Memory-Related Port 85 Codes

    Replace the processor in socket 1. Replace the processor board, if applicable. eplace the system board. IMPORTANT: If replacing the system board or clearing NVRAM, you must re-enter the server serial number through RBSU ("Re-entering the server serial number and product ID"...
  • Page 166: Miscellaneous Port 85 Codes

    Reseat all expansion boards. Bring the server to base configuration by removing all components that are not required by the server to complete POST. This process can include removing all: Expansion boards Processors, except the processor installed in socket 1 IMPORTANT: Processor socket 1 and PPM slot 1 must be populated at all times or the server does not function properly.
  • Page 167: Message Id: 4169

    IMPORTANT: If replacing the system board or clearing NVRAM, you must re-enter the server serial number through RBSU ("Re-entering the server serial number and product ID" on page 68). Windows® Event Log processor error codes Message ID: 4137 Severity: Error Description: The processor in slot X, socket X has corrected an excessive number of internal errors.
  • Page 168: Msg_Cpu_Rr_1

    Insight Diagnostics processor error codes MSG_ P C U_RR_1 Even t ty e: Unable to divide and multiply with zero and inf inity. Actio • Ensure proper ventilation and cooling for the server. • Ensure the processor heatsinks are attached correctly (do not remove them). •...
  • Page 169: Msg_Cpu_Rr_7

    • Replace the processor. _CPU_RR_7 Event type: CPU speed is out of range. Action: Replace the processor. MSG_ P C U_RR_8 Event type : Unable to update the CMOS time. Action: Replace the board that CMOS is on. MSG_ P C U_RR_9 Even t ty e: MMX hardware is not...
  • Page 170: Msg_Cpu_Rr_15

    MSG_ P C U_RR_15 Even t ty e: MMX pack/unpack instru ction has failed. Action: Replace the processor. MSG_ P C U_RR_16 Event type: MMX compare instruction has failed. Action: Replace the processor. MSG_CPU_RR_17 Event type: Stress integer math test has failed.
  • Page 171: Contacting Hp Technical Support Or An Authorized Reseller

    Contacting HP Contacting HP technical support or an authorized reseller Before contacting HP, always attempt to resolve proble ms by completing the procedures in this guide. IMPORTANT: Collect the appropriate server information ("Server information you need" on page 172) and operating system information ("Operating system information you need"...
  • Page 172: Server Information You Need

    Number of processors and speed Number of DIMMs and their size and speed List of controllers and NICs List of connected peripheral devices List of any other optional HP or Compaq hardware Network configu ration • ecific software information: Operating system information ("Operating system information you...
  • Page 173: Linux Operating Systems

    Event logs Dr. W atson log (drwtsn32.log) if a user mode application, such as the Insight Agents, is having a problem IRQ and I/O address information in text format • An updated Emergency Re pair Diskette • If HP drivers are installed: Version of the PSP used List of drivers from the •...
  • Page 174: Novell Netware Operating Systems

    • If HP drivers are installed: Version o f the PSP used List of drivers from the PSP (/var/log/hppldu.log) • A list of each third-party hardware component installed, with the firmware revisions • A list of each third-party software component installed, with the versions •...
  • Page 175: Ibm Os/2 Operating Systems

    Output of /etc/hwconfig command Output of /usr/bin/swconfig comm Output of /etc/ifconfig command /etc/conf/cf.d/sdevice /etc/inittab /etc/conf/cf.d/stune /etc/conf/cf.d/config.h /etc/conf/ cf.d/sdevice /var/adm/messages (if PANIC messages are displayed) • If HP drivers are installed: Version of the EFS used List of drivers from the EFS •...
  • Page 176: Sun Solaris Operating Systems

    Number and size of partitions and logical drives File system on each logical drive • Warp Server version used and: Whether Entry, Advanced, Advanced with SMP, or e-Business All services runnin g at the time the problem occurred • A list of each third-party hardware component installed, with the firmware revisions •...
  • Page 177 Acro nyms and abbreviations ACPI Advanced Configuration and Power Interface Array Configuration Utility Advanced Data Guarding (also known as RAID 6) Array Diagnostics Utility Advanced Memory Protection Automatic Server Recovery BIOS Basic Input/Output System CCITT International Telegraph and Telephone Consultative Committee CMOS complementary metal-oxide semiconductor central processing unit...
  • Page 178 drive r update error checking and correcting Extended Feature Supplement EISA Extended Industr y Standard Architecture electr ostatic discharge FBDIMM fully buffered DIM HP SIM HP Systems Insight Manager HTTP hypertext transfer protocol input/output integrated device electronics Integrated Ligh ts-Out Integrated Lig hts-Out 2 Integrated Management Displa...
  • Page 179 Internet Proto col interrupt r equest Internet service provider keyboard, video, and mouse light-emitting diode low-voltage differential multimedia extensions non-mask able interrupt NVRAM non-volatile memory OBDR One Button Disaster Recov ORCA Option ROM Configuration for Arrays operating system physical to virtual peripheral compon ent interface Acronyms and abbreviations 179...
  • Page 180 PCI-X peripheral comp onent interconnect extended Performance Management Pa POST Power-On Self Test process or power module ProLiant Support Pack Preboot Execution Environment RAID redundant array of inexpensive (or inde pendent) dis RBSU ROM-Based Setup Utility Rapid Dep loyment Pack RILOE Remote Insigh t Lights-Out Edition...
  • Page 181 SATA serial ATA Systems Insigh t Manager single inline memory mod SNMP Simple Network Manage ment Protocol Service Pa ck 1 support software d iskette TCP/IP Transmission C ontrol Protocol/Internet Protocol trusted platform module uninterruptible power system universal serial bus virtual to physical virtual to v irtual...
  • Page 182 video gra phics array Acronyms and abbreviations 182...
  • Page 183 Index 120PCI.HAM 59 backup issue, tape drive 50 backup, restoring 60 batteries, insufficient warning when low 37 battery 37, 98, 110, 133 accelerator error log 90 battery pack, array accelerator 141 accelerator status 91, 92, 93 beep codes 114 ACPI support 59 BIOS Serial Console 66 ACU (Array Configuration Ut ility) 67...
  • Page 184 contacting technical support 171 DVD-ROM drive 40 controller failure 139 controller state 111 controllers, array 101, 107, 138, 139, 145, 146 ECC errors 91 creating a disk image 68 electrostatic discharge 15 critical error 115 end user license agreement (EULA) 82 CSR (customer self repair) 171 energy saver features 52 customer self repair (CSR) 171...
  • Page 185 installation and configuration 87 installation instructions 86, 87 hard drive LEDs 19, 42 Integrated Lights-Out (iLO) 69, 88, 123 hard drive problems, diagnosing 42 Integrated Management Log (IML) 75, 155 hard drive, failure of 43, 140 interim recovery mode 152 hard drives 18, 19, 42 internal health LED 19 hard drives, determining status of 19...
  • Page 186 memory, RAID 66, 129 PCI slots 120 memory-related port 85 codes 165 performance optimization scan 141 Microsoft operating systems 172 phone numbers 171 mirrored memory 66, 67, 115 physical drive state 113 miscellaneous port 85 codes 166 port 85 code, expansion board-related 165 modem problems 54 port 85 code, list 163 modems 54, 55...
  • Page 187 serial number 68 server blade management module error codes 159 rack stability 14 server documentation 85, 86, 87, 88 rack warnings 14 server fault indications flowchart 34, 35 RAID configuration 146 server features and options 86 RAID memory 66, 115 server management 88 RBSU (ROM-Based Setup Utility) 40, 65, 66 server setup 88, 93...
  • Page 188 system not supported 63 system ROM 79, 83 warnings 14, 86 System ROMPaq Firmware Upgrade Utility 82 website, HP 85, 86 Systems Insight Manager 70 websites, reference 22, 85 what's new 9 white papers 85, 87 tape drives 49 Windows Event Log processor error codes 167 tape drives, failure of 49 teardown procedures 86 technical support 171...

Table of Contents