Page 1
This document describes common procedures and solutions for the many levels of troubleshooting servers. This document is intended for the person who installs, administers, and troubleshoots servers, server blades, and HPE Apollo systems. Hewlett Packard Enterprise assumes you are qualified to service computer equipment and are trained in recognizing hazards in products with hazardous energy levels.
Common issue resolution..............20 Resolving loose connections...................... 20 Searching for service notifications....................20 Firmware updates........................20 Server updates with an HPE Trusted Platform Module and BitLocker enabled....21 DIMM handling guidelines......................21 DIMM and NVDIMM population information................22 Component LED definitions......................22 SAS, SATA, and SSD drive guidelines................22...
Page 4
Data is inaccessible....................81 Server response time is slower than usual............82 HPE SmartDrive icons or LEDs illuminate errors for the wrong drive or an error message is displayed in POST, HPE SSA, or HPE SSADUCLI....82 SSD Smart Wear error..................83 512e Physical drive support..................
Page 5
Data failure or disk errors on a server with a 10SFF drive backplane or a 12LFF drive backplane..................96 HPE Smart Array S100i SR Gen10 drives are not found when RAID mode is disabled......................... 96 HPE Smart Array S100i SR Gen10 drives are not recognized......97 Fan and thermal issues....................97...
Page 6
Network controller or FlexibleLOM stopped working when an expansion board was added....................124 Network interconnect blade issues..............125 HPE Smart Storage Battery issues................125 HPE Smart Storage Battery might lose charge when shelved for long periods of time........................125 HPE Smart Storage Battery configuration error..........126 HPE Smart Storage Battery failure..............126 Cable issues..........................
Page 7
Downloading the AHS Log using the AHS CLI for Linux........157 Clearing the AHS log using iLO..................157 Using AHSV to troubleshoot or open a support case.............158 Integrated Management Log.....................158 Viewing the IML......................159 HPE SSA diagnostic tasks......................159 HPE Smart Storage Administrator Diagnostics Utility CLI............160 About the utility.......................160 Contents...
Page 8
Linux reports..........................165 Resources for troubleshooting............166 Online resources........................166 Hewlett Packard Enterprise Support Center website............. 166 Hewlett Packard Enterprise Information Library.............166 Troubleshooting resources for previous HPE ProLiant server models......166 Server blade enclosure troubleshooting resources............166 Troubleshooting resources.....................166 Server documentation....................167 Server user guide....................167 Server maintenance and service guide...............
Page 9
Software installation and configuration of the server............. 172 HPE iLO information.......................172 Management of the server..................... 172 Installation and configuration information for the server management system....172 Fault tolerance, security, care and maintenance, configuration and setup....172 Websites....................173 Support and other resources.............174 Accessing Hewlett Packard Enterprise Support...............
If a server exhibits symptoms that do not immediately pinpoint the issue, use this section to begin troubleshooting. The section contains a series of flowcharts that provide a common troubleshooting process for HPE ProLiant servers. The flowcharts identify a diagnostic tool or a process to help solve the issue.
Additional troubleshooting resources are available for HPE Gen10 server products in the following documents: • Error Message Guide for HPE ProLiant Gen10 servers and HPE Synergy provides a list of error messages and information to assist with interpreting and resolving error messages. •...
Active Health System Viewer. For more information, see the AHSV user guide at the Hewlett Packard Enterprise website (http:// www.hpe.com/support/ahsv-docs). 5. Prepare the server for diagnosis. 6. To begin the diagnostic process, see Initial diagnosis.
This symbol indicates the presence of hazardous energy circuits or electric shock hazards. Refer all servicing to qualified personnel. WARNING: To reduce the risk of injury from electric shock hazards, do not open this enclosure. Refer all maintenance, upgrades, and servicing to qualified personnel. This symbol indicates the presence of electric shock hazards.
Page 14
WARNING: To reduce the risk of personal injury or damage to the equipment, consult the safety information and user documentation provided with the server before attempting the installation. Some servers contain high energy circuits, high current circuits, moving parts (such as fan blades), or any combination of these hazards, that may be exposed if covers and access panels are removed while the product is connected to a power source.
CAUTION: To properly ventilate the system, you must provide at least 7.6 cm (3.0 in) of clearance at the front and back of the server. CAUTION: The server is designed to be electrically grounded (earthed). To ensure proper operation, plug the AC power cord into a properly grounded AC outlet only.
Page 16
• Does the server power on? • Does the server complete POST? ◦ If not, then what do the server LEDs indicate? Which LEDs are illuminated? Which LEDs are illuminated but not flashing? Which LEDs are flashing and at what rate are they flashing? What color is the LED? Are LEDs flashing in unison? ◦...
Download the Active Health System Log. For more information, see the Active Health System Viewer documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ahsv-docs). g. Record survey data. 7. To diagnose the server offline, power down the server and peripheral devices. If possible, always...
a. Exit any applications. b. Exit the operating system. c. Power down the server. 8. Disconnect any peripheral devices not required for testing (any devices not necessary to power up the server). More information Active Health System Log on page 139 Active Health System Viewer on page 138 Server user guide on page 167 Scripting Toolkit for Windows and Linux on page 143...
Page 19
Before removing the components, be sure to determine the minimum configuration for each component and follow all guidelines in the server user guide. Always use the recommended minimum configuration above before removing any DIMMs. If you are unable to isolate the issue with the configuration above, you will then remove all but the minimum number of associated DIMMs for each processor as directed by the Hewlett Packard Enterprise Support Center.
Service notifications are created to provide solutions for known issues. Check to see if your issue is covered by an existing service notification. Procedure 1. See the Hewlett Packard Enterprise website (http://www.hpe.com/support/hpesc). 2. Enter the product name or number, and then press Enter. 3. To show the documents available for your product, select Documents.
More information Service Pack for ProLiant on page 147 Subscription services on page 169 Server updates with an HPE Trusted Platform Module and BitLocker enabled When a TPM is installed and enabled in UEFI System Utilities, and when the Microsoft Windows...
Drives must be of the same capacity to provide the greatest storage space efficiency when drives are grouped in the same drive array. • Drives in the same logical volume must be of the same type. HPE SSA does not support mixing SAS, SATA, and SSD drives in the same logical volume. Hot-plug drive LED definitions...
Item LED Status Definition No drive activity Do not Solid white Do not remove the drive. Removing the drive causes one or more of remove the logical drives to fail. Removing the drive does not cause a logical drive to fail. Drive Solid green The drive is a member of one or more logical drives.
Item Status Definition Flashing amber/blue The drive has failed, or a predictive failure alert has been received for this drive; it also has been identified by a (1 flash per second) management application. Flashing amber A predictive failure alert has been received for this drive. Replace the drive as soon as possible.
Item LED Status Definition Locate Solid blue The drive is being identified by a host application. Flashing blue The drive carrier firmware is being updated or requires an update. Activity Rotating green Drive activity ring No drive activity Drive Solid green The drive is a member of one or more logical drives.
Flashing Red = Critical condition Front panel LED and buttons Standard descriptions for most HPE ProLiant Gen10 servers are provided in this section. HPE ProLiant Gen10 server blades have a Health status LED bar instead of the Health LED described in this section.
When all four front panel LEDs flash simultaneously, a power fault has occurred. • The server UID also flashes to indicate the status of the iLO Service Port. For more information, see the iLO documentation at http://www.hpe.com/support/iLO-docs. • When the system power LED is off, one of the following is true: ◦...
◦ Power supplies are not seated properly. ◦ No power supplies are installed. ◦ A power supply failure has occurred. ◦ The power backplane, if required, is not seated properly. ◦ The PDU has issues. ◦ The Power On/Standby Button cable is disconnected. •...
Page 29
2. Press the UID button again to close the Server Health Summary screen. Server Health Summary details Server screen thumbnail A thumbnail image of the server screen. Server Power The server power status. Product Name The server model. Serial Number The server serial number.
HPE iLO iLO is available for all HPE ProLiant and HPE Synergy servers. iLO consists of an intelligent processor and firmware that allows for remote server management. The iLO VSP provides bidirectional data flow with a server serial port. Using VSP, you can operate as if a physical serial connection exists on the remote server serial port.
6. To terminate the remote access SSH session, close the communication software or enter Exit at the CLI command prompt. For more information, see the HPE Virtual Connect Manager Command Line Interface for c-Class BladeSystem User Guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/vc/ manuals).
a. Review all installed Subsystems and Devices and verify that all devices show the OK status with a green icon. b. If any degraded subsystems or devices exist, then click the degraded subsystems or device to review the current status. 4.
7. If the blade is not displayed in the Insight Display on the chassis or within the Onboard Administrator GUI, then troubleshoot the issue further using the procedures in the HPE BladeSystem c-Class Enclosure Troubleshooting Guide on the Hewlett Packard Enterprise website (http://www.hpe.com/ support/BladeSystem_Enclosure_TSG_en).
Page 35
4. To search for support information about the issue (if available), go to the Hewlett Packard Enterprise website (http://www.hpe.com/support/hpesc) and do the following: a. Enter HPE BladeSystem Onboard Administrator in the search field and press Enter. b. Select Documents from the results list at the left side of the page.
• Symptom information • • POST error messages • OA SHOW ALL report (for HPE ProLiant BL Server Blades only) • Other More information Active Health System Viewer on page 138 Using the OA CLI on page 34 Collecting symptom information on page 15...
Controller issues flowchart on page 53—A controller is not shown during POST or if your controller shows errors. • HPE Smart Storage Battery issues for Smart Array controllers on page 56 — An HPE Smart Storage battery is not recognized or is not charging. •...
Figure 1: Remote troubleshooting flowchart More information Active Health System Viewer on page 138 Using Onboard Administrator for remote troubleshooting of server blades on page 33 Using iLO for remote troubleshooting of servers and server blades on page 32 Accessing Hewlett Packard Enterprise Support on page 174 Power-on issues flowchart Server power-on issues flowchart with ML and DL series servers Symptom...
Page 39
To resolve the issue, follow the steps in the flowchart. Figure 2: Server power-on issues flowchart for ML and DL series servers For the location of server LEDs and information on their statuses, see the server documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). Diagnostic flowcharts...
Cause • The server is not properly installed in the chassis. • The server is not configured to automatically power on in HPE iLO. • The power being supplied is not sufficient for the server installed in the chassis. •...
Page 41
Figure 3: Server power-on issues flowchart for XL series servers For the location of server LEDs and information on their statuses, see the server documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs).
• The Onboard Administrator module is not properly installed in the enclosure. • A possible communication failure between HPE iLO and the Onboard Administrator is causing the server blade to wait for permission to power on. • The server blade has a mismatched fabric installed on the mezzanine 1 connector or the mezzanine 2 connector.
Page 43
Figure 4: Power on issues flowchart for BL series server blades (page 1 — Remote troubleshooting) For more information about the BladeSystem c-Class enclosure, see the documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/bladesystem/docs). For removal and replacement procedures, see the server blade maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/bladesystem/docs).
Page 44
Enterprise website (http://www.hpe.com/info/bladesystem/docs). For removal and replacement procedures, see the server blade maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/bladesystem/docs). For more information about the Active Health System Viewer, see the AHSV user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ahsv-docs).
Front panel LED power fault codes on page 30 Accessing Hewlett Packard Enterprise Support on page 174 POST issues flowchart Symptom • The server does not complete POST • The server completes POST with errors Cause • Improperly populated memory •...
Figure 6: POST issues flowchart More information Power-on issues flowchart on page 38 POST issues—Server hang or reboot during POST flowchart on page 46 POST issues—Boot, no video flowchart on page 49 OS boot issues flowchart on page 50 POST issues—Server hang or reboot during POST flowchart Symptom The server hangs or reboots during POST and does not boot.
Page 47
Figure 7: POST issues—Server hangs or reboots during POST flowchart For the location of server LEDs and information on their statuses, see the server documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs).
Figure 8: POST issues—No boot, no video flowchart For the location of server LEDs and information on their statuses, see the server documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For more information about the Active Health System Viewer, see the AHSV user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ahsv-docs).
To resolve the issue, follow the steps in the flowchart. Figure 9: POST issues—Boot, no video flowchart For the location of server LEDs and information on their statuses, see the server user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). POST issues—Boot, no video flowchart...
For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For more information about the Active Health System Viewer, see the AHSV user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ahsv-docs).
Figure 10: OS boot issues flowchart For more information about UEFI System Utilities, see the UEFI System Utilities documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/UEFI/docs). For more information about the Active Health System Viewer, see the AHSV user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ahsv-docs).
Page 52
Figure 11: Intelligent Provisioning issues flowchart For more information about Intelligent Provisioning, see the Intelligent Provisioning documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/intelligentprovisioning-docs). For more information about UEFI System Utilities, see the UEFI System Utilities documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/UEFI/docs).
For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). More information Drive issues (hard drives and solid state drives) on page 79 Operating system issues and resolutions on page 129...
Page 55
Figure 13: Controller issues flowchart (2 of 2) For more information about the Active Health System Viewer, see the AHSV user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ahsv-docs). For more information about error messages, see the error message guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/gen10-troubleshooting).
HPE Smart Storage Battery issues for Smart Array controllers Symptom • The battery is not recognized. • The battery is not charging. • The battery or cache module is reported as failed. Cause • The battery is unsupported on this server.
Page 57
For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). 2. If the HPE Smart Storage battery issues did not resolve the issue, follow the steps in the following flowchart. Diagnostic flowcharts...
Figure 15: HPE Smart Storage Battry errors flowchart For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). More information HPE Smart Storage Battery issues on page 125 Gather important information before you begin on page 36...
Page 59
Figure 16: Physical drive issues flowchart For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For installation procedures, see the server user guide on the Hewlett Packard Enterprise website (http:// www.hpe.com/info/proliantgen10-docs).
For more information about error messages, see the error message guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/gen10-troubleshooting). More information HPE Smart Storage Administrator Diagnostics Utility CLI on page 160 Updating firmware or system ROM on page 147 Gather important information before you begin on page 36...
For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For more information about UEFI System Utilities, see the UEFI System Utilities documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/UEFI/docs). More information Product QuickSpecs on page 169...
Server fault indications flowchart for non-blade servers Symptom • The server boots, but a fault event is reported in the IML. • The server boots, but the system health LED or component health LED is red or amber. Cause • Improperly seated or faulty internal or external component •...
Page 63
Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). More information Power-on issues flowchart on page 38 HPE Smart Storage Administrator Diagnostics Utility CLI on page 160 Software and configuration utilities on page 138 Viewing the IML on page 159 Gather important information before you begin on page 36...
Server blade fault indications flowchart for BL c-Class server blades Symptom • The server boots, but a fault event is reported in the IML. • The server boots, but the system health LED is red or amber. Cause • Improperly seated or faulty internal or external component •...
Page 65
For server blade LED information, see the server blade documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/bladesystem/docs). For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/bladesystem/docs). More information Drive issues (hard drives and solid state drives) on page 79...
NIC issues flowchart Symptom • The NIC is not working. • One or more ports on the NIC is not working. Cause • The firmware or drivers are outdated, mismatched, or faulty. • The NIC or cable is not seated properly. •...
Page 67
Figure 20: NIC issues flowchart (1 of 2) Diagnostic flowcharts...
Page 68
Figure 21: NIC issues flowchart (2 of 2) For more information about the Active Health System Viewer, see the AHSV user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ahsv-docs). For more information about error messages, see the error message guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/gen10-troubleshooting).
General diagnosis flowchart Symptom If your symptoms are not clear, use this flowchart. However, before you begin, gather important for use during troubleshooting. For more information, see Gather important information before you begin on page 36. The General diagnosis flowchart provides a generic approach to troubleshooting. If you are unsure of the issue, or if the other flowcharts do not fix the issue, use the following flowchart.
Page 70
For removal and replacement procedures, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For more information about UEFI System Utilities, see the UEFI System Utilities documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/UEFI/docs). More information Power-on issues flowchart on page 38...
Hardware issues Procedures for all ProLiant servers The procedures in this section are comprehensive and include steps about or references to hardware features that may not be supported by the server you are troubleshooting. CAUTION: Before removing or replacing any processors, be sure to follow the guidelines provided in Processor troubleshooting guidelines.
For more information, see the following documents: • HPE Power Capping and HPE Dynamic Power Capping for ProLiant servers technology brief on the Hewlett Packard Enterprise website (http://www.hpe.com/support/hpdpc). • HPE BladeSystem Onboard Administrator User Guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/BladeSystem/docs).
For product-specific information, see the server documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). For more information, see the HPE Power Advisor on the Hewlett Packard Enterprise website (http:// www.hpe.com/info/poweradvisor). 4. If running a redundant configuration, be sure that all the power supplies have the same spare part number and are supported by the server.
If the issue is no longer present, then additional power supplies are required. c. If the issue persists, check the IML for system information. For more information, see the server documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/info/enterprise-docs). More information Product QuickSpecs on page 138...
5. Be sure that the line cord is connected. 6. Be sure that each circuit breaker is in the ON position, or replace the fuse if needed. If this occurs repeatedly, contact an authorized service provider. 7. Check the UPS LEDs to be sure that a battery or site wiring issue has not occurred. For more information, see the UPS documentation.
Be sure that the issue is not caused by a change to the hardware release. For more information, see the release notes included with the hardware. If no documentation is available, see the Hewlett Packard Enterprise website (http://www.hpe.com/support/hpesc). Be sure that the new hardware is installed properly.
For more information on the UEFI System Utilities, see the UEFI System Utilities user guide in the UEFI Information Library on the Hewlett Packard Enterprise website (http://www.hpe.com/info/ UEFI/docs). 10. Be sure that all switch settings are set correctly.
2. Verify that the latest device drivers are installed. 3. Verify that the device is installed properly. For more information on which PCIe technology is supported and for the slot PCIe bus width, see the server QuickSpecs at http://www.hpe.com/info/qs. Third-party device issues...
Testing the device Procedure 1. Uninstall the device. If the server works when the device is removed, then one of the following issues exists: • An issue exists with the device. • The server does not support the device. • The device conflicts with another device.
5. Be sure drive blanks are installed properly when the server is operating. Drives may overheat and cause sluggish response or drive failure. 6. Run HPE SSA and check the status of the failed drive. 7. Be sure the replacement drives within an array are the same size or larger.
The HPE Smart Array S100i SR Gen10 and the AHCI do not support SAS drives. 14. If the HPE Smart Array S100i SR Gen10 is installed on the server, be sure that RAID mode is enabled and UEFI boot mode in UEFI System Utilities.
2. Review information about the operating system encryption technology, which can cause a decrease in server performance. For more information, see the operating system documentation. 3. Use HPE SSA to verify that a recovery operation is not pending on the logical drive. More information...
512e drives, the native block boundary is 4K. HPE SSA displays a warning indicating that the logical drive is not optimally aligned and that performance of the logical drive is not optimal. Additionally, if one or more of the following scenarios are met, the array presents a "Realign Logical Drive"...
For information about error messages, see the ProLiant Servers Troubleshooting Guide. • HPE Smart Storage Administrator Diagnostics Utility CLI This standalone diagnostic utility provides configuration and error information about array controllers, storage enclosures, drive cages, logical drives, physical drives, and tape drives. For any supported SSDs, the utility provides current usage level and remaining expected lifetime.
Page 85
For data in the logical drive to be readable, the data block sequence must be the same in every stripe. This sequencing process is performed by the array controller, which sends the data blocks to the drive write heads in the correct order. A natural consequence of the striping process is that each physical drive in a given logical drive will contain the same amount of data.
Page 86
For any configuration except RAID 0, further protection against data loss can be achieved by assigning a drive as an online spare (or hot spare). This drive contains no data and is connected to the same controller as the array. When any other physical drive in the array fails, the controller automatically rebuilds information that was originally on the failed drive to the online spare.
Page 87
This method has the following benefits: • Useful when performance and low cost are more important than data protection. • Has the highest write performance of all RAID methods. • Has the lowest cost per unit of stored data of all RAID methods. •...
Page 88
long as no two failed drives belong to the same mirrored pair. The total drive count must increment by 2 drives. This method has the following benefits: • It is useful when high performance and data protection are more important than usable capacity. •...
Page 89
When the array has more than three physical drives, drives are mirrored in trios, and the fault-tolerance method is known as RAID 10 (ADM). If a physical drive fails, the remaining two drives in the mirrored trio can still provide all the necessary data. Several drives in the array can fail without incurring data loss, as long as no three failed drives belong to the same mirrored trio.
Page 90
This method has the following benefits: • It is useful when usable capacity, write performance, and data protection are equally important. • It has the highest usable capacity of any fault-tolerant configuration. • Data is not lost if one physical drive fails. RAID 6 RAID 6 protects data using double parity.
Page 91
• It is useful when data protection and usable capacity are more important than write performance. • It allows any two drives to fail without loss of data. RAID 50 RAID 50 is a nested RAID method in which the constituent hard drives are organized into several identical RAID 5 logical drive sets (parity groups).
Page 92
The number of physical drives must be exactly divisible by the number of parity groups. Therefore, the number of parity groups that you can specify is restricted by the number of physical drives. The maximum number of parity groups possible for a particular number of physical drives is the total number of drives divided by the minimum number of drives necessary for that RAID level (three for RAID 50, 4 for RAID 60).
Selecting a RAID method Not all controllers support all RAID levels. To determine the RAID capabilities of your controller, see the model-specific information for your controller on the Hewlett Packard Enterprise website. Most important Also important Suggested RAID level criterion Fault tolerance Cost effectiveness RAID 6...
The HPE Smart Storage Battery is not installed. • The HPE Smart Storage Battery is not connected to the system board properly. • The cache module cable is not connected to the PCIe riser board (for controllers installed on a PCIe riser board).
4. Verify that the controller firmware versions are compatible and current. 5. Verify that the controller cache sizes are compatible. 6. Verify that the HPE Smart Storage Battery is installed and connected properly. 7. Verify that all controller cabling is connected properly.
The HPE Smart Array S100i SR Gen10 drives are not found when RAID mode is disabled. Cause When an HPE Smart Array S100i SR Gen10 is enabled on a server and RAID mode is disabled in the UEFI System Utilities, then the drives are listed as AHCI drives or HPE H220i drives and the RAID controller is not found in POST or device manager.
HPE Smart Array S100i SR Gen10 drives are not recognized Symptom When installing an OS, the OS installation does not recognize the HPE Smart Array S100i SR Gen10 drives. Action Manually install the HPE Smart Array S100i SR Gen10 drivers.
Verify the fan airflow path is not blocked by cables or other material. 10. For HPE BladeSystem c-Class enclosure fan issues, review the fan section of Onboard Administrator SHOW ALL and the FAN FRU low-level firmware. For more information, see the HPE BladeSystem c-Class Enclosure Troubleshooting Guide on the Hewlett Packard Enterprise website (http:// www.hpe.com/support/BladeSystem_Enclosure_TSG_en).
4. Verify that the correct processor heatsink is installed. 5. Verify that the correct fan is installed, if the system supports both standard fans and performance fans. More information Updating firmware or system ROM on page 147 Server documentation on page 167 Excessive fan noise (high speeds) Symptom Fans are operating at high speeds with excessive noise.
Server documentation on page 167 HPE BladeSystem c-Class enclosure fans are operating at a high speed Symptom All fans in an HPE BladeSystem c-Class enclosure are operating at a high speed while fans in the other enclosures are operating at normal speed. Action 1.
Page 101
Cause • The memory does not meet server requirements. • A DIMM has failed. • Third-party memory is installed on the server. • The DIMM is not properly seated. Action • Isolate and minimize the memory configuration. Use care when handling DIMMs. •...
• Verify that the DIMMs are supported on the server. For a list of supported options, see the product QuickSpecs on the Hewlett Packard Enterprise website (http://www.hpe.com/info/qs). • Be sure that the associated processor is installed for all DIMMs on the server.
2. Verify that the associated processor is installed for all DIMMs in the server. For more information on which DIMM slots are associated to each processor, see the server documentation in the Hewlett Packard Enterprise Information Library (http://www.hpe.com/info/ docs). 3. Verify that the memory is configured properly.
Be sure that the memory is the correct type for the server. For a list of supported options, see the product QuickSpecs on the Hewlett Packard Enterprise website (http://www.hpe.com/info/qs). Be sure that the memory is installed according to the server requirements.
• A system “hang” • A system “freeze” • • Server restarts or powers down unexpectedly • Parity errors occur Cause • The DIMM is not installed or seated properly. • The DIMM has failed. Action 1. Reseat the DIMM. 2.
For more information about error messages, see the error message guide for your server on the Hewlett Packard Enterprise website (http://www.hpe.com/info/gen10-troubleshooting). 2. Verify that all DIMMs are installed according to the population guidelines. For more information, see the DIMM population guidelines on the Hewlett Packard Enterprise website (http://www.hpe.com/docs/memory-population-rules).
HPE ProLiant Gen10 (http://www.hpe.com/info/proliantgen10-docs) • HPE Synergy (http://www.hpe.com/info/synergy-docs) 5. Verify that a supported HPE Smart Storage Battery is properly installed and operating in the server. For a list of supported components, see the server QuickSpecs on the Hewlett Packard Enterprise website (http://www.hpe.com/info/qs).
Select Enabled. c. Save your settings. For more information, see the UEFI System Utilities user guide for your product on the Hewlett Packard Enterprise website (http://www.hpe.com/info/uefi/docs). The persistent memory drive is read only Symptom A persistent memory drive marked as "read only" usually indicates that the underlying NVDIMM detected is not armed for backup during the boot process.
Action Reboot the system and wait for the backup energy source to sufficiently charge. Solution 4 Cause The backup energy source is not sufficiently charged and the charging process timed out. Action 1. Check the IML for additional backup power source-related messages and perform any recommended actions.
Action 1. Review the IML for messages indicating the NVDIMM controller error. 2. Perform the steps indicated in the IML. HPE Scalable Persistent Memory issues The persistent memory drive does not appear in the OS Symptom When a persistent memory drive does not appear in the OS, this symptom usually indicates that the underlying NVDIMM is mapped out.
Action Remove the physical NVDIMM from the server. For more information about removing NVDIMMs from the server, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). The persistent memory drive is read only Symptom A persistent memory drive marked as "read only" usually indicates that the underlying NVDIMM detected is not armed for backup during the boot process.
Page 112
Action Check the IML for additional errors and perform the recommended action. Solution 4 Cause The persistent memory backup power requirements exceed the available backup battery power. Action Remove any hardware that was recently added to the server. Solution 5 Cause The backup energy source is not sufficiently charged and the charging process was skipped by the user.
1. To locate a message that indicates the failed backup storage device, review the IML. 2. Replace the failed backup storage device indicated by the IML. For more information about replacing backup storage devices, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10- docs). Solution 2 Cause A backup storage device is missing.
3. If the issues still exist, replace the backup power source. Unable to configure Scalable Persistent Memory Cause HPE Scalable Persistent Memory functionality is disabled because a physical NVDIMM-N is installed in the server. Action Remove the physical NVDIMM-N from the server.
For more information about removing the NVDIMM-N, see the server maintenance and service guide on the Hewlett Packard Enterprise website (http://www.hpe.com/info/proliantgen10-docs). Processor issues Troubleshooting the processor Symptom A POST error message or an IML message is received. Cause • One or more processors are not supported by the server.
Uncorrectable machine check exception Symptom A POST error message or an IML message is received indicating an uncorrectable machine check exception. Action CAUTION: Before removing or replacing any processors, be sure to follow the guidelines provided in Processor troubleshooting guidelines on page 18. Failure to follow the recommended guidelines can cause damage to the system board, requiring replacement of the system board.
Hewlett Packard Enterprise recommended repair actions based on experience and best practices. If you are you unable to resolve the issues, submit a support case through AHSV. For more information, see the AHSV user guide on the Hewlett Packard Enterprise website (http://www.hpe.com/support/ ahsv-docs).
Cause • The drive boot order is not set to boot from the microSD card. • The microSD card is not detected by iLO. • The microSD card is not seated properly. Action 1. Be sure the drive boot order in the UEFI System Utilities is set so that the server boots from the microSD card.
Action • Use only cards listed as a supported option for the server. For a complete list of supported options, see the product QuickSpecs on the Hewlett Packard Enterprise website (http://www.hpe.com/info/ qs). • Be sure that the power supplies installed in the server provide adequate power to support the server configuration.
If the video expansion board is installed in a PCI hot-plug slot, be sure that the slot has power by checking the power LED on the slot, if applicable. See the server documentation on the Hewlett Packard Enterprise website (http://www.hpe.com/ info/enterprise-docs).
Monitor does not function properly with energy saver features Symptom The monitor does not function properly with energy saver features. Cause The monitor does not support energy saver features. Action Verify that the monitor supports energy saver features. If the monitor does not support energy saver features, disable the features.
Action Move the monitor away from other monitors or power transformers. Mouse and keyboard issues Symptom The mouse and/or keyboard are not operating normally. Action 1. Verify that all cables and cords are securely and properly connected. Check the following: •...
When replacing an expansion board on a BitLocker-encrypted server, always disable BitLocker before replacing the expansion board. If BitLocker is not disabled, the system requests the recovery method selected when BitLocker was configured. Failure to provide the correct recovery password or passwords results in loss of access to all encrypted data.
Action 1. Check the network controller or FlexibleLOM LEDs to see if any statuses indicate the source of the issue. For LED information, see the network controller documentation or server documentation. 2. Be sure the correct network driver is installed for the controller and that the driver file is not corrupted. Reinstall the driver.
Any server configured with an HPE Smart Storage Battery for HPE Smart Array Controllers might display a POST error message stating that the cache module or the HPE Smart Storage Battery failed. Cause The HPE Smart Storage Battery discharged to a threshold where it is permanently disabled and must be replaced. Action 1.
1. Do one of the following: • Ensure that the HPE Smart Storage Battery is fully charged. It may take up to 120 minutes in a powered server or chassis for the HPE Smart Storage Battery to charge to support the number of battery-backed devices in the server or chassis.
Action • Verify that the HPE Smart Storage Battery is installed and cabled properly. For more information, see the HPE Smart Array Controllers User Guide at http://www.hpe.com/info/SmartArrayGen10-docs. • Verify the HPE Smart Storage battery status in iLO. • Update the system ROM.
Page 128
• Attach a USB device that requires a power source less than 500mA. • Attach an externally powered USB hub to the SUV cable and connect the USB device to the hub. Hardware issues...
If you are unable to resolve the issues, submit a case using the Active Health System Viewer. For more information, see the Active Health System Viewer documentation at http://www.hpe.com/support/ahsv-docs. Errors are displayed in the error log Symptom Error messages are displayed in the error log.
Updating the operating system Prerequisites for updating the operating system Before updating the operating system, read the release notes for each update. Use care when applying operating system updates (Service Packs, hotfixes, and patches). If you do not require specific fixes from the update, Hewlett Packard Enterprise recommends that you do not apply the updates.
For troubleshooting information specific to Linux operating systems, see the Linux for ProLiant website at http://www.hpe.com/info/proliantlinux. To assist in possible LINUX installation issues on HPE ProLiant servers, capture either the sosreport or supportconfig before contacting Hewlett Packard Enterprise technical support.
Cause The new system settings are not supported. Action Check the system logs to determine what changes were made, and then change the settings back to the original configuration. Errors occur after the system software is changed Symptom Errors occur after the system software is changed. Action Change the settings back to the original configuration.
Action 1. Correct the command-line syntax. 2. Restart the process. Access denied on target computer Symptom When attempting to access a networked target computer for which administrative privileges are not authorized, an error message is displayed describing the issue, and then the program exits. Cause Administrative privileges are not available for the user requesting access.
Action 1. Attempt to ascertain and correct the cause of connection failure. 2. Restart the process. Failure occurs during ROM flash Symptom The server will not start. Cause The flash cannot be interrupted during a ROM flash, or the ROM image is corrupted and the server does not start.
• • Smart Array storage • Standup HBAs 3. Enable BitLocker after the firmware updates are complete. For information on BitLocker, see BitLocker for servers on the Microsoft website at http:// technet.microsoft.com/en-us/library/cc732774.aspx. More information Keeping the system current on page 147 Updating firmware Procedure 1.
Page 136
• Review the steps in the OS boot issues flowchart. • Verify the controller boot order. For more information, see the UEFI System Utilities user guide at http://www.hpe.com/info/uefi/ docs. • Access the redundant ROM using one of the following procedures: ◦...
When the server is configured for UEFI Boot Mode, PXE servers must be configured with a UEFI boot image. For x64 EFI machines, the DHCP server also needs to be configured to support x64 EFI DHCP boot requests. For more information, see the UEFI documentation at http://www.hpe.com/info/UEFI/ docs.
Product QuickSpecs For more information about product features, specifications, options, configurations, and compatibility, see the product QuickSpecs on the Hewlett Packard Enterprise website (http://www.hpe.com/info/qs). Active Health System Viewer Active Health System Viewer (AHSV) is an online tool used to read, diagnose, and resolve server issues quickly using AHS uploaded data.
Health System Viewer documentation at the following website: http://www.hpe.com/support/ahsv-docs. HPE iLO 5 iLO 5 is a remote server management processor embedded on the system boards of HPE ProLiant servers and Synergy compute modules. iLO enables the monitoring and controlling of servers from remote locations.
Service Port The Service Port is a USB port with the label iLO on ProLiant Gen10 servers and Synergy Gen10 compute modules. When you have physical access to a server, you can use the Service Port to do the following: •...
RESTful Interface Tool The RESTful Interface Tool (iLOREST) is a scripting tool that allows you to automate HPE server management tasks. It provides a set of simplified commands that take advantage of the iLO RESTful API. You can install the tool on your computer for remote use or install it locally on a server with a Windows or Linux Operating System.
IMPORTANT: HPE ProLiant XL servers do not support operating system installation with Intelligent Provisioning, but they do support the maintenance features. For more information, see "Performing Maintenance" in the Intelligent Provisioning User Guide and online help. After the server is running, you can update the firmware to install additional components. You can also update any components that have been outdated since the server was manufactured.
Management Security HPE ProLiant Gen10 servers are built with some of the industry's most advanced security capabilities, out of the box, with a foundation of secure embedded management applications and firmware. The management security provided by HPE embedded management products enables secure support of modern workloads, protecting your components from unauthorized access and unapproved use.
• An Embedded UEFI Shell that provides a preboot environment for running scripts and tools. • Boot support for option cards that only support a UEFI option ROM. Selecting the boot mode This server provides two Boot Mode configurations: UEFI Mode and Legacy BIOS Mode. Certain boot options require that you select a specific boot mode.
HPE Smart Storage Administrator HPE SSA is the main tool for configuring arrays on HPE Smart Array SR controllers. It exists in three interface formats: the HPE SSA GUI, the HPE SSA CLI, and HPE SSA Scripting. All formats provide support for configuration tasks.
Accessing HPE SSA in the offline environment IMPORTANT: If you are updating an existing server in an offline environment, obtain the latest version of HPE SSA through Service Pack for ProLiant before performing configuration procedures. Using one of multiple methods, you can run HPE SSA before launching the host operating system. In offline mode, users can configure or maintain detected and supported devices, such as optional Smart Array controllers and integrated Smart Array controllers.
To download the SPP, see the SPP download page at https://www.hpe.com/servers/spp/download. Smart Update Manager SUM is an innovative tool for maintaining and updating the firmware, drivers, and system software of HPE ProLiant, HPE BladeSystem, HPE Synergy, HPE Apollo, and HPE Moonshot servers, infrastructure, and associated options.
SUM does not support third-party controllers, including flashing hard drives behind the controllers. Smart Update Tools Smart Update Tools is a software utility used with iLO 4, HPE OneView, iLO Amplifier Pack, Service Pack for ProLiant (SPP), and Smart Update Manager (SUM) to stage, install, and activate firmware and driver updates.
Procedure 1. Access the System ROM Flash Binary component for your server from the Hewlett Packard Enterprise Support Center (http://www.hpe.com/support/hpesc). 2. Copy the binary file to a USB media or iLO virtual media. 3. Attach the media to the server.
HPE Pointnext Portfolio HPE Pointnext delivers confidence, reduces risk, and helps customers realize agility and stability. Hewlett Packard Enterprise helps customers succeed through Hybrid IT by simplifying and enriching the on- premise experience, informed by public cloud qualities and attributes.
Page 151
You can subscribe to proactive notifications on the Hewlett Packard Enterprise website. Software and configuration utilities...
System Log and submit a case to Hewlett Packard Enterprisesupport. For more information, see the Active Health System Viewer documentation on the Hewlett Packard Enterprise website (http:// www.hpe.com/support/ahsv-docs). • Integrated Management Log • HPE Smart Storage Administrator Diagnostic Utility CLI reports • HPS report • Linux reports—sosreport and supportconfig Active Health System log The Active Health System log is a single file that contains basic information required by Hewlett Packard Enterprise support to analyze the system.
This information is not written to the log data stored on the server. 3. Click Download. 4. Save the file. 5. If you have an open support case, you can email the log file to gsd_csc_case_mngmt@hpe.com. Intelligent Provisioning diagnostic tools...
3. Click Download Entire Log. 4. Save the file. 5. If you have an open support case, you can email the log file to gsd_csc_case_mngmt@hpe.com. Use the following convention for the email subject: CASE: <case number>. Files that are larger than 25 MB must be compressed and uploaded to an FTP site. If needed, contact Hewlett Packard Enterprise for FTP site information.
Page 155
IMPORTANT: When you enter these commands, ensure that you do not use spaces or other unsupported characters. If required by your command-line environment, special characters such as the ampersand must be preceded by the escape character. See the command-line environment documentation for more information.
Specifies your iLO user account credentials. –o <filename>.ahs Specifies the output file name and path. case_no=<HPE support case number> Specifies a Hewlett Packard Enterprise support case number to add to the log header. Options for adding contact information to the downloaded log phone=<phone number>...
Downloading the AHS log through IP For more information, see the Intelligent Provisioning user guide at http://www.hpe.com/info/ intelligentprovisioning/docs. Procedure 1. Open Intelligent Provisioning [F10] at boot-up. 2. Click Perform Maintenance. 3. Click Active Health System download. 4. Leave the range as the default or as instructed by the Hewlett Packard Enterprise technician.
1. Download the AHS log from the server. For more information, see "Downloading the AHS log" in the Active Health System Viewer User Guide. 2. Log in to AHSV (http://www.hpe.com/servers/ahsv) with your HPE Passport credentials, and upload the AHS log to AHSV.
3. Optional: To view the event details pane, click an event. HPE SSA diagnostic tasks 1. Open HPE SSA. For more information about HPE SSA, see the HPE Smart Storage Administrator User Guide on the Hewlett Packard Enterprise website. 2. Open the Diagnostics panel by doing one of the following: •...
If you select Save Diagnostic Report, wait for the report to generate, and then click Close Report or Save Report. For more information about reports and viewing them in a browser, see the HPE Smart Storage Administrator User Guide on the Hewlett Packard Enterprise website.
This report contains information about the current usage level and remaining expected lifetime of solid state drives attached to the system. For more information, see "Reported information." Reported information The array diagnostic report provides detailed information about devices (array controllers, storage enclosures, drive cages, physical drives, logical drives, and tape drives).
Page 162
• Solid State Drives with Wearout Status • Solid State Drives with Less Than an Estimated 56 Days of Life Remaining • Solid State Drives with Less Than 2% Usage Remaining • Solid State Drives with Less Than 5% Usage Remaining •...
Installing the utility Procedure 1. Browse to the HPE Smart Storage Administrator website. 2. Click Download software. 3. Select an OS. 4. Identify the preferred software and version, and then click Download. 5. Save, and then run, the executable file.
• ADUReport.txt—Diagnostic report in text format • ADUReport.xml—Diagnostic report in XML format • ADUReportViewer.htm—HTML viewer for XML diagnostic report • SlotX.txt (SlotX.old)—Controller serial output log The serial output log file(s) are only available if the Smart Array SAS/SATA Event Notification Service is installed and running.
Run this report before contacting Hewlett Packard Enterprise technical support and be prepared to send the cab file. Linux reports To assist in possible Linux installation issues on HPE servers, capture either the sosreport or supportconfig before contacting Hewlett Packard Enterprise Support. More information...
• Error Message Guide for HPE ProLiant Gen10 servers and HPE Synergy provides a list of error messages and information to assist with interpreting and resolving error messages. •...
Product documentation can be accessed from the following locations: • Product-specific information library — See the Start Here document that ships with the server. • Hewlett Packard Enterprise website (http://www.hpe.com/info/enterprise-docs)—Select the product to narrow the list to fewer documents. • Hewlett Packard Enterprise Support Center website (http://www.hpe.com/support/hpesc)
• Hewlett Packard Enterprise Support Center website (http://www.hpe.com/support/hpesc) HPE iLO software documentation For more information about iLO, see the HPE iLO user guide on the Hewlett Packard Enterprise website. UEFI System Utilities documentation For more information about UEFI System Utilities, see the UEFI System Utilities user guide on the Hewlett Packard Enterprise website.
Service notifications, advisories, and notices are available to provide more information about any known issues with an HPE . To locate service notifications, advisories or notices, search for your product on the Hewlett Packard Enterprise website (http://www.hpe.com/support/hpesc). Use the filters to narrow down the list of by documents type.
• Hewlett Packard Enterprise Support Center website HPE Smart Array controller documentation For more information about HPE Smart Array controllers, see the documentation on the Hewlett Packard Enterprise website. Spare part numbers Hewlett Packard Enterprise continually improves and changes product parts. For complete and current...
External cabling information See the cabling information on the Hewlett Packard Enterprise website. Power capacity For all HPE ProLiant Gen10 servers, see the HPE Power Advisor on the Hewlett Packard Enterprise website (http://www.hpe.com/info/poweradvisor). Switch settings, LED functions, drive, memory, expansion board and processor installation instructions, and board layouts See the access panel labels and the server user guide.
Library. Installation and configuration information for the server management system See the HPE Systems Insight Manager Installation and User Guide on the Hewlett Packard Enterprise Information Library. Fault tolerance, security, care and maintenance, configuration and setup See the server documentation, available in the following locations: •...
Websites General websites Hewlett Packard Enterprise Information Library www.hpe.com/info/EIL Single Point of Connectivity Knowledge (SPOCK) Storage compatibility matrix www.hpe.com/storage/spock Storage white papers and analyst reports www.hpe.com/storage/whitepapers For additional websites, see Support and other resources. Websites...
Support and other resources Accessing Hewlett Packard Enterprise Support • For live assistance, go to the Contact Hewlett Packard Enterprise Worldwide website: http://www.hpe.com/assistance • To access documentation and support services, go to the Hewlett Packard Enterprise Support Center website: http://www.hpe.com/support/hpesc Information to collect •...
IMPORTANT: Access to some updates might require product entitlement when accessed through the Hewlett Packard Enterprise Support Center. You must have an HPE Passport set up with relevant entitlements. Customer self repair Hewlett Packard Enterprise customer self repair (CSR) programs allow you to repair your product. If a CSR part needs to be replaced, it will be shipped directly to you so that you can install it at your convenience.
Documentation Feedback (docsfeedback@hpe.com). When submitting your feedback, include the document title, part number, edition, and publication date located on the front cover of the document. For online help content, include the product name, product version, help edition, and publication date located on the legal notices page.
Need help?
Do you have a question about the ProLiant Gen10 and is the answer not in the manual?
Questions and answers