IBM Power Systems 8001-12C Manual
IBM Power Systems 8001-12C Manual

IBM Power Systems 8001-12C Manual

Problem analysis, system parts, and locations
Hide thumbs Also See for Power Systems 8001-12C:
Table of Contents

Advertisement

Quick Links

Power Systems
Problem analysis, system parts, and
locations for the 8001-12C, 8001-22C,
8005-12N, and 8005-22N
IBM

Advertisement

Table of Contents
loading
Need help?

Need help?

Do you have a question about the Power Systems 8001-12C and is the answer not in the manual?

Questions and answers

Summary of Contents for IBM Power Systems 8001-12C

  • Page 1 Power Systems Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 3 Power Systems Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 4 Note Before using this information and the product it supports, read the information in “Safety notices” on page v, “Notices” on page 101, the IBM Systems Safety Notices manual, G229-9054, and the IBM Environmental Notices and User Guide, Z125–5823. ™...
  • Page 5: Table Of Contents

    Collecting diagnostic data . 71 Contacting IBM service and support . . 71 Finding parts and locations ......73 8001-12C or 8005-12N locations .
  • Page 6 Notices ........101 Accessibility features for IBM Power Systems servers .
  • Page 7: Safety Notices

    Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v If IBM supplied the power cord(s), connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
  • Page 8 – For racks with AC power, connect all power cords to a properly wired and grounded electrical outlet. Ensure that the outlet supplies proper voltage and phase rotation according to the system rating plate. – For racks with a DC power distribution panel (PDP), connect the customer’s DC power source to the PDP.
  • Page 9 v Each rack cabinet might have more than one power cord. – For AC powered racks, be sure to disconnect all power cords in the rack cabinet when directed to disconnect power during servicing. – For racks with a DC power distribution panel (PDP), turn off the circuit breaker that controls the power to the system unit(s), or disconnect the customer’s DC power source, when directed to disconnect power during servicing.
  • Page 10 CAUTION: Removing components from the upper positions in the rack cabinet improves rack stability during relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a room or building. v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack cabinet.
  • Page 11 DANGER: Rack-mounted devices are not to be used as shelves or work spaces. (L002) (L003) Safety notices...
  • Page 12 DANGER: Multiple power cords. The product might be equipped with multiple AC power cords or multiple DC power cables. To remove all hazardous voltages, disconnect all power cords and power cables. (L003) (L007) CAUTION: A hot surface nearby. (L007) (L008) Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 13 Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local regulations. In the United States, IBM has a process for the collection of this battery. For information, call 1-800-426-4333. Have the IBM part number for the battery unit available when you call. (C003)
  • Page 14 Power and cabling information for NEBS (Network Equipment-Building System) GR-1089-CORE The following comments apply to the IBM servers that have been designated as conforming to NEBS (Network Equipment-Building System) GR-1089-CORE: Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 15 The equipment is suitable for installation in the following: v Network telecommunications facilities v Locations where the NEC (National Electrical Code) applies The intrabuilding ports of this equipment are suitable for connection to intrabuilding or unexposed wiring or cabling only. The intrabuilding ports of this equipment must not be metallically connected to the interfaces that connect to the OSP (outside plant) or its wiring.
  • Page 16 Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 17: Beginning Troubleshooting And Problem Analysis

    Yes: Continue with the next step. Go to “Resolving a power problem” on page 4. 2. Can you access the baseboard management controller (BMC) across the network? Then Yes: Continue with the next step. © Copyright IBM Corp. 2016, 2019...
  • Page 18: Resolving A Bmc Access Problem

    Then Go to “Resolving a BMC access problem.” 3. Can you boot the system to the Petitboot menu? Then Yes: Continue with the next step. Go to “Resolving a system firmware boot failure” on page 5. 4. Is video displayed on the video graphics array (VGA) monitor? Then Yes: Continue with the next step.
  • Page 19 Note: If the IP address setting is incorrect, go to Configuring the firmware IP address website(http://www.ibm.com/support/knowledgecenter/linuxonibm/liabw/ liabwenablenetwork.htm). If the MAC address is 00:00:00:00:00:00, go to “Contacting IBM service and support” on page 71. 5. Are you able to log in to the BMC web interface?
  • Page 20: Resolving A Power Problem

    9. Update the BMC firmware by using a USB device. Complete the following steps: a. Ensure that the USB device is formatted by using the VFAT file system. b. Insert the USB device into the system if you have not already done so. c.
  • Page 21: Resolving A System Firmware Boot Failure

    Then Yes: Continue with the next step. No service action is required. This ends the procedure. 2. Perform the following actions, one at a time, until the problem is resolved: a. Ensure that all of the power cords are fully seated in the power supplies. b.
  • Page 22: Resolving A Vga Monitor Problem

    Then Yes: Continue with the next step. This ends the procedure. 4. Complete the following actions, one at a time, until the problem is resolved: a. Disconnect the power cords from the system for 30 seconds. Reconnect the power cords, wait 5 minutes, and then go to step 3 on page 5.
  • Page 23: Resolving An Operating System Boot Failure

    Resolving an operating system boot failure Learn how to identify the service action that is needed to resolve a failure while booting your operating system. 1. Was the system recently installed, serviced, moved, or upgraded? Then Yes: Ensure that all cables are properly seated in the connection path to the designated boot device.
  • Page 24 Table 1. Determine the command to verify that the boot drive is recognized and in optimal status (continued) Boot drive configuration Commands Non-Volatile Memory Express (NVMe) drive Use the nvme list command to verify that the boot drive is recognized: v nvme list Use the nvme smart-log command to verify the smart status of the boot drive:...
  • Page 25: Resolving A Sensor Indicator Problem

    Table 2. Determine the command to verify that the drives that are known to be in a RAID array are recognized (continued) Drive configuration Commands Drive connected directly to the system backplane v mvcli v info -o vd v info -o pd Are the drives that are known to be in the RAID array recognized? Then Reinstall the operating system on the boot drive.
  • Page 26: Resolving A Hardware Problem

    Did you identify a log entry that meets the above criteria? v Yes: Continue with the next step. v No: Go to “Collecting diagnostic data” on page 71. Then, go to “Contacting IBM service and support” on page 71. This ends the procedure.
  • Page 27: Resolving A Gpu, Pcie Adapter, Or Device Problem

    6. Was a service action identified? Then Yes: Continue with the next step. Go to “Collecting diagnostic data” on page 71. Then, go to “Contacting IBM service and support” on page 71. This ends the procedure. 7. Did the service action fix the problem? Then Yes: This ends the procedure.
  • Page 28: Resolving A Raid Adapter Problem

    If your system is an 8001-22C or 8005-22N, go to “8001-22C or 8005-22N locations” on page 87 to identify the physical location and the removal and replacement procedure. Go to “Collecting diagnostic data” on page 71. Then, go to “Contacting IBM service and support” on page 71.
  • Page 29: Resolving A Network Adapter Problem

    If the RAID adapter is functioning again, review the IBM support tips to confirm that there are no PCI address, driver, or firmware conflicts. Then, reinstall the new adapters again one at a time until all adapters function properly.
  • Page 30 If the network adapter is functioning again, review the IBM support tips to confirm that there are no PCI address, driver, or firmware conflicts. Then, reinstall the new adapters again one at a time until all adapters function properly.
  • Page 31: Resolving A Graphics Processing Unit Problem

    If the graphics adapter is functioning again, review the IBM support tips to confirm that there are no PCI address, driver, or firmware conflicts. Then, reinstall the new adapters again one at a time until all adapters function properly.
  • Page 32: Resolving A Storage Device Problem

    Table 7. NVMe Flash adapter problems and service actions Problem Service action System is unable to 1. If the system was recently installed, moved, serviced, or upgraded, verify that the NVMe find the NVMe Flash Flash adapter is seated and installed properly. adapter 2.
  • Page 33 Table 8. Storage device problems and service actions Problem Service action System is unable to find more than one storage device 1. If the system was recently installed, moved, serviced, or upgraded, verify that the device is seated and installed properly. 2.
  • Page 34: Identifying The Location Of The Pcie Adapter By Using The Slot Number

    Table 8. Storage device problems and service actions (continued) Problem Service action More than one storage device suddenly stops working 1. If the system was recently installed, moved, serviced, or upgraded, verify that the device is seated and installed properly. 2.
  • Page 35: Identifying The Location Of The Gpu By Using The Slot Number

    Table 9. Slot numbers, adapter descriptions, and service action for the 8001-12C and 8005-12N Slot information from log PCIe adapter description Service action UIO Network PCIe adapter 1 Replace the PCIe adapter indicated in the PCIe adapter description column. UIO Slot1 PCIe adapter 2 Go to “8001-12C or 8005-12N PLX Slot1...
  • Page 36: Identifying The Location Of The Nvme Flash Adapter

    Identifying the location of the NVMe Flash adapter Use this procedure to identify the location of a Non-Volatile Memory Express (NVMe) Flash adapter. 1. Does the operating system log contain the slot number? For example, the log might contain an error message similar to the following text: [131779.752714] EEH: PHB#0 failure detected, location: WIO Slot1 Then...
  • Page 37: User Guides For Gpus And Pcie Adapters

    Name User guide Avago Avago Technologies website (http:// www.avagotech.com/products/server-storage/raid- controllers/) Broadcom Broadcom website (http://www.broadcom.com) Emulex Emulex website (http://www.emulex.com/products/ ethernet-networking-storage-connectivity/ethernet- networking-adapters/ibm-branded/selection-guide/) Marvell Marvell website (http://www.marvell.com/storage/ system-solutions/sata-controllers/) Mellanox Mellanox Technologies website (http:// mymellanox.force.com/support/VF_SerialSearch) NVIDIA NVIDIA website (http://www.nvidia.com) QLogic QLogic website (http://driverdownloads.qlogic.com/ QLogicDriverDownloads_UI/IBM_Search.aspx) Identifying a service action Use the following procedures to help you identify the service action that is needed.
  • Page 38 If this SEL event continues to be logged, go to “Collecting diagnostic data” on page 71. Then, go to “Contacting IBM service and support” on page 71. 01xxxxxxxxxx Go to the “EPUB_PRC_FIND_DECONFIGURE_PART isolation procedure”...
  • Page 39 Continue with the next step. Yes: Go to “Collecting diagnostic data” on page 71. Then, go to “Contacting IBM service and support” on page 71. 7. Did you find only one SEL event that requires a service action as defined in step 5?
  • Page 40 v To display SEL details remotely over the LAN, use the following command: ipmitool -I lanplus -U <username> -P <password> -H <BMC IP address or BMC hostname> sel get <SEL record ID> Note: The SEL record ID must be entered in hexadecimal format. For example: 0x1a. The sensor ID field contains sensor information in the format sensor name (sensor ID).
  • Page 41 Table 17. OEM record c0 specific log information, description, and service action (continued) OEM record c0 specific log information Description Service action cdxx6fffffff An automatic shutdown event v Search for SEL events that are occurred due to high system related to high system temperature temperature and resolve them.
  • Page 42: Identifying Service Action Keywords In System Event Logs

    The sensor ID field contains sensor information in the format sensor name (sensor ID). Record the sensor name, sensor ID, and event description. Then, use this information to determine the service action to perform: v If your system is an 8001-12C or 8005-12N, go to “Identifying a service action by using sensor and event information for the 8001-12C and 8005-12N”...
  • Page 43: Identifying A Service Action By Using Sensor And Event Information

    You can use the sensor and event information from the system event log to determine a service action to ® perform for the IBM Power System S821LC (8001-12C) and IBM Hyperconverged Systems powered by Nutanix (8005-12N). If you have not done so already, complete “Identifying a service action by using system event logs” on page 21.
  • Page 44 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Peripheral Temp (0x02) Ensure that the room temperature v Transition to Critical from Less meets the requirements that are Severe specified for the system.
  • Page 45 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Ensure that there are no air flow v CPU1 Temp (0x0B) v Transition to Critical from Less obstructions at the front or at the rear Severe v CPU2 Temp (0x0D)
  • Page 46 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action If the sensor name is CPU Func 1, v CPU Func 1 (0x0C) v IERR replace CPU 1. If the sensor name is v CPU Func 2 (0x0E) v Transition to Non-recoverable CPU Func 2, replace CPU 2.
  • Page 47 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action No service action is required. v P1M1-DIMMA Func (0x10) v Memory Device Disabled v P1M1-DIMMB Func (0x11) v Uncorrectable Memory Error v P1M1-DIMMC Func (0x12) v Memory Scrub Failed...
  • Page 48 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Configuration Error Complete the following steps: v P1M1-DIMMA Func (0x10) 1. If the sensor name is v P1M1-DIMMB Func (0x11) P1M1-DIMMA Func, ensure that v P1M1-DIMMC Func (0x12) P1M1-DIMMA is seated properly.
  • Page 49 Service action System Event (0x35) Undetermined system hardware Go to “Collecting diagnostic data” on failure page 71. Then, go to “Contacting IBM service and support” on page 71. No service action is required. v System Reconfigured v OEM System boot event...
  • Page 50 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Ensure that there are no air flow v GPU1 Temp (0x52) v Transition to Critical from Less obstructions at the front or at the rear Severe v GPU2 Temp (0x53)
  • Page 51 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action NVMe_SSD Temp (0x5B) Ensure that there are no air flow v Transition to Critical from Less obstructions at the front or at the rear Severe of the system.
  • Page 52 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Ensure that there are no air flow v P1M1-DIMMA Temp (0x66) v Transition to Critical from Less obstructions at the front or at the rear Severe v P1M1-DIMMB Temp (0x67)
  • Page 53 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Total Power (0xA0) No service action is required. v Lower Non-critical – going low v Lower Non-critical – going high v Lower Critical –...
  • Page 54 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Performance Met If Asserted is in the event v Freq Limit OT 1 (0xA8) description, no service action is v Mem Thrttl OT 1 (0xAA) required.
  • Page 55 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action No service action is required. v CPU Core Temp 1 (0xB0) v Lower Non-critical – going low v CPU Core Temp 2 (0xB1) v Lower Non-critical –...
  • Page 56 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Replace system processor CPU 1. Go v CPU Core Func 1 (0xC8) v IERR to “8001-12C or 8005-12N locations” v CPU Core Func 2 (0xC9) v Transition to Non-recoverable on page 73 to identify the physical...
  • Page 57 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action Replace system processor CPU 2. Go v CPU Core Func 13 (0xD4) v IERR to “8001-12C or 8005-12N locations” v CPU Core Func 14 (0xD5) v Transition to Non-recoverable on page 73 to identify the physical...
  • Page 58 Table 18. Sensor information, event description, and service action for the 8001-12C and 8005-12N (continued) Sensor name (Sensor ID) Event description Service action If the sensor name is FAN1, replace v FAN1 (0xE3) v Transition to Critical from Less Fan 1. If the sensor name is FAN2, Severe v FAN2 (0xE4) replace Fan 2.
  • Page 59: Identifying A Service Action By Using Sensor And Event Information For The 8001-22C And 8005-22N

    8001-22C and 8005-22N You can use the sensor and event information from the system event log to determine a service action to perform for the IBM Power System S822LC for Big Data (8001-22C) and IBM Hyperconverged Systems powered by Nutanix (8005-22N).
  • Page 60 If you have not done so already, complete “Identifying a service action by using system event logs” on page 21. Then, use the following table to determine the service action to perform. Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N Sensor name (Sensor ID) Event description Service action...
  • Page 61 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action Device Disabled If the sensor name is OCC Active 1, v OCC Active 1 (0x08) replace CPU 1. If the sensor name is v OCC Active 2 (0x09) OCC Active 2, replace CPU 2.
  • Page 62 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action If the sensor name is CPU Func 1, v CPU Func 1 (0x0C) v IERR replace CPU 1. If the sensor name is v CPU Func 2 (0x0E) v Transition to Non-recoverable CPU Func 2, replace CPU 2.
  • Page 63 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action No service action is required. v P1M1-DIMMA Func (0x10) v Memory Device Disabled v P1M1-DIMMB Func (0x11) v Uncorrectable Memory Error v P1M1-DIMMC Func (0x12) v Memory Scrub Failed...
  • Page 64 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action Configuration Error Complete the following steps: v P1M1-DIMMA Func (0x10) 1. If the sensor name is v P1M1-DIMMB Func (0x11) P1M1-DIMMA Func, ensure that v P1M1-DIMMC Func (0x12) P1M1-DIMMA is seated properly.
  • Page 65 Service action System Event (0x35) Undetermined system hardware Go to “Collecting diagnostic data” on failure page 71. Then, go to “Contacting IBM service and support” on page 71. No service action is required. v System Reconfigured v OEM System boot event...
  • Page 66 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action v GPU1 Temp (0x52) v Transition to Critical from Less v If the system is an 8001-22C, Severe ensure that the system does not v GPU2 Temp (0x53)
  • Page 67 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action MB_10G Temp (0x5A) Ensure that there are no air flow v Transition to Critical from Less obstructions at the front or at the rear Severe of the system.
  • Page 68 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action Ensure that there are no air flow v Mem Buf Temp 1 (0x5E) v Transition to Critical from Less obstructions at the front or at the rear Severe v Mem Buf Temp 2 (0x5F)
  • Page 69 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action VBAT (0x9C) Replace the time-of-day battery. Go v Transition to Non-recoverable to “8001-22C or 8005-22N locations” v Lower Non-recoverable – going on page 87 to identify the physical location and removal and replacement procedure.
  • Page 70 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action No service action required. v CPU1 Power or Proc0 Power v Lower Non-critical – going low (0xA2) v Lower Non-critical –...
  • Page 71 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action Performance Met If Asserted is in the event v Freq Limit Pwr 1 (0xA9) description, no service action is v Freq Limit Pwr 2 (0xAD) required.
  • Page 72 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action No service action is required. v CPU Core Temp 13 (0xBC) v Lower Non-critical – going low v CPU Core Temp 14 (0xBD) v Lower Non-critical –...
  • Page 73 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action Replace system processor CPU 1. Go v CPU Core Func 1 (0xC8) v IERR to “8001-22C or 8005-22N locations” v CPU Core Func 2 (0xC9) v Transition to Non-recoverable on page 87 to identify the physical...
  • Page 74 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action Replace system processor CPU 2. Go v CPU Core Func 13 (0xD4) v IERR to “8001-22C or 8005-22N locations” v CPU Core Func 14 (0xD5) v Transition to Non-recoverable on page 87 to identify the physical...
  • Page 75 Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action If the sensor name is FAN1, FAN4, v FAN1 (0xE3) v Transition to Critical from Less FAN5, or FAN8, no service action is Severe v FAN2 (0xE4) required.
  • Page 76: Isolation Procedures

    Table 19. Sensor information, event description, and service action for the 8001-22C and 8005-22N (continued) Sensor name (Sensor ID) Event description Service action If the sensor name is PS1 Status, v PS1 Status (0xF3) v Predictive Failure replace PSU 1. If the sensor name is v PS2 Status (0xF4) v Power Supply Input Out of Range PS2 Status, replace PSU 2.
  • Page 77: Epub_Prc_Find_Deconfigure_Part Isolation Procedure

    Does the problem persist? Then Yes: Replace the system backplane. If the replacement of the system backplane does not resolve the problem, go to “Contacting IBM service and support” on page 71. This ends the procedure. This ends the procedure. EPUB_PRC_SP_CODE isolation procedure A problem was detected in the system firmware.
  • Page 78: Epub_Prc_All_Procs Isolation Procedure

    Yes: Continue with the next step. Go to “Contacting IBM service and support” on page 71. This ends the procedure. 3. For each of the SELs that you identified in step 2, determine the sensor name that is associated with each SEL.
  • Page 79: Epub_Prc_Lvl_Support Isolation Procedure

    Yes: If you have not already done so, replace the system backplane. If the replacement of the system backplane does not resolve the problem, go to “Contacting IBM service and support” on page 71. This ends the procedure. This ends the procedure.
  • Page 80: Epub_Prc_Proc_Ab_Bus Isolation Procedure

    If the replacement of the system processors and the system backplane does not resolve the problem, go to “Contacting IBM service and support” on page 71. This ends the procedure. EPUB_PRC_PROC_AB_BUS isolation procedure A diagnostic function detected an external processor interface problem.
  • Page 81: Epub_Prc_Eibus_Error Isolation Procedure

    Yes: Continue with the next step. Go to “Contacting IBM service and support” on page 71. This ends the procedure. 3. For each of the SELs that you identified in step 2, determine the sensor name that is associated with each SEL.
  • Page 82: Epub_Prc_Power_Error Isolation Procedure

    Does the problem persist? Then Yes: Replace the system backplane. If the replacement of the system backplane does not resolve the problem, go to “Contacting IBM service and support” on page 71. This ends the procedure. This ends the procedure. EPUB_PRC_POWER_ERROR isolation procedure A power problem occurred.
  • Page 83: Epub_Prc_Tod_Clock_Err Isolation Procedure

    Does the problem persist? Then Yes: Replace the system backplane. If the replacement of the system backplane does not resolve the problem, go to “Contacting IBM service and support” on page 71. This ends the procedure. This ends the procedure. EPUB_PRC_TOD_CLOCK_ERR isolation procedure A diagnostic function detected a problem with the time of day or clock function.
  • Page 84: Epub_Prc_Cooling_System_Err Isolation Procedure

    If replacing the system backplane and both system processors does not resolve the problem, go to “Contacting IBM service and support” on page 71. This ends the procedure. 8001-22C or 8005-22N Replace the system backplane. If replacing the system backplane does not resolve the problem, replace system processor CPU 1.
  • Page 85: Verifying A Repair

    Verifying a repair Learn how to verify hardware operation after you make repairs to the system. 1. Power on the system. 2. Did you replace a graphics processing unit (GPU), PCIe adapter, disk drive, or solid-state drive? Then Yes: Go to step 5. Continue with the next step.
  • Page 86 Table 26. Determining a verification action for GPUs, PCIe adapters, and devices (continued) Adapter type Verification action Devices that are not controlled by a RAID adapter If the device is a SAS or SATA drive, complete the following steps: 1. Install the mvcli utility. 2.
  • Page 87: Collecting Diagnostic Data

    Follow the instructions to install and run the system event log collection tool. Then, continue with the next step. 4. Send the data that you collected during this procedure to IBM service and support. This ends the procedure. Contacting IBM service and support You can contact IBM service and support by telephone or through the IBM Support Portal.
  • Page 88 Operating system problem v IBM application program v Loop, hang, or message Hardware: v IBM system hardware broken v Hardware reference code v IBM input/output (I/O) problem v Upgrade Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 89: Finding Parts And Locations

    See Removing and replacing a storage drive in the 8001-12C or 8005-12N. HDD 3 or NVMe 3* See Removing and replacing a storage drive in the 8001-12C or 8005-12N. *8005-12N systems do not support NVMe drives. © Copyright IBM Corp. 2016, 2019...
  • Page 90 Figure 2. Top view Table 29. Top view locations FRU removal and replacement Index number FRU description procedures Disk drive backplane See Removing and replacing the disk drive backplane in the 8001-12C or 8005-12N. Fan 1 See Removing and replacing fans in the 8001-12C or 8005-12N.
  • Page 91 Table 29. Top view locations (continued) FRU removal and replacement Index number FRU description procedures CPU 2 See Removing and replacing a system processor module for the 8001-12C or 8005-12N. CPU 1 See Removing and replacing a system processor module for the 8001-12C or 8005-12N.
  • Page 92 Table 30. Rear view locations (continued) FRU removal and replacement Index number FRU description procedures PCIe adapter 4 or GPU (WIO Slot1) For PCIe adapters, see Removing and replacing PCIe adapters in the 8001-12C or 8005-12N. For the graphics processing unit, see Removing and replacing a graphics processing unit in the 8001-12C.
  • Page 93 Table 31. Memory locations FRU removal and replacement Index number FRU description procedures P1M1-DIMMA See Removing and replacing memory in the 8001-12C or 8005-12N. P1M1-DIMMB See Removing and replacing memory in the 8001-12C or 8005-12N. P1M1-DIMMC See Removing and replacing memory in the 8001-12C or 8005-12N.
  • Page 94: 8001-12C Or 8005-12N Parts

    After you identify the part number of the part that you want to order, go to Advanced Part Exchange Warranty Service. Registration is required. If you are not able to identify the part number, go to Contacting IBM service and support. Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 95 Rack final assembly Figure 6. Rack final assembly Table 33. Rack final assembly part numbers Units per Index number Part number assembly Description MCP-290- Slide rail kit - contains left and right slide rails and 00052-0N attaching screws (8001-12C) MCP-290- Slide rail kit - contains left and right slide rails and 00102-0N attaching screws (8005-12N)
  • Page 96 System parts Figure 7. System parts Table 34. System parts Index number Part number Units per assembly Description Top cover assembly Screws PCIe adapters. Use the feature type of the adapter to find the FRU number in PCIe adapter information by feature type for the 8001-12C or 8005-12N.
  • Page 97 Table 34. System parts (continued) Index number Part number Units per assembly Description PCIe adapter. Use the feature type of the adapter to find the FRU number in PCIe adapter information by feature type for the 8001-12C or 8005-12N. AOC-UR-i4XTF 1U UIO NIC PCIe adapter with integrated 4-port 10 GbE Base-T, Intel XL710, and CAPI Note: This PCIe adapter is also a PCIe riser.
  • Page 98 Table 34. System parts (continued) Index number Part number Units per assembly Description HDS-KIT-3N-1200- 1.2 TB small form factor NVMe drive (3 drive writes per IB001 day) (8001-12C) HDS-KIT-3N-1600- 1.6 TB small form factor NVMe drive (3 drive writes per IB001 day) (8001-12C) HDS-KIT-3N-2000-...
  • Page 99 Additional system parts Figure 8. Additional system parts Table 35. Additional system parts Index number Part number Units per assembly Description MTA9ASF51272PZ- 4 GB, 2400 MHz 1RX8 DDR4 RDIMM (Micron Technology, 2G3B1 Inc.)* (8001-12C) MTA9ASF1G72PZ- 8 GB, 2400 MHz 1RX8 DDR4 RDIMM (Micron Technology, 2G3B1 Inc.)* (8001-12C) MTA18ASF2G72PZ-...
  • Page 100 Table 35. Additional system parts (continued) Index number Part number Units per assembly Description M393A1G40DB0- 8 GB, 2133 MHz 1RX4 DDR4 RDIMM (Samsung Electronics Co., Ltd.)* (8001-12C) M393A2G40DB0- 16 GB, 2133 MHz 2RX4 DDR4 RDIMM (Samsung Electronics Co., Ltd.)* (8001-12C) M393A4K40BB0-CPB 16 32 GB, 2133 MHz 2RX4 DDR4 RDIMM (Samsung Electronics Co., Ltd.)* (8001-12C)
  • Page 101 Miscellaneous parts Table 36. Miscellaneous parts Description Part number Rail adapter kit for round MCP-290-91904-0N (8005-12N) hole racks Finding parts and locations...
  • Page 102 Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 103: Finding Parts And Locations

    8001-22C or 8005-22N. HDD 4 See Removing and replacing a storage drive in the 8001-22C or 8005-22N. HDD 5 See Removing and replacing a storage drive in the 8001-22C or 8005-22N. © Copyright IBM Corp. 2016, 2019...
  • Page 104 Table 37. Front view locations (continued) FRU removal and replacement Index number FRU description procedures HDD 6 See Removing and replacing a storage drive in the 8001-22C or 8005-22N. HDD 7 See Removing and replacing a storage drive in the 8001-22C or 8005-22N.
  • Page 105 Figure 10. Top view Table 38. Top view locations FRU removal and replacement Index number FRU description procedures Disk drive backplane See Removing and replacing the disk drive backplane in the 8001-22C or 8005-22N. Fan 2 See Removing and replacing fans in the 8001-22C or 8005-22N.
  • Page 106 Table 38. Top view locations (continued) FRU removal and replacement Index number FRU description procedures System backplane See Removing and replacing the system backplane in the 8001-22C or 8005-22N. PSU 1 See Removing and replacing a power supply in the 8001-12C, 8001-22C, 8005-12N, or 8005-22N.
  • Page 107 Table 39. Rear view locations (continued) FRU removal and replacement Index number FRU description procedures PCIe adapter 5 or GPU 2 (WIO Slot1) For PCIe adapters, see Removing and replacing PCIe adapters in the 8001-22C or 8005-22N. For the graphics processing unit, see Removing and replacing a graphics processing unit in the 8001-22C.
  • Page 108 The following table provides the memory locations. Table 40. Memory locations FRU removal and replacement Index number FRU description procedures P1M1-DIMMA See Removing and replacing memory in the 8001-22C or 8005-22N. P1M1-DIMMB See Removing and replacing memory in the 8001-22C or 8005-22N. P1M1-DIMMC See Removing and replacing memory in the 8001-22C or 8005-22N.
  • Page 109: 8001-22C Or 8005-22N Parts

    After you identify the part number of the part that you want to order, go to Advanced Part Exchange Warranty Service. Registration is required. If you are not able to identify the part number, go to Contacting IBM service and support. Finding parts and locations...
  • Page 110 Rack final assembly Figure 14. Rack final assembly Table 42. Rack final assembly part numbers Units per Index number Part number assembly Description MCP-290- Slide rail kit - contains left and right slide rails and 00057-0N attaching screws MCP-290- Slide rail kit - contains left and right slide rails and 00057-0N attaching screws Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 111 System parts Figure 15. System parts Finding parts and locations...
  • Page 112 Table 43. System parts Index number Part number Units per assembly Description Top cover assembly Screws MCP-310-82914-0B CPU air baffle (8001-22C) MCP-310-82908-0B CPU air baffle (8005-22N) SNK-P0053P-IB001 Heat sink kit (includes heat sink and thermal interface material) CPU-KIT-01EM062- 8 core 3.325 GHz system processor module kit (includes IB001 system processor, tray, and vacuum pen) (8001-22C) CPU-KIT-01EM063-...
  • Page 113 Table 43. System parts (continued) Index number Part number Units per assembly Description SSD-DM064-PHI 64 GB SATA drive on module (DOM) (8001-22C) SSD-DM064- 64 GB SATA drive on module (DOM) (8005-22N) SMCMVN1 SSD-DM128- 128 GB SATA drive on module (DOM) (8001-22C) SMCMVN1 MBD-P8DTU-2U- System backplane kit (includes system backplane, tray, and...
  • Page 114 Table 43. System parts (continued) Index number Part number Units per assembly Description HDS-KIT-2T-960- 960 GB 2.5 inch small form factor SATA solid-state drive IB001 (8001-22C) HDS-KIT-2T-1900- 1.9 TB 2.5 inch small form factor SATA solid-state drive IB001 (8001-22C) HDS-KIT-2T-3800- 3.8 TB 2.5 inch small form factor SATA solid-state drive IB001 (8001-22C)
  • Page 115 Table 43. System parts (continued) Index number Part number Units per assembly Description PCIe adapters. Use the feature type of the adapter to find the FRU number in PCIe adapter information by feature type for the 8001-22C or 8005-22N. Note: 8005-22N systems do not support this PCIe adapter. PCIe riser RSC-W2-688P PCIe riser PCIe adapter 4 or GPU 2 (WIO Slot1), PCIe...
  • Page 116 Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 117: Notices

    Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead.
  • Page 118: Accessibility Features For Ibm Power Systems Servers

    All IBM prices shown are IBM's suggested retail prices, are current and are subject to change without notice. Dealer prices may vary. This information is for planning purposes only. The information herein is subject to change before the products described become available.
  • Page 119: Privacy Policy Considerations

    This product uses standard navigation keys. Interface information The IBM Power Systems servers user interfaces do not have content that flashes 2 - 55 times per second. The IBM Power Systems servers web user interface relies on cascading style sheets to render content properly and to provide a usable experience.
  • Page 120: Trademarks

    IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
  • Page 121 Warning: This is a Class A product. In a domestic environment, this product may cause radio interference, in which case the user may be required to take adequate measures. VCCI Statement - Japan The following is a summary of the VCCI Japanese statement in the box above: This is a Class A product based on the standard of the VCCI Council.
  • Page 122 Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information: Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 123 Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw.
  • Page 124: Class B Notices

    Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for any radio or television interference caused by unauthorized changes or modifications to this equipment.
  • Page 125 European Community contact: IBM Deutschland GmbH Technical Regulations, Abteilung M456 IBM-Allee 1, 71139 Ehningen, Germany Tel: +49 800 225 5426 email: halloibm@de.ibm.com VCCI Statement - Japan Japan Electronics and Information Technology Industries Association Statement This statement explains the Japan JIS C 61000-3-2 product wattage compliance.
  • Page 126 Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw.
  • Page 127: Terms And Conditions

    Permissions for the use of these publications are granted subject to the following terms and conditions. Applicability: These terms and conditions are in addition to any terms of use for the IBM website. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved.
  • Page 128 Problem analysis, system parts, and locations for the 8001-12C, 8001-22C, 8005-12N, and 8005-22N...
  • Page 130 IBM®...

Table of Contents