Lenovo ThinkSystem DM7000 Series Hardware Installation And Maintenance Manual

ThinkSystem DM7000x
Hardware Installation and Maintenance Guide
Machine Types: 7Y40 and 7Y56

Summary of Contents for Lenovo ThinkSystem DM7000 Series

  • Page 1 ThinkSystem DM7000x Hardware Installation and Maintenance Guide Machine Types: 7Y40 and 7Y56...
  • Page 2 Before using this information and the product it supports, be sure to read and understand the safety information and the safety instructions, which are available at: http://thinksystem.lenovofiles.com/help/topic/safety_documentation/pdf_files.html In addition, be sure that you are familiar with the terms and conditions of the Lenovo warranty for your system, which can be found at: http://datacentersupport.lenovo.com/warrantylookup Third Edition (March 2023) ©...
  • Page 3: Table Of Contents

    Restoring and verifying the system diagnostics....87 configuration ....© Copyright Lenovo 2019, 2023...
  • Page 4 Introduction to system-level diagnostics ..Appendix B. Notice of Privacy Requirements for running system-level Practices ....107 diagnostics ....How to use online command-line help .
  • Page 5: Safety

    Vor der Installation dieses Produkts die Sicherheitshinweise lesen. Prima di installare questo prodotto, leggere le Informazioni sulla Sicurezza. Les sikkerhetsinformasjonen (Safety Information) før du installerer dette produktet. Antes de instalar este produto, leia as Informações sobre Segurança.
  • Page 6 Antes de instalar este producto, lea la información de seguridad. Läs säkerhetsinformationen innan du installerar den här produkten.
  • Page 7: Chapter 1. Introduction

    • Total capacity: 256 GB • Eight 32 GB DIMMs • NVRAM/NVMEM used capacity: 16 GB. System fans: Three hot-swap fans. Power supplies: Two hot-swap power supplies for redundancy support. PCIe slots: Four PCIe slots in the rear.
  • Page 8 Table 1. Specifications (continued) Specification Description Input/Output (I/O) features The system (with two controllers) has the following I/O ports: • Eight 12 Gb MiniSAS HD ports • Four 10 Gb SFP+ Ethernet ports • Eight 10 Gb/16 Gb UTA2 SFP+ ports •...
  • Page 9: Management Software

    Tech Tips Lenovo continually updates the support website with the latest tips and techniques that you can use to solve issues that you might have with your system. These Tech Tips (also called retain tips or service bulletins) provide procedures to work around issues related to the operation of your system.
  • Page 11: Chapter 2. System Components

    The system stops working or there is an error on the system. (front) The system is operating normally. Chassis location LED (front), solid blue or blinking blue: The chassis location LED is manually activated to help locate the system. The chassis location LED is not activated.
  • Page 12: Rear View

    Front view without bezel Table 3. Components on the front of the system (without bezel) Fan attention LED (3) System fans (3) Chassis power LED Chassis attention LED (front) Chassis location LED (front) Rear view The rear of the system provides access to several connectors and components, including the power supplies and various connectors.
  • Page 13: Rear View Leds

    Rear view LEDs The rear of the system provides system LEDs. Figure 3. Rear view LEDs Table 5. LEDs on the rear of the system AC power good LED (2) Power supply attention LED (2) MiniSAS HD port link LED (8) MiniSAS HD port attention LED (8) SFP+ Ethernet port link LED (4) SFP+ Ethernet port attention LED (4)
  • Page 14 MiniSAS HD port LEDs Each MiniSAS HD port has two status LEDs. Ethernet status LED Color Status Description MiniSAS HD port link LED Green Link is established on at least one external SAS lane. None No link is established on any external SAS lane.
  • Page 15 Ethernet status LED Color Status Description RJ45 management port Green A link is established between the port and link LED some upstream device. None No link is established. RJ45 management port Amber Blinking Traffic is flowing over the connection. activity LED None No traffic is flowing over the connection.
  • Page 17: Chapter 3. Rail Kit Installation Instructions

    Before you begin, verify that you have the correct rail type by examining the PN label located on the rail (PN: SM17A38397). Installing Rail to square-hole four-post rack
  • Page 18: Dm/De Series 2U12 Rail Kit Installation

    Installing Rail to round-hole four-post rack DM/DE Series 2U12 rail kit installation instructions Using this rail kit, a 2U 12-drive enclosure can be installed in a four-post rack. Before you begin The rail kit includes the following items: • A pair of slide rails for four-post racks with alignment screws installed for the square-hole rack •...
  • Page 19 Type of hardware Description Quantity Flat-head M5 screw; 14 You use six M5 screws for attaching the rails to mm long the rack, and two M5 screws for attaching the brackets at the back of the enclosure to the brackets at the back of the rails. Round-head M5 screw;...
  • Page 20 Repeat these steps for the other rail. Step 3. Place the back of the enclosure (the end with the connectors) on the rails. Attention: A fully loaded enclosure weighs approximately 65 lb (29 kg). Two persons are required to safely move the enclosure. Step 4.
  • Page 21 Step 6. Secure the enclosure to the back of the rails by inserting two M5 screws through the brackets at the enclosure and the rail kit bracket. Step 7. If applicable, replace the shelf end caps or the system bezel. Step 8.
  • Page 23: Chapter 4. Hardware Replacement Procedures

    -node local -auto-giveback false Step 3. Take the degraded controller to the LOADER prompt by typing the following from the RJ45 Management Port of the degraded controller: storage failover takeover <degraded controller name>
  • Page 24: Opening The Controller Module

    If the degraded controller is displaying... Then... The LOADER prompt Go to the next step. Waiting for giveback... Press Ctrl-C, and then respond when prompted. System prompt or password prompt Take over or halt the degraded controller: • Take over the degraded controller from the healthy controller: storage failover takeover -ofnode impaired_node_name...
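The table above maps what the degraded controller's console shows to the next action, which can be read as a simple lookup. The sketch below is illustrative only: the `ACTIONS` dictionary and the `next_action` helper are hypothetical names, and the action strings paraphrase the table rather than reproduce any tool output.

```python
# Hypothetical lookup that mirrors the table of console states and actions.
ACTIONS = {
    "LOADER prompt": "Go to the next step.",
    "Waiting for giveback...": "Press Ctrl-C, and then respond when prompted.",
    "System prompt or password prompt": "Take over or halt the degraded controller.",
}


def next_action(console_state: str) -> str:
    """Return the documented action for the state shown on the console."""
    try:
        return ACTIONS[console_state]
    except KeyError:
        # Anything not listed in the table is treated as an error, not guessed.
        raise ValueError(f"Unrecognized console state: {console_state!r}") from None
```

Raising on an unknown state, rather than returning a default, keeps an operator script from proceeding when the console shows something the table does not cover.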
  • Page 25: Replacing Or Adding A Caching Module

    • If you are adding a caching module, it must support the additional capacity as well as the caching module capacity. See Lenovo Press. • All other components in the storage system must be functioning properly; if not, you must contact technical support.
  • Page 26: Reinstalling The Controller

    Step 3. If you are adding a caching module, go to the next step; if you are replacing the caching module, gently pull it straight out of the housing. Step 4. Align the edges of the caching module with the socket in the housing, and then gently push it into the socket.
  • Page 27: Running System-Level Diagnostics

    Note: Do not completely insert the controller module in the chassis until instructed to do so. Step 3. Recable the system, as needed. If you removed the media converters (SFPs), remember to reinstall them if you are using fiber optic cables. Step 4.
  • Page 28 If the system-level diagnostics tests... Then... Were completed without any failures 1. Clear the status logs: sldiag device clearstatus 2. Verify that the log was cleared: sldiag device status The following default response is displayed: SLDIAG: No log messages are present. 3.
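The clear-and-verify steps above end with a fixed default response, so the check can be expressed as a string comparison. This is a minimal sketch, assuming the console output has already been captured as text by some other means; `log_is_cleared` is a hypothetical helper, and only the two commands and the response string come from the guide.

```python
# Commands quoted from the guide, in the order they are entered.
CLEAR_AND_VERIFY = ["sldiag device clearstatus", "sldiag device status"]

# Default response the guide says is displayed once the log is empty.
EXPECTED_EMPTY_LOG = "SLDIAG: No log messages are present."


def log_is_cleared(status_output: str) -> bool:
    """Return True if the captured `sldiag device status` text shows an empty log."""
    return EXPECTED_EMPTY_LOG in status_output
```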
  • Page 29: Completing The Replacement Process

    8. Rerun the system-level diagnostic test. Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 30: Opening The Controller Module

    Assign Epsilon to a healthy controller in the cluster: cluster modify -node healthy_node -epsilon true Step 2. Disable automatic giveback from the console of the healthy controller using the following command: storage failover modify -node local -auto-giveback false Step 3. Take the degraded controller to the LOADER prompt by typing the following from the RJ45 Management Port of the degraded controller: storage failover takeover <degraded controller name>
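The command sequence in these steps can be assembled programmatically, for example to print a checklist before maintenance. The sketch below makes several assumptions: `pre_shutdown_commands` and its parameters are hypothetical, the caller decides whether Epsilon needs to be reassigned, and the takeover command uses the `-ofnode` form that this guide shows in its table of controller prompts. It only builds strings and does not contact a system.

```python
from typing import List


def pre_shutdown_commands(healthy_node: str, degraded_node: str,
                          reassign_epsilon: bool = False) -> List[str]:
    """Build the ordered command list for taking over a degraded controller."""
    commands = []
    if reassign_epsilon:
        # Only needed when Epsilon must move to a healthy controller.
        commands.append(f"cluster modify -node {healthy_node} -epsilon true")
    # Entered on the console of the healthy controller.
    commands.append("storage failover modify -node local -auto-giveback false")
    # Takes the degraded controller to the LOADER prompt.
    commands.append(f"storage failover takeover -ofnode {degraded_node}")
    return commands
```

For example, `pre_shutdown_commands("node-a", "node-b", reassign_epsilon=True)` returns three commands in the order the steps describe; the node names here are placeholders.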
  • Page 31: Replacing The Nvmem Battery

    Step 4. Loosen the thumbscrew on the cam handle on the controller module. Thumbscrew Cam handle Step 5. Pull the cam handle downward and begin to slide the controller module out of the chassis. Make sure that you support the bottom of the controller module as you slide it out of the chassis. Replacing the NVMEM battery To replace the NVMEM battery in your system, you must remove the failed NVMEM battery from the system and replace it with a new NVMEM battery.
  • Page 32: Reinstalling The Controller

    Battery lock tab NVMEM battery pack Step 3. Grasp the battery and press the blue locking tab marked PUSH, and then lift the battery out of the holder and controller module. Step 4. Remove the replacement battery from its package. Step 5.
  • Page 33: Running System-Level Diagnostics

    Step 2. Align the end of the controller module with the opening in the chassis, and then gently push the controller module halfway into the system. Note: Do not completely insert the controller module in the chassis until instructed to do so. Step 3.
  • Page 34 If the system-level diagnostics tests... Then... Were completed without any failures 1. Clear the status logs: sldiag device clearstatus 2. Verify that the log was cleared: sldiag device status The following default response is displayed: SLDIAG: No log messages are present. 3.
  • Page 35: Completing The Replacement Process

    8. Rerun the system-level diagnostic test. Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 36 Step 4. Press down the release latch on the power supply cam handle, and then lower the cam handle to the fully open position to release the power supply from the mid plane. Power supply Cam handle release latch Power and Fault LEDs Cam handle Power cord locking mechanism Step 5.
  • Page 37: Completing The Replacement Process

    The amber fault LED should be off and the DC good light should be on for each power supply. Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at...
  • Page 38: Completing The Replacement Process

    Step 10. Align the bezel with the ball studs, and then gently push the bezel onto the ball studs. Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at...
  • Page 39: Preparing The System For The Replacement

    • Any PCIe cards moved from the old controller module to the new controller module or added from existing customer site inventory must be supported by the replacement controller module. See Lenovo Press. • It is important that you apply the commands in these steps on the correct systems: –...
  • Page 40 Checking quorum on the SCSI blade If you are operating in a SAN environment and you are replacing a controller module, you must verify that each controller is in a SAN quorum with other controllers in the cluster. Step 1. With the privilege level set to advanced, check that the most recent scsiblade event message for the degraded controller indicates that the scsi-blade is in quorum: event log show -node impaired-...
  • Page 41: Shutting Down The Degraded Controller

    If you do not see these quorum messages, check the health of the SAN processes and resolve any issues before proceeding with the replacement. Pre-replacement tasks for systems that use Storage Encryption If you are replacing a controller module in a system with Storage Encryption enabled, you must first reset the authentication keys of the disks to their MSID (the default security ID set by the manufacturer).
  • Page 42: Replacing The Controller Module Hardware

    If the degraded controller is displaying... Then... The LOADER prompt Go to the next step. Waiting for giveback... Press Ctrl-C, and then respond when prompted. System prompt or password prompt Take over or halt the degraded controller: • Take over the degraded controller from the healthy controller: storage failover takeover -ofnode impaired_node_name...
  • Page 43 Opening the controller module To replace the controller module, you must first remove the old controller module from the chassis. About this task A video for this task is available at: Chapter 4 Hardware replacement procedures...
  • Page 44 • YouTube: https://www.youtube.com/playlist?list=PLYV5R7hVcs-CZwRXsocAOmi5RsaXDVZQG Step 1. If you are not already grounded, properly ground yourself. Step 2. Loosen the hook and loop strap binding the cables to the cable management device, and then unplug the system cables and SFPs (if needed) from the controller module, keeping track of where the cables were connected.
  • Page 45 A video for this task is available at: • YouTube: https://www.youtube.com/playlist?list=PLYV5R7hVcs-CZwRXsocAOmi5RsaXDVZQG Step 1. Locate the caching module at the rear of the controller module and remove it. Press the release tab. Remove the heatsink. The storage system comes with two slots available for the caching module and only one slot is occupied, by default.
  • Page 46 About this task A video for this task is available at: • YouTube: https://www.youtube.com/playlist?list=PLYV5R7hVcs-CZwRXsocAOmi5RsaXDVZQG Step 1. Locate the boot media using the following illustration or the FRU map on the controller module: Step 2. Press the blue button on the boot media housing to release the boot media from its housing, and then gently pull it straight out of the boot media socket.
  • Page 47 A video for this task is available at: • YouTube: https://www.youtube.com/playlist?list=PLYV5R7hVcs-CZwRXsocAOmi5RsaXDVZQG Step 1. Open the CPU air duct and locate the NVMEM battery. Battery lock tab NVMEM battery pack Step 2. Grasp the battery and press the blue locking tab marked PUSH, and then lift the battery out of the holder and controller module.
  • Page 48 • YouTube: https://www.youtube.com/playlist?list=PLYV5R7hVcs-CZwRXsocAOmi5RsaXDVZQG Step 1. Locate the DIMMs on your controller. Note: Each system memory DIMM has an LED located on the board next to each DIMM slot. Verify that each DIMM is operating properly using the LED states. Step 2. Note the orientation of the DIMM in the socket so that you can insert the DIMM in the replacement controller module in the proper orientation.
  • Page 49 Attention: Visually inspect the DIMM to verify that it is evenly aligned and fully inserted into the slot. Step 8. Repeat these steps for the remaining DIMMs. Moving a PCIe card To move PCIe cards, locate and move them from the old controller into the replacement controller and follow the specific sequence of steps.
  • Page 50 Step 3. Remove the PCIe card from the old controller module and set it aside. Make sure that you keep track of which slot the PCIe card was in. Step 4. Repeat the preceding step for the remaining PCIe cards in the old controller module. Step 5.
  • Page 51: Restoring And Verifying The System

    If your system is in... Then perform these steps... An HA pair The controller module begins to boot as soon as it is fully seated in the chassis. Be prepared to interrupt the boot process. 1. With the cam handle in the open position, firmly push the controller module in until it meets the midplane and is fully seated, and then close the cam handle to the...
  • Page 52 Verifying and setting the HA state of the controller module You must verify the state of the controller module and, if necessary, update the state to match your system configuration. Step 1. In Maintenance mode from the new controller module, verify that all components display the same state: ha-config show If your system is in...
  • Page 53 Step 2. If the displayed system state of the controller module does not match your system configuration, set the state for the controller module: ha-config modify controller ha-state Step 3. If the displayed system state of the chassis does not match your system configuration, set the state for the chassis: ha-config modify chassis ha-state Running system-level diagnostics...
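The two `ha-config modify` steps above apply only when a displayed state differs from the system configuration. A minimal sketch of that decision follows, assuming the states reported by `ha-config show` have already been read off the console. The helper name and the example state values `ha` and `non-ha` are used for illustration and are not taken from this excerpt.

```python
from typing import List


def ha_config_fix_commands(controller_state: str, chassis_state: str,
                           expected_state: str) -> List[str]:
    """Return the ha-config commands needed to match the expected HA state."""
    commands = []
    if controller_state != expected_state:
        commands.append(f"ha-config modify controller {expected_state}")
    if chassis_state != expected_state:
        commands.append(f"ha-config modify chassis {expected_state}")
    return commands
```

An empty list means both components already match and neither step is needed.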
  • Page 54 If you want to run diagnostic tests on... Then... Individual components 1. Clear the status logs: sldiag device clearstatus 2. Display the available tests for the selected devices: sldiag device show -dev dev_name dev_name can be any one of the ports and devices identified in the preceding step.
  • Page 55: Completing System Restoration

    Completing system restoration To complete the replacement procedure and restore your system to full operation, you must recable the storage, confirm disk reassignment, restore the Lenovo Storage Encryption configuration (if necessary), and install licenses for the new controller. Chapter 4...
  • Page 56 Recabling the system After running diagnostics, you must recable the controller module's storage and network connections. Step 1. Recable the system. If you removed the media converters (SFPs), remember to reinstall them if you are using fiber optic cables. ThinkSystem DM7000x Hardware Installation and Maintenance Guide...
  • Page 57 Reassigning disks If the storage system is in an HA pair, the system ID of the new controller module is automatically assigned to the disks when the giveback occurs at the end of the procedure. You must use the correct procedure for your configuration: Controller redundancy Then use this procedure...
  • Page 58 If giveback is vetoed After the giveback has been completed, confirm that the HA pair is healthy and that takeover is possible: storage failover show The output from the storage failover show command should not include the System ID changed on partner message. Step 6.
  • Page 59 If you did not receive the email with the license keys within 30 days, contact technical support. Step 1. If you need to retrieve the license keys, obtain the replacement license keys on Lenovo Features on Demand. For details, refer to DM Series Premium Feature Key Procedure.
  • Page 60: Completing The Replacement Process

    Exit admin privilege. Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 61: Opening The Controller Module

    Step 2. Disable automatic giveback from the console of the healthy controller using the following command: storage failover modify -node local -auto-giveback false Step 3. Take the degraded controller to the LOADER prompt by typing the following from the RJ45 Management Port of the degraded controller: storage failover takeover <degraded controller name>...
  • Page 62: Replacing A Pcie Card

    Thumbscrew Cam handle Step 5. Pull the cam handle downward and begin to slide the controller module out of the chassis. Make sure that you support the bottom of the controller module as you slide it out of the chassis. Replacing a PCIe card To replace a PCIe card, locate it within the controller and follow the specific sequence of steps.
  • Page 63: Reinstalling The Controller

    Side panel PCIe card Step 4. Remove the PCIe card from the controller module and set it aside. Step 5. Install the replacement PCIe card. Be sure that you properly align the card in the slot and exert even pressure on the card when seating it in the socket. The adapter must be fully and evenly seated in the slot.
  • Page 64: Completing The Replacement Process

    -ofnode impaired_node_name Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 65: Opening The Controller Module

    Before you begin • If you have a cluster with more than two controllers, check the health and Epsilon from advanced mode: cluster show -epsilon* • If the cluster is not in quorum or a controller that is not the degraded controller shows false for eligibility and health, correct the issue before proceeding to the next step.
  • Page 66: Replacing The Dimms

    Step 4. Loosen the thumbscrew on the cam handle on the controller module. Thumbscrew Cam handle Step 5. Pull the cam handle downward and begin to slide the controller module out of the chassis. Make sure that you support the bottom of the controller module as you slide it out of the chassis. Replacing the DIMMs To replace the DIMMs, locate them inside the controller and follow the specific sequence of steps.
  • Page 67 Attention: The NVMEM LED blinks while destaging contents to the flash memory when you halt the system. After the destage is complete, the LED turns off. • If power is lost without a clean shutdown, the NVMEM LED flashes until the destage is complete, and then the LED turns off.
  • Page 68 Step 7. Note the orientation of the DIMM in the socket so that you can insert the replacement DIMM in the proper orientation. Step 8. Slowly push apart the two DIMM ejector tabs on either side of the DIMM to eject the DIMM from its slot, and then slide it out of the slot.
  • Page 69: Reinstalling The Controller

    Step 13. Close the controller module cover. Reinstalling the controller After you replace a component within the controller module, you must reinstall the controller module in the system chassis and boot it to a state where you can run diagnostic tests on the replaced component. About this task A video for this task is available at: •...
  • Page 70 Step 2. Run diagnostics on the caching module: sldiag device run -dev fcache Step 3. Run diagnostics on the system memory: sldiag device run -dev mem Step 4. Verify that no hardware problems resulted from the replacement of the DIMMs: sldiag device status -dev mem -long -state failed System-level diagnostics returns you to the prompt if there are no test failures, or lists the full status...
  • Page 71 If the system-level diagnostics tests... Then... Were completed without any failures 1. Clear the status logs: sldiag device clearstatus 2. Verify that the log was cleared: sldiag device status The following default response is displayed: SLDIAG: No log messages are present. 3.
  • Page 72: Completing The Replacement Process

    8. Rerun the system-level diagnostic test. Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 73: Shutting Down The Degraded Controller

    Shutting down the degraded controller You can shut down or take over the degraded controller using different procedures, depending on the storage system hardware configuration. Shutting down the controller To shut down the degraded controller, you must determine the status of the controller and, if necessary, take over the controller so that the healthy controller continues to serve data for the degraded controller’s storage.
  • Page 74: Replacing The Boot Media

    Step 1. If you are not already grounded, properly ground yourself. Step 2. Loosen the hook and loop strap binding the cables to the cable management device, and then unplug the system cables and SFPs (if needed) from the controller module, keeping track of where the cables were connected.
  • Page 75 • A copy of the same ONTAP image version that the degraded controller was running. You can download the appropriate image from the Downloads section on the Lenovo Data Center Support Site. • If your system is an HA pair, you must have a network connection.
  • Page 76 About this task A video for this task is available at: • YouTube: https://www.youtube.com/playlist?list=PLYV5R7hVcs-CZwRXsocAOmi5RsaXDVZQG Step 1. Align the end of the controller module with the opening in the chassis, and then gently push the controller module halfway into the system. Step 2.
  • Page 77 If your system has... Then... A network connection 1. Press when prompted to restore the backup configuration. 2. Set the healthy controller to advanced privilege level: set -privilege advanced 3. Run the restore backup command: system node restore-backup -node local -target- address impaired_node_IP_address 4.
  • Page 78: Completing The Replacement Process

    Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 79: Opening The Controller Module

    Shutting down the controller To shut down the degraded controller, you must determine the status of the controller and, if necessary, take over the controller so that the healthy controller continues to serve data for the degraded controller’s storage. Before you begin •...
  • Page 80: Replacing The Rtc Battery

    the cables were connected. Leave the cables in the cable management device so that when you reinstall the cable management device, the cables are organized. Step 3. Remove and set aside the cable management devices from the left and right sides of the controller module.
  • Page 81: Reinstalling The Controller

    Step 3. Gently push the battery away from the holder, rotate it away from the holder, and then lift it out of the holder. Note: Note the polarity of the battery as you remove it from the holder. The battery is marked with a plus sign and must be positioned in the holder correctly.
  • Page 82: Completing The Replacement Process

    -ofnode impaired_node_name Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 83: Swapping Out A Power Supply

    • If you have a cluster with more than two controllers, check the health and Epsilon from advanced mode: cluster show -epsilon* • If the cluster is not in quorum or a controller that is not the degraded controller shows false for eligibility and health, correct the issue before proceeding to the next step.
  • Page 84 Open the power cord retainer, and then unplug the power cord from the power supply. Unplug the power cord from the power source. Step 3. Press down the release latch on the power supply cam handle, and then lower the cam handle to the fully open position to release the power supply from the mid plane.
  • Page 85: Swapping Out A Fan

    Step 4. Use the cam handle to slide the power supply out of the system. CAUTION: When removing a power supply, always use two hands to support its weight. Step 5. Repeat the preceding steps for any remaining power supplies. Step 6.
  • Page 86: Removing The Controller Module

    Cam handle Fan module Cam handle release latch Fan module Attention LED Step 4. Pull the fan module straight out from the chassis, making sure that you support it with your free hand so that it does not swing out of the chassis. CAUTION: The fan modules are short.
  • Page 87: Replacing A Chassis From Within The Equipment Rack Or System Cabinet

    Step 1. If you are not already grounded, properly ground yourself. Step 2. Loosen the hook and loop strap binding the cables to the cable management device, and then unplug the system cables and SFPs (if needed) from the controller module, keeping track of where the cables were connected.
  • Page 88: Installing The Controller

    Step 2. With the help of two or three people, slide the old chassis off the rack rails in a system cabinet or L brackets in an equipment rack and set it aside. Step 3. If you are not already grounded, properly ground yourself. Step 4.
  • Page 89 If your system is in... Then perform these steps... An HA pair 1. With the cam handle in the open position, firmly push the controller module in until it meets the midplane and is fully seated, and then close the cam handle to the locked position.
  • Page 90: Running System-Level Diagnostics

    Step 3. If you have not already done so, recable the rest of your system. Step 4. The next step depends on your system configuration. If your system is in... Then... An HA pair with a second controller module Exit Maintenance mode: halt The LOADER prompt appears.
  • Page 91 If the system-level diagnostics tests... Then... Were completed without any failures 1. Clear the status logs: sldiag device clearstatus 2. Verify that the log was cleared: sldiag device status The following default response is displayed: SLDIAG: No log messages are present. 3.
  • Page 92: Completing The Replacement Process

    5. Rerun the system-level diagnostics test. Completing the replacement process After you replace the part, you can return the failed part to Lenovo, as described in the RMA instructions shipped with the kit. Contact technical support at Lenovo Data Center Support if you need the RMA number or additional help with the replacement procedure.
  • Page 93: Chapter 5. System Level Diagnostics

    The following requirements must be met when running system-level diagnostics; otherwise, parts of the tests fail and error messages appear in the status report: General requirements • Each system being tested must be on a separate network.
  • Page 94: How To Use Online Command-Line Help

    The network interface test assigns unique static IP addresses, beginning with 172.25.150.23, to all available network interfaces on a storage system. This results in network interface ports on different storage controllers being assigned the same IP address. If all the systems being tested are on the same network, then duplicate ip address warning messages appear on the connected consoles.
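The address collision described above follows directly from every controller starting at the same address. The sketch below illustrates it under stated assumptions: that addresses are handed out consecutively from 172.25.150.23 (the guide gives only the starting address), and that the interface names are placeholders. The helper names are hypothetical.

```python
import ipaddress
from collections import Counter

# Starting address quoted in the guide.
FIRST_TEST_ADDRESS = ipaddress.IPv4Address("172.25.150.23")


def assign_test_addresses(interfaces):
    """Assign one address per interface, assuming consecutive allocation."""
    return {name: FIRST_TEST_ADDRESS + offset
            for offset, name in enumerate(interfaces)}


def duplicated_addresses(*assignments):
    """Return addresses that appear on more than one controller."""
    counts = Counter(addr for table in assignments for addr in table.values())
    return sorted(addr for addr, seen in counts.items() if seen > 1)


# Two controllers with the same (placeholder) interface names.
controller_a = assign_test_addresses(["e0a", "e0b"])
controller_b = assign_test_addresses(["e0a", "e0b"])
conflicts = duplicated_addresses(controller_a, controller_b)
```

Here `conflicts` contains both assigned addresses, which is why systems under test on a shared network report duplicate address warnings and why each one should be on a separate network.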
  • Page 95: Running System Installation Diagnostics

    [ ] (brackets) Indicate that the element inside the brackets is optional. { } (braces) Indicate that the element inside the braces is required. You can also type the question mark at the command line for a list of all the commands that are available at the current level of administration (administrative or advanced).
  • Page 96 • mem is system memory. • nic is a Network Interface Card not connected to a network. • nvram is nonvolatile RAM. • nvmem is a hybrid of NVRAM and system memory. • sas is a Serial Attached SCSI device not connected to a disk shelf. •...
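The device names listed above are the values passed to the `-dev` option, as in the `sldiag device run -dev mem` command used elsewhere in this guide. A minimal sketch of a lookup and command builder follows. The dictionary covers only the names visible in this excerpt, and `run_command_for` is a hypothetical helper that builds a string without executing anything.

```python
# Device names and descriptions as listed in this excerpt (the list is partial).
DEVICE_TYPES = {
    "mem": "system memory",
    "nic": "Network Interface Card not connected to a network",
    "nvram": "nonvolatile RAM",
    "nvmem": "hybrid of NVRAM and system memory",
    "sas": "Serial Attached SCSI device not connected to a disk shelf",
}


def run_command_for(dev_name: str) -> str:
    """Build the diagnostics command for a known device name."""
    if dev_name not in DEVICE_TYPES:
        raise ValueError(f"Device name not in the listed set: {dev_name!r}")
    return f"sldiag device run -dev {dev_name}"
```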
  • Page 97 Adapter reset OK Started Adapter Get Connection State Test. Connection State: 5 Loop on FC Adapter 0b is OPEN Started adapter Retry LIP test Adapter Retry LIP OK ERROR: failed to init adaptor port for IOCTL call ioctl_status.class_type = 0x1 ioctl_status.subclass = 0x3 ioctl_status.info = 0x0 Started INTERNAL LOOPBACK:...
  • Page 98: Running System Panic Diagnostics

    If the system-level diagnostics tests were completed without any failures, there are no hardware problems and your storage system returns to the prompt. 1. Clear the status logs by entering the following command: sldiag device clearstatus 2. Verify that the log is cleared by entering the following command: sldiag device status...
  • Page 99 Your storage system provides the following output while the tests are still running: There are still test(s) being processed. After all the tests are complete, you receive the following default response: *> <SLDIAG:_ALL_TESTS_COMPLETED> Step 5. Identify the cause of the system panic by entering the following command: sldiag device status -long -state failed Example...
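The two console responses quoted above can be told apart with a simple string check. The following Python sketch assumes the console output has already been captured as text by some other means; the function name and the "unknown" fallback are hypothetical.

```python
# Message strings as quoted in the manual.
RUNNING_MESSAGE = "There are still test(s) being processed."
COMPLETED_MARKER = "<SLDIAG:_ALL_TESTS_COMPLETED>"


def classify_status(console_output: str) -> str:
    """Return "completed", "running", or "unknown" for captured console text."""
    if COMPLETED_MARKER in console_output:
        return "completed"
    if RUNNING_MESSAGE in console_output:
        return "running"
    # Anything else is left for an operator to interpret.
    return "unknown"


print(classify_status("*> <SLDIAG:_ALL_TESTS_COMPLETED>"))
print(classify_status("There are still test(s) being processed."))
```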
  • Page 100: Running Slow System Response Diagnostics

    Error Count: 2 Run Time: 70 secs >>>>> ERROR, please ensure the port has a shelf or plug. END DATE: Sat Jan 3 23:12:07 GMT 2009 LOOP: 1/1 TEST END -------------------------------------------- If the system-level diagnostics tests were completed without any failures, there are no hardware problems and your storage system returns to the prompt.
  • Page 101 Step 1. At the storage system prompt, switch to the LOADER prompt: halt Step 2. Enter the following command at the LOADER prompt: boot_diags Note: You must run this command from the LOADER prompt for system-level diagnostics to function properly. The boot_diags command starts special drivers designed specifically for system-level diagnostics....
  • Page 102 Adapter Retry LIP OK ERROR: failed to init adaptor port for IOCTL call ioctl_status.class_type = 0x1 ioctl_status.subclass = 0x3 ioctl_status.info = 0x0 Started INTERNAL LOOPBACK: INTERNAL LOOPBACK Error Count: 2 Run Time: 70 secs >>>>> ERROR, please ensure the port has a shelf or plug. END DATE: Sat Jan 3 23:12:07 GMT 2009 LOOP: 1/1 TEST END --------------------------------------------...
  • Page 103 If the system-level diagnostics tests were completed without any failures, there are no hardware problems and your storage system returns to the prompt. 1. Clear the status logs by entering the following command: sldiag device clearstatus 2. Verify that the log is cleared by entering the following command: sldiag device status...
  • Page 104: Running Hardware Installation Diagnostics

    If the system-level diagnostics tests... Then... specified device or named device by disabling all others first. 2. Verify that the tests were modified by entering the following command: sldiag option show 3. Repeat Steps 3 through 5 of Running slow system response diagnostics.
  • Page 105 – serviceproc is the Service Processor. – storage is an ATA, FC-AL, or SAS interface that has an attached disk shelf. – toe is a TCP Offload Engine, a type of NIC. • mb specifies that all the motherboard devices are to be tested. •...
  • Page 106 INTERNAL LOOPBACK Error Count: 2 Run Time: 70 secs >>>>> ERROR, please ensure the port has a shelf or plug. END DATE: Sat Jan 3 23:12:07 GMT 2009 LOOP: 1/1 TEST END -------------------------------------------- If the system-level diagnostics tests were completed without any failures, there are no hardware problems and your storage system returns to the prompt....
  • Page 107: Running Device Failure Diagnostics

    Running device failure diagnostics Running diagnostics can help you determine why access to a specific device becomes intermittent or why the device becomes unavailable in your storage system. Step 1. At the storage system prompt, switch to the LOADER prompt: halt Step 2.
  • Page 108 *> sldiag device status fcal -long -state failed TEST START ------------------------------------------ DEVTYPE: fcal NAME: Fcal Loopback Test START DATE: Sat Jan 3 23:10:56 GMT 2009 STATUS: Completed Starting test on Fcal Adapter: 0b Started gathering adapter info. Adapter get adapter info OK Adapter fc_data_link_rate: 1Gib Adapter name: QLogic 2532 Adapter firmware rev: 4.5.2...
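A status log like the one above can be summarized programmatically once it has been saved as text. The Python sketch below is illustrative only: the sample is abridged from the log fragments in this manual, the line breaks are an assumption (the log is shown here as a single run of text), and the function and dictionary keys are hypothetical.

```python
import re

# Abridged sample; one field per line is assumed, not confirmed by the manual.
SAMPLE_LOG = """\
TEST START ------------------------------------------
DEVTYPE: fcal
NAME: Fcal Loopback Test
START DATE: Sat Jan 3 23:10:56 GMT 2009
STATUS: Completed
Starting test on Fcal Adapter: 0b
INTERNAL LOOPBACK Error Count: 2 Run Time: 70 secs
>>>>> ERROR, please ensure the port has a shelf or plug.
"""

HEADER_KEYS = ("DEVTYPE", "NAME", "START DATE", "STATUS")


def summarize_log(log_text: str) -> dict:
    """Collect header fields, the total error count, and ERROR lines."""
    summary = {"error_count": 0, "errors": []}
    for raw_line in log_text.splitlines():
        line = raw_line.strip()
        # Header fields look like "KEY: value" at the start of a line.
        for key in HEADER_KEYS:
            if line.startswith(key + ":"):
                summary[key] = line[len(key) + 1:].strip()
        # Sum every "Error Count: N" occurrence.
        count = re.search(r"Error Count:\s*(\d+)", line)
        if count:
            summary["error_count"] += int(count.group(1))
        # Keep lines that carry an upper-case ERROR marker.
        if re.search(r"\bERROR\b", line):
            summary["errors"].append(line)
    return summary


print(summarize_log(SAMPLE_LOG))
```

A summary of this kind only condenses what the log already reports; the corrective steps remain those given in the manual.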
  • Page 109 If the system-level diagnostics tests resulted in some test failures, determine the cause of the problem. 1. Exit Maintenance mode by entering the following command: halt 2. Perform a clean shutdown and disconnect the power supplies. 3. Verify that you have observed all the considerations identified for running system-level diagnostics, that cables are securely connected, and that hardware...
  • Page 110 If the system-level diagnostics tests... Then... status [-dev devtype|mb|slotslotnum] The following default response is displayed: SLDIAG: No log messages are present. 3. Exit Maintenance mode by entering the following command: halt 4. Enter the following command at the LOADER prompt to boot the storage system: boot_ontap You have completed system-level diagnostics....
  • Page 111: Appendix A. Getting Help And Technical Assistance

    Appendix A. Getting help and technical assistance If you need help, service, or technical assistance or just want more information about Lenovo products, you will find a wide variety of sources available from Lenovo to assist you. On the World Wide Web, up-to-date information about Lenovo systems, optional devices, services, and support are available at: http://datacentersupport.lenovo.com...
  • Page 112: Collecting Service Data

    Gathering information needed to call Support If you believe that you require warranty service for your Lenovo product, the service technicians will be able to assist you more efficiently if you prepare before you call. You can also see http:// for more information about your product warranty.
  • Page 113: Appendix B. Notice Of Privacy Practices

    Appendix B. Notice of Privacy Practices Lenovo recognizes that privacy is of great importance to individuals everywhere – our customers, website visitors, product users...everyone. This is why the responsible use and protection of personal and other information under our care is a core Lenovo value.
  • Page 115: Appendix C. Notices

    Lenovo representative for information on the products and services currently available in your area. Any reference to a Lenovo product, program, or service is not intended to state or imply that only that Lenovo product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any Lenovo intellectual property right may be used instead.
  • Page 116: Trademarks

    (TBW). A device that has exceeded this limit might fail to respond to system-generated commands or might be incapable of being written to. Lenovo is not responsible for replacement of a device that has exceeded its maximum guaranteed number of program/erase cycles, as documented in the Official Published Specifications for the device.
  • Page 117: Telecommunication Regulatory Statement

    This product may not be certified in your country for connection by any means whatsoever to interfaces of public telecommunications networks. Further certification may be required by law prior to making any such connection. Contact a Lenovo representative or reseller for any questions. Electronic emission notices When you attach a monitor to the equipment, you must use the designated monitor cable and any interference suppression devices that are supplied with the monitor.
  • Page 118: Taiwan Region Bsmi Rohs Declaration

    Taiwan region BSMI RoHS declaration Taiwan Region import and export contact information Contacts are available for Taiwan Region import and export information.
  • Page 119: Index

    DIMMs running diagnostics error correction codes (ECC), considerations for running system-level diagnostics installing 41, 60 verifying and setting HA state of locating 41, 60 collecting service data
  • Page 120 moving introduction removing 41, 60 replacing running diagnostics running system-level diagnostics verifying there is no content in NVMEM licenses DIMMs diagnostics installing for the replacement controller in ONTAP running LIFs DIMMs system-level diagnostics verifying home ports running locating the boot media 39, 68 error correction codes (ECC) M.2 PCIe card...
  • Page 121 HA state of the chassis installing licenses for system IDs verifying system ID changes on HA systems running verifying changes on HA systems running ONTAP ONTAP system operations replacement procedures
  • Page 122 workflow for completing system restoration device failures system restoration hardware installations workflow for requirements for running system-level diagnostics system-level diagnostics running system-level diagnostics requirements for running slow system response systems system installation considerations for replacing DIMMs in controller system panics modules considerations for replacing NVMEM battery in controller modules...

This manual is also suitable for:

7Y40, 7Y56
