Page 4
Note Before using this information and the product it supports, read the information in “Safety notices” on page vii, “Notices” on page 173, the IBM Systems Safety Notices manual, G229-9054, and the IBM Environmental Notices and User Guide, Z125–5823. This edition applies to IBM Power Systems servers that contain the POWER7 processor and to all associated models.
Page 6
Verifying that the hot-plug PCI tools are installed on the Linux partition . 123 Removing and replacing a PCI adapter contained in a cassette in an IBM i partition that is powered on . . 123 Preparing for hot-plug SCSI device or cable deconfiguration.
You should also refer to the safety information documentation any time you do not clearly understand any safety information in the U.S. English publications. Replacement or additional copies of safety information documentation can be obtained by calling the IBM Hotline at 1-800-300-8751.
Page 10
Electrical voltage and current from power, telephone, and communication cables are hazardous. To avoid a shock hazard: v Connect power to this unit only with the IBM provided power cord. Do not use the IBM provided power cord for any other product.
Page 11
Observe the following precautions when working on or around your IT rack system: v Heavy equipment–personal injury or equipment damage might result if mishandled. v Always lower the leveling pads on the rack cabinet. v Always install stabilizer brackets on the rack cabinet. v To avoid hazardous conditions due to uneven mechanical loading, always install the heaviest devices in the bottom of the rack cabinet.
Page 12
CAUTION: Removing components from the upper positions in the rack cabinet improves rack stability during relocation. Follow these general guidelines whenever you relocate a populated rack cabinet within a room or building: v Reduce the weight of the rack cabinet by removing equipment starting at the top of the rack cabinet.
Page 13
(L003) All lasers are certified in the U.S. to conform to the requirements of DHHS 21 CFR Subchapter J for class 1 laser products. Outside the U.S., they are certified to be in compliance with IEC 60825 as a class 1 laser product.
Page 14
Exchange only with the IBM-approved part. Recycle or discard the battery as instructed by local regulations. In the United States, IBM has a process for the collection of this battery. For information, call 1-800-426-4333. Have the IBM part number for the battery unit available when you call. (C003)
Linux operating system, see AIX and Linux problem analysis. – If the server or partition is running the IBM i operating system, see IBM i problem analysis. 7. To perform a repair operation from the management console, complete the following steps: v For HMC, complete the following steps: a.
Page 17
2. Select the check box next to the name of the server you want to enable the tasks for that server. 3. From the Tasks menu, click Operations > Power on. Follow all additional instructions in the interface. To power on a managed server by using an IBM Systems Director Management Console (SDMC), complete the following steps: 1.
Page 18
Figure 1. Control panel with labels Press the power button (A), as shown in the preceding figure, on the control panel. 4. Observe the following after pressing the power button: v The power-on light flashes slowly (standby) and then becomes solid. v The system cooling fans are activated after approximately 30 seconds and begin to accelerate to operating speed.
3. At the command line, enter one of the following commands: v If your system is running the AIX operating system, type shutdown. v If your system is running the IBM i operating system, type PWRDWNSYS *IMMED. If your system is running the Linux operating system, type shutdown -h.
Page 20
Look at the display. – If the dominant operating system is IBM i, look for a sign-on display that shows DSP01 in the upper right corner. DSP01 is the name that the system assigns to the primary console.
Is there more than one logical partition listed? Determine whether the system has multiple virtual servers by performing the following from the IBM Systems Director Management Console (SDMC): a. On the Welcome page, under the Resources tab, select Virtual Servers.
Page 22
3. Place the 571F/575B card set adapter on an ESD protective surface and orient it as shown in the following figure. 4. Disconnect battery cable (A) and the SCSI cable (B) from the 571F storage adapter. Leave the other end of the cables attached to the 575B auxiliary cache adapter.
Page 23
5. To prevent possible card damage, first loosen all five retaining screws (C) before removing them. After all five retaining screws have been loosened, remove the screws (C) from the 571F storage adapter. 6. Carefully lift the 571F storage adapter off the standoffs and set it on the ESD protective surface. Common service procedures...
Page 24
7. Turn the 571F storage adapter over so that the components are facing up, and locate the cache directory card (D) on the 571F storage adapter. It is the small rectangular card mounted on the I/O card. 8. Unseat the connector on the cache directory card by wiggling the two corners that are farthest from the mounting pegs.
Page 25
9. Move the cache directory card to the replacement 571F storage adapter and seat it on the connector and mounting pegs. 10. To reassemble the cards, perform steps 1 - 9 in reverse order. To prevent possible card damage, insert all five screws (C) before tightening any of them.
c. Slide the light pipe assembly (D) between the 571F/575B cards. Ensure that the threaded screw hole located on the light pipe assembly is aligned with the screw clearance hole located on the card. Return to the procedure that sent you here. This ends this procedure. Determining which processor is the secondary service processor Use this procedure to determine which service processor on your system is the secondary service processor.
1. The system gathers information about an error as the system detects the error. 2. The product activity log (PAL) records and stores the error information. When using IBM i the Service Action Log (SAL) serves a similar purpose as the PAL. The SAL uses an error analysis framework to analyze the error and details the recommended repair actions.
Page 28
2. Are you using the management console? Yes: Open the server with the problem, and then go to the IBM i partition with the problem and use the service action log (SAL) to find the entry related to this problem (see “Searching the service action log”...
Page 30
8. If there is no location information in the Location field, are there additional words displayed at the top of the screen? v Yes: Use the additional words to find the address information: – If characters 7 and 8 of word two are 60: - The direct select address is word four in the format BBBBCcbb.
Port Device Session 11. IBM i communication-diagnostic displays that are shown by the commands VFYCMN, WRKPRB, and ANZPRB use the following address format. Values for x are explained in the following table. x1 x2 x3x4 Table 3. Decimal address values...
Page 32
Table 4. SRC format examples Panel function, digits x to y (from Device detected word left to IOP SRC IOP SRC a failure number right) format 13 format 17 format 27 format 29 format 60 format 61 format 62 format 63 11, 1 - 8 TTTT TTTT...
Table 5. Description of SRC format examples (continued) SRC format Description LMMM Level indicator plus 3-digit model number (for example, 0001). lmmm Level indicator plus 3-digit model number of failing device. 0MMM Model number of failing device (that is, MMM=001). MIGV EP General system status.
Table 6. Unit address (E) definitions Unit address value Unit address data Unit address definition Reserved Communications I/O bus Adapter Port Channel Storage I/O adapter I/O bus Controller Device Workstation I/O bus Adapter Port Device Session Auxiliary processor I/O bus Auxiliary Adapter Port...
mmmm Model number of failing component (hex 2xxx - 9FFF). This value might be 0000 if system data is not available at the time the SRC is signaled. CCCC Cause code. NNNN Number of disk device facilities missing. MIGV EP General system status (see “System reference code information”...
a. Go to the list of reference codes and use the reference code that is indicated in the log to find the correct reference code table and unit reference code. b. Perform all actions in the Description/Action column before replacing failing items. Note: When replacing failing items, the part numbers and locations found in the service action log entry should be used.
v Yes: Perform the following steps: – Check that the workstation is operational. – Verify that the cabling and addressing for the workstation is correct. – Perform any actions indicated in the system operator message. If you need further assistance, contact your next level of support. This ends the procedure. 3.
The system service tools (SST) run one or more Licensed Internal Code or hardware service functions under the control of the operating system. SST performs service functions concurrently with the customer's application programs. The printer that is used for output must be an SCS-type data stream printer that is attached to the same workstation input/output processor (IOP) or storage media unit.
Page 39
Use this function key to change the amount of storage that all traces use. You might want to increase the amount of storage, if your system has many active traces or traces that will be active for a long time. v Display buffer size or display trace status (F11) With this function key, you can select and alternate between buffer size and trace status.
Page 40
Starting a communications trace The following is an example of a Start Trace display. Start Trace Configuration object .._________ Type ....1 1=Line 2=Network interface 3=Network server Trace description .
Page 41
3. The minimum value that is allowed for the beginning byte value for local area network (LAN) protocols (including frame relay protocols and ATM protocols) is 72 bytes. When all required options are complete, press the Enter key. The Work with Communications Trace display is shown.
Page 42
1. To view all the data associated with the trace, press the Enter key without changing to the defaults on the Format a Trace Data display . The information is not in any special format. 2. Select the option to format and print only the data that you want to see. 3.
Record timer The time that each event occurs. Depending on the communications hardware that is being used, the record timer is one of the following values: 1. A time of day value, HH:MM:SS.NNNNN (where H=hours, M=minutes, S=seconds, and N=subseconds), based on the system time when the trace was stopped. 2.
Page 44
Display Detail Report for Resource Serial Resource Name Type Model Number Name STORAGE 6512 10-4294009 SI05 Log ID ..: 06020132 Sequence ..: 2248 Date ..: 05/20/95 Time .
Page 45
If the rightmost hexadecimal digit of offset 000182 is 2, then the unit address is hexadecimal offset 192 - 195 Record this address information. Return to the procedure that sent you here, if one. Display Hexadecimal Report for Resource Serial Resource Name Type...
Examples: Obtaining additional information from hexadecimal reports You can get additional information for IOP 90xx reference code product activity log entries at DST by using a type D IPL. Notes: v The term I/O processor indicates an IOP or an IOA. Formatting is available (F4=Additional Information) for IPL from disk (type A or B IPL).
Page 47
Table 8. Mapping of format number to table reference (continued) Format number Reference the following tables Table 18 on page 47 Example: Device formatting log (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6607 00-68753 Offset...
Page 48
Example: Device formatting log template Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6607 00-68753 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------01 0001A0...
Page 49
Example: Array member formatting log (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 F4407800 65320001 90210210 0FFFFFFF 4 ....
Page 50
Example: Array member formatting log template Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------02 0001A0...
Page 51
Example: Configuration formatting log (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000180 50407800 65320001 90120210 0FFFFFFF &...
Page 52
Example: Array addendum formatting log (unformatted information) Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 2757 10-0322005 DC03 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F EBCDIC 000000 C5D3F0F5 000000A0 00E00180 82745D0F...
Page 53
Example: Array addendum log template Display Hexadecimal Report for Resource Serial Resource Name Type Model Number Name STORAGE 6532 00-50011 Offset 0 1 2 3 4 5 6 7 8 9 A B C D E F 000180 -------- -------- -------- -------- 000190 -------- -------- -------- ------04 0001A0...
Page 54
Table 9. Format 01 (continued) Description Hex offset New device 1 serial number 0001F6 - 0001F9 Device 1 IOA type and model 0001FA - 0001FD Device 1 IOA serial number 0001FE - 000201 Device 1 Cache type and model 000202 - 000205 Device 1 Cache serial number 000206 - 000209 Device 2 type...
Page 55
Table 10. Format 02 (continued) Description Hex offset Array member 3 expected unit address 00020E - 000211 Array member 3 current unit address 000212 - 000215 Array member 4 type 000216 - 000219 Array member 4 serial number 00021A - 00021D Array member 4 expected unit address 00021E - 000221 Array member 4 current unit address...
Page 56
Table 12. Format 04 Description Hex offset Array member 10 type 0001B6 - 0001B9 Array member 10 serial number 0001BA - 0001BD Array member 10 expected unit address 0001BE - 0001C1 Array member 10 current unit address 0001C2 - 0001C5 Array member 11 type 0001C6 - 0001C9 Array member 11 serial number...
Page 57
Table 14. Format 12 Description Hex offset Current I/O card type and model 0001C4 - 0001CB Current I/O card serial number 0001CC - 0001D3 Current I/O card worldwide ID 0001D4 - 0001DB Current cache type and model 0001DC - 0001E3 Current cache serial number 0001E4 - 0001EB Current cache worldwide ID...
Page 58
Table 15. Format 13 (continued) Description Hex offset Device 2 cache type and model 0002A4 - 0002AB Device 2 cache serial number 0002AC - 0002B3 Device 2 cache worldwide ID 0002B4 - 0002BB Device 3 type 0002BC - 0002BF Device 3 serial number 0002C0 - 0002C7 Device 3 worldwide ID 0002C8 - 0002CF...
Page 59
Table 16. Format 14 (continued) Description Hex offset Array member 4 type 00027C - 00027F Array member 4 serial number 000280 - 000287 Array member 4 worldwide ID 000288 - 00028F Array member 4 expected unit address 000290 - 000293 Array member 4 current unit address 000294 - 000297 Array member 5 type...
Page 60
Table 16. Format 14 (continued) Description Hex offset Array member 12 type 00035C - 00035F Array member 12 serial number 000360 - 000367 Array member 12 worldwide ID 000368 - 00036F Array member 12 expected unit address 000370 - 000373 Array member 12 current unit address 000374 - 000377 Array member 13 type...
An operational workstation on bus 1 as either a primary console or an operations console. v One workstation capable of communicating with each system partition. v A valid service tools user ID and password to sign on to DST. IBM supplies service tools user IDs that have different levels of privileges.
Note: For more information about how to identify the primary console, see “Primary consoles or alternative consoles” on page 5. 2. Sign on to DST. Type QSRV as the valid user ID and get the password from the customer. If the customer has changed the full DST authority user ID or password, ask the customer for the correct values.
Function Description Displays the machine type, model, processor feature code, processor class indicator, and IPL path description. This function is available in both normal and manual operating mode. Causes the Use Dedicated Service Tool display to appear on the system console. This function is available only in the manual operating mode and when activated by the operating system.
Perform the following steps: 1. Select a type B IPL in manual mode. Attention: For systems with logical partitions, you must power down the secondary partitions normally before performing an IPL on the primary partition to avoid possible data loss. 2.
Page 66
For more information on how to identify the primary console, see Primary consoles or alternative consoles. 7. Load the media and make the media unit ready. 8. Press the Power button to power on the system. Notes: a. Some types of removable media units automatically reset. In this condition, SRC A100 1933 (Media device not ready) might be displayed.
Starting a service tool Use this procedure to start a service tool. Select this option from the Use Dedicated Service (DST) Tools display. Note: The Start a service tool option is also available from the System Service Tools (SST) display. The following options are available from the Start a Service Tool display: v Display/alter dump v Licensed Internal Code log...
v Resource name, type-model, serial number, and logical address 4. Legend v Descriptions of the indicators v Logical address format information 132-character width printouts consist of the following: v Description v Type-module v Serial number v Location data – Frame ID –...
v High-speed link resources 3. Press F6 (Print) to print the information. This ends the procedure. Hardware service manager Use the hardware service manager (HSM) to display and work with system hardware. You can select the hardware service manager option from the Start a Service Tool display. With HSM you can display and work with system hardware (logical or packaging) and debug input/output processors (IOP) and devices.
Use the following information to help you manage the ASMI: Accessing the Advanced System Management Interface You can access the Advanced System Management Interface (ASMI) through a web browser, an ASCII console, or the management console. Accessing the Advanced System Management Interface by using a web browser You can connect a PC or notebook computer to the server to access the Advanced System Management Interface (ASMI).
browser. For example, if you connected your PC or notebook computer to HMC1, type https://169.254.2.147 in your PC's or notebook computer's web browser. 6. When the login display appears, enter one of the following default user IDs and passwords. Table 20. Default user IDs and passwords for the web interface User ID Password general...
You can access the Advanced System Management Console Interface (ASMI) through the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). To access ASMI by using the HMC, complete the following steps: 1. Ensure that the HMC is set up and configured.
3. Log on to ASMI by using the administrator user ID and password. Displaying error and event logs You can view error and event logs that are generated by various service processor firmware components. The content of these logs can be useful in solving hardware or server firmware problems. To perform the display operation, your authority level must be one of the following values: v General v Administrator...
system when creating the location codes. This task must be done by using the ASMI, not with the control panel. However, if you do not have access to the ASMI, the system will still operate without updating this information. For example, when replacing the I/O backplane, you must re-enter the original enclosure serial number into the Enclosure serial number field to overwrite the serial number that is recorded for the new I/O backplane.
You can view manufacturer's VPD stored from the system boot prior to the one in progress now. To perform this operation, your authority level must be one of the following levels: v General v Administrator v Authorized service provider To display the VPD, complete the following steps: 1.
Attention: Misuse of service tools or using them with inappropriate security safeguards might result in loss of data, programs, or other objects on the system. Working with storage dumps This information describes how to work with storage dumps. A main storage dump (MSD) is a process of collecting data from the system's main storage. For more information on main storage dumps, including platform system dumps, see Performing a platform or main storage dump.
Page 77
You can use the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC) to perform a system dump, which is a collection of data from the server hardware and firmware.
Use this procedure only under the direction of your next level of support or your service provider, after first providing to your next level of support or service provider a dump of the operating system. Use the Advanced System Management Interface (ASMI) to start a service processor dump on an IBM Systems Director Management Console (SDMC).
Note: If you have a management console, you must manage your platform dumps with the management console. However, you will need to manage IBM i main storage dumps with the IBM i service tools. To copy a dump by using the HMC, complete the following steps: 1.
To copy a dump by using the SDMC, complete the following steps: 1. From the main IBM Systems Director welcome page, click View support files under the Service and Support Manager link. Alternatively, select Manage support files from the main Service and Support Manager summary panel.
Copying a platform system or SP dump: To copy a platform system or SP dump with the IBM i tools, complete the following steps: Note: If you have a management console, you must manage your platform dumps with the management console.
This ends the procedure. Reporting a dump Find out how to report a dump by using the management console, the IBM i service tools, an AIX command, or a Linux command. Reporting a dump by using a management console: You can report a dump by using the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC).
To report a dump by using the SDMC, complete the following steps: 1. From the main IBM Systems Director welcome page, select View support files under the Service and Support Manager link. Alternatively, click Manage support files from the main Service and Support Manager summary page.
Reporting a main storage dump, a platform system dump, or an SP dump by using the IBM i service tools: You can report a dump by using the IBM i service tools. If your system has the main storage dump enabled for auto copy, your system might have automatically copied the current main storage dump (MSD) to the auxiliary storage pool (ASP) by using the dump description Automatic copy.
6. Click Selected > Delete Dump. To manually delete a dump on the SDMC, complete the following steps: 1. From the main IBM Systems Director welcome page, select View support files under the Service and Support Manager link. Alternatively, click Manage support files from the main Service and Support Manager summary page.
Performing an IBM i main storage dump A main storage dump is a collection of data from the IBM i main storage. It can be manually initiated when the system administrator suspects a system wait or loop, or automatically by the server firmware as the result of a system failure.
Switch back to normal mode and wait for the dump to complete. If function 22 was selected, SRC A2D0 3000 on the console indicates that the main storage dump initiated by the user was successful. (SRC B6xx xxxx indicates that a main storage dump initiated by the IBM i operating system was successful.) 7.
If your system is running AIX, type shutdown v If your system is running IBM i, type pwrdwnsys *immed v If your system is running Linux, type shutdown -h now 3.
v Click Slow in the Set System Boot Speed field. v Click Running in the Boot to system server firmware field. 7. Select Save Settings. Note: These settings will apply to all future boots. After you complete the service action, change the settings back to what you recorded in step 5 on page 74.
– Operations Console cable attached to the 2609 or 2612 (part number 97H7555), 2699 (part number 97H7556), or 2721 (part number 97H7557) with 5250 emulation or Rumba active on PC Finding the primary console when the system is operational Various methods are available to find the primary console when the system is operational. The following methods are available to find the primary console: v Method 1: Look for a sign-on display with DSP01 shown in the upper-right corner.
7. Click OK. Checking for a duplicate IP address You can check for a duplicate IP address by using the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). To check for a duplicate IP address by using the HMC, complete the following steps: 1.
For AIX, go to Replace the disk drive using AIX. For Linux, go to Replace the disk drive using Linux . For IBM i, go to Replace the disk drive using IBM i. Replacing the disk drive nonconcurrently Use this procedure to remove and replace a disk drive nonconcurrently.
Page 93
4. Install the bezel on the new disk drive: a. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. b. Remove the disk drive from its static protective package. c. Unlock the handle of the replacement disk drive by squeezing and pulling it out toward you. If the handle is not all the way out, the disk drive will not slide into the system.
5. Install the disk drive: a. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. b. Slide the disk drive into the system until it stops. c.
Page 95
2. Remove the disk drive as a resource: a. Log in as root user or use CE Login. b. At the command line, type diag and press Enter. c. Press Enter to continue. d. On the Function Selection display, select Task Selection. e.
Page 96
A list of the disk drives that you can remove is shown. If the disk drive you want to remove does not appear on the list, ask your system administrator to put the disk drive in the failed state before continuing with this procedure. For information, see AIX System Management Guide: Operating System and Devices.
Page 97
5. Install the disk drive: a. Support the bottom of the disk drive as you align it with the guide rails in the system unit. Do not hold the disk drive by the handle. b. Slide the disk drive into the system until it stops. c.
Sign on the operator console with at least service level authority. b. Type strsst on the command line of the IBM i session and press Enter. c. Type your service tools user ID and service tools password on the Start Service Tools (STRSST) Sign On display.
Page 99
4. Determine if the failing disk drive is parity protected: Check the status of the failing disk drive for one of the following items: Note: An ASP with a status of unprotected might contain disk drives that are device parity protected.
Page 100
d. Locate the concurrent maintenance light that corresponds to the position of the disk drive that you are replacing. Important: When you press Enter, after a one minute delay, this light comes on and begins to blink rapidly. You then have nine seconds to remove the disk drive. e.
Page 101
12. Install the bezel on the new disk drive: a. Find the package that contains the new disk drive. Attention: Disk drives are fragile. Handle with care. b. Remove the disk drive from its static protective package. c. Unlock the handle of the replacement disk drive by squeezing and pulling it out toward you. If the handle is not all the way out, the disk drive will not slide into the system.
Page 102
When the Hardware Service Manager display is shown, press F3. 16. Rebuild the data on the replacement disk drive by: a. If necessary, start System Service Tools (SST) by typing strsst on the command line of the IBM i session and pressing Enter.
1. Log in as root user. 2. Type iprconfig on the command line of the Linux session and press Enter. The IBM Power RAID Configuration Utility display is shown. 3. Select Analyze log. Press Enter. The Kernel Messages Log display is shown.
Page 104
10. Type iprconfig on the command line and press Enter. The IBM Power RAID Configuration Utility display is shown. 11. Select Display hardware status from the IBM Power RAID Configuration Utility display. Press Enter. The Display Hardware Status display is shown.
Page 105
Disk Unit Hardware Resource Information Details Manufacturer ....: IBM Product ID ....: ST336607LC Firmware Version .
Removing the disk drive 1. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 2. From the Work with Disk Unit Recovery display, select Concurrent remove device. Press Enter. A Concurrent Device Remove display is shown, similar to the following display.
Page 107
drive will not slide out of the system or expansion unit. Figure 29. Removing a disk drive from the system unit Figure 30. Removing a disk drive from the system unit 8. Support the bottom of the disk drive as you slide it out of the system or expansion unit. Do not hold the disk drive by the handle.
9. Loosen and pull out on the thumbscrew (B) to release the disk-drive bezel (C) from the disk drive, as shown in the following figure. Figure 31. Removing a disk-drive bezel 10. Press Enter on the IBM Power RAID Configuration Utility to indicate that you have removed the disk drive. Replacing the disk drive 1.
Page 109
Figure 32. Attaching a bezel to the disk drive 5. From the IBM Power RAID Configuration Utility display, select Work with disk unit recovery. Press Enter. 6. From the Work with Disk Unit Recovery display, select Concurrent add device. Press Enter.
Page 110
Figure 34. Installing a disk drive in the system unit Figure 35. Installing a disk drive in the system unit 11. Press Enter on the Complete Device Concurrent Add display to indicate that the disk drive is installed. 12. Rebuild the data on the replacement disk drive. For instructions, go to “Rebuilding data on a replacement disk drive by using Linux”...
AIX partition that is powered on. If you are doing a concurrent repair by using the IBM i operating system, go to Removing and replacing a PCI adapter contained in a cassette in an IBM i partition that is powered on.
Page 112
7. At the Task Selection list, select Hot Plug Manager, and then select PCI Hot Plug Manager. 8. Select Unconfigure a Device, and then press Enter. 9. Press F4 (or Esc+4) to display the Device Names menu. 10. Select the adapter you are removing in the Device Names menu. 11.
Page 113
Figure 37. Rotate the EMC shield into the open position 20. Remove the cassette. Lift up the lower cassette handle (B) as shown in the following figure. Pull the PCI cassette (C) out of the system. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling.
Page 114
Figure 38. PCI adapter cassette removed from the system unit 21. Place the cassette with the cover facing up on an approved ESD surface. The cover will have a label on it. 22. Install the replacement adapter into the PCI adapter cassette by completing the following steps. If the cassette is not going to contain a replacement adapter, use this same procedure to place an adapter filler panel in the cassette.
Page 115
Figure 39. PCI adapter cassette cover removed b. Remove the adapter from the cassette by doing the following steps: 1) Unlock the adapter retainers by rotating the retainer clip (A) into the horizontal position. See Figure 40 on page 102. Notes: v The edge of the adapter located at the end of the cassette that contains the cassette handles is called the adapter tailstock.
Page 116
Figure 40. Adapter removed from the PCI adapter cassette c. Place the new adapter in the cassette. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. If the cassette is not going to contain a PCI adapter, use this same procedure to place an adapter filler panel in the cassette.
Page 117
Figure 41. Adapter removed from the PCI adapter single-width cassette 5) Position the adapter retainers to support the adapter, and then rotate the retainer clip (B) into the closed position. See Figure 41. Note: Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock.
Page 118
Figure 42. Long adapter in the PCI adapter cassette with the supports and stabilizer in place Adapter-cassette retainer placement for medium-length adapters v Remove the adapter stabilizer (C). See Figure 43 on page 105. v Place and lock the retainers (B). v Ensure the adapter edge is seated in each retainer groove (A).
Page 119
Figure 43. Medium-length adapter in the PCI adapter cassette with the supports in place Adapter-cassette retainer placement for short adapters v Remove the adapter stabilizer (C). See Figure 44 on page 106. v Place the hookarm (D) into the hole in the corner of the adapter. This supports the card when it is undocked from the connector on the system backplane.
Page 120
Figure 44. Short adapter in the PCI adapter cassette with the supports and the hookarm in place d. Replace the cassette cover by doing the following steps: 1) Slide the cover (B) into position on the cassette. 2) While holding the cover latch (A) in the open position, place the cover over the pivot pin (C). 3) Release the cover latch to lock the cover into place.
Page 121
Figure 45. PCI adapter cassette cover replaced 23. At the back of the system, lift the cassette cover flap and identify the cassette slot you want to use. 24. Ensure the lower cassette handle is pressed up toward the retainer clip. This places the adapter in the correct position to be docked in the system.
Page 122
Figure 46. PCI adapter cassette removed from the system unit 28. Connect the adapter cables. 29. Lower the PCI adapter EMC shield (A) into the closed position, close the shield latch, and then close the rear rack door.
Page 123
Figure 47. PCI adapter EMC shield in the closed position 30. Return to the AIX screen. Choose one of the following options:. v If you are replacing an adapter, go to step 31. v If you removing an adapter without replacing it, follow these steps: a.
37. Press F3 or Esc+3 to return to TASK Selection List. 38. Select Hot Plug Task, and then press enter. 39. Select PCI Hot Plug Manager, then select Configure a defined device, and then press Enter. 40. Select the device just replaced from the list, and then press Enter. The device is now configured. 41.
Page 125
Figure 48. Lift up on the EMC shield Common service procedures...
Page 126
Figure 49. Rotate the EMC shield into the open position 7. Remove the cassette. Lift up the lower cassette handle (B) as shown in the following figure. Pull the PCI cassette (C) out of the system. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling.
Page 127
Figure 50. PCI adapter cassette removed from the system unit 8. Place the cassette with the cover facing up on an approved ESD surface. The cover will have a label on it. 9. Install the replacement adapter into the PCI adapter cassette by completing the following steps. If the cassette is not going to contain a replacement adapter, use this same procedure to place an adapter filler panel in the cassette.
Page 128
Figure 51. PCI adapter cassette cover removed b. Remove the adapter from the cassette by doing the following steps: 1) Unlock the adapter retainers by rotating the retainer clip (A) into the horizontal position. See Figure 52 on page 115. Notes: v The edge of the adapter located at the end of the cassette that contains the cassette handles is called the adapter tailstock.
Page 129
Figure 52. Adapter removed from the PCI adapter cassette c. Place the new adapter in the cassette. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. If the cassette is not going to contain a PCI adapter, use this same procedure to place an adapter filler panel in the cassette.
Page 130
Figure 53. Adapter removed from the PCI adapter single-width cassette 5) Position the adapter retainers to support the adapter, and then rotate the retainer clip (B) into the closed position. See Figure 53. Note: Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock.
Page 131
Figure 54. Long adapter in the PCI adapter cassette with the supports and stabilizer in place Adapter-cassette retainer placement for medium-length adapters v Remove the adapter stabilizer (C). See Figure 55 on page 118. v Place and lock the retainers (B). v Ensure the adapter edge is seated in each retainer groove (A).
Page 132
Figure 55. Medium-length adapter in the PCI adapter cassette with the supports in place Adapter-cassette retainer placement for short adapters v Remove the adapter stabilizer (C). See Figure 56 on page 119. v Place the hookarm (D) into the hole in the corner of the adapter. This supports the card when it is undocked from the connector on the system backplane.
Page 133
Figure 56. Short adapter in the PCI adapter cassette with the supports and the hookarm in place d. Replace the cassette cover by doing the following steps: 1) Slide the cover (B) into position on the cassette. 2) While holding the cover latch (A) in the open position, place the cover over the pivot pin (C). 3) Release the cover latch to lock the cover into place.
Page 134
Figure 57. PCI adapter cassette cover replaced 10. At the back of the system, lift the cassette cover flap and identify the cassette slot you want to use. 11. Ensure the lower cassette handle is pressed up toward the retainer clip. This places the adapter in the correct position to be docked in the system.
Page 135
Figure 58. PCI adapter cassette removed from the system unit 15. Connect the adapter cables. 16. Lower the PCI adapter EMC shield (A) into the closed position, close the shield latch, then close the rear rack door. Common service procedures...
Page 136
Figure 59. PCI adapter EMC shield in the closed position 17. Run the lsslot command to verify that the slot is occupied. For example, Enter lsslot -c pci -s U7879.001.DQD014E-P1-C3 The following is an example of the information displayed by this command: # Slot Description Device(s) U7879.001.DQD014E-P1-C3 PCI-X capable,...
IBM i partition that is powered on Procedures performed on a PCI adapter in an IBM i partition that is powered on, also known as hot-plug procedures, require the system administrator to take the PCI adapter offline prior to performing the operation.
Page 138
3. Ensure that any processes or applications that might use the adapter are stopped. 4. Go to your terminal and type strsst on the command line of the Main Menu and press Enter. 5. Type your service tools user ID and service tools password on the System Service Tools (SST) Sign On display.
Page 139
Figure 60. Lift up on the EMC shield Common service procedures...
Page 140
Figure 61. Rotate the EMC shield into the open position 19. Remove the cassette. Lift up the lower cassette handle (B) as shown in the following figure. Pull the PCI cassette (C) out of the system. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling.
Page 141
Figure 62. PCI adapter cassette removed from the system unit 20. Place the cassette with the cover facing up on an approved ESD surface. The cover will have a label on it. 21. Install the replacement adapter into the PCI adapter cassette by completing the following steps. If the cassette is not going to contain a replacement adapter, use this same procedure to place an adapter filler panel in the cassette.
Page 142
Figure 63. PCI adapter cassette cover removed b. Remove the adapter from the cassette by doing the following steps: 1) Unlock the adapter retainers by rotating the retainer clip (A) into the horizontal position. See Figure 64 on page 129. Notes: v The edge of the adapter located at the end of the cassette that contains the cassette handles is called the adapter tailstock.
Page 143
Figure 64. Adapter removed from the PCI adapter cassette c. Place the new adapter in the cassette. Attention: A cassette containing either a PCI adapter or filler panel must be placed in the PCI adapter slot of the system unit for proper air flow and cooling. If the cassette is not going to contain a PCI adapter, use this same procedure to place an adapter filler panel in the cassette.
Page 144
Figure 65. Adapter removed from the PCI adapter single-width cassette 5) Position the adapter retainers to support the adapter, and then rotate the retainer clip (B) into the closed position. See Figure 65 Note: Two retainers are located at the top of the cassette, along the top edge of the adapter. Two more retainers are located at the edge of the cassette opposite of the adapter tailstock.
Page 145
Figure 66. Long adapter in the PCI adapter cassette with the supports and stabilizer in place Adapter-cassette retainer placement for medium-length adapters v Remove the adapter stabilizer (C). See Figure 67 on page 132. v Place and lock the retainers (B). v Ensure the adapter edge is seated in each retainer groove (A).
Page 146
Figure 67. Medium-length adapter in the PCI adapter cassette with the supports in place Adapter-cassette retainer placement for short adapters v Remove the adapter stabilizer (C). See Figure 68 on page 133. v Place the hook arm (D) into the hole in the corner of the adapter. This supports the card when it is undocked from the connector on the system backplane.
Page 147
Figure 68. Short adapter in the PCI adapter cassette with the supports and the hook arm in place d. Replace the cassette cover by doing the following steps: 1) Slide the cover (B) into position on the cassette. 2) While holding the cover latch (A) in the open position, place the cover over the pivot pin (C). 3) Release the cover latch to lock the cover into place.
Page 148
Figure 69. PCI adapter cassette cover replaced 22. At the back of the system, lift the cassette cover flap and identify the cassette slot you want to use. 23. Ensure the lower cassette handle is pressed up toward the retainer clip. This places the adapter in the correct position to be docked in the system.
Page 149
Figure 70. PCI adapter cassette removed from the system unit 27. Connect the adapter cables. 28. Lower the PCI adapter EMC shield (A) into the closed position, close the shield latch, then close the rear rack door. Common service procedures...
Figure 71. PCI adapter EMC shield in the closed position 29. Return to your terminal and the IBM i sign on screen. 30. Select Power on domain on the Hardware Resource Concurrent Maintenance display and press Enter. 31. Select Assign to on the resource that has an asterisk (*) on the Work with Controlling Resource display.
Attention: If the server is not managed by a management console and there is no IBM i service partition, the expansion unit power cannot be turned off with the server power turned on.
Page 152
Then, click Deactivate LED to turn off the LED. v From IBM i use System (or Dedicated) Service Tools > Start a service tool > Hardware service manager > Packaging hardware resources > Concurrent maintenance > Toggle LED blink off/on.
Page 153
SPCN system. This should be done by using the OptiConnect user interface, a feature of IBM i. If that is not successful due to problems with the OptiConnect user interface, it may be attempted as described in . Switching ownership of an expansion units switchable resources.
Page 154
If all partitions that own I/O resources in the expansion unit have been powered off, skip this step and go to step 13. v In IBM i partitions: – Work with the customer to vary off all configuration descriptions (for example, ctl, device, line, network server, etc.) that use resources in the expansion unit.
Page 155
Reporting partition type: RPA Return code type: Failure (hard stop or user intervention required). Message: com.ibm.hsc.client.rmc.HSCRMCException: Query operation for slot resource has failed on partition 2. The partition command is: drmgr -Q -c slot -s U0595.001.435DF30-CB1-C02 -w 0 -d 3...
Page 156
Attention: If the server is not managed by a management console and there is no IBM i service partition, the expansion unit power cannot be turned off with the server power turned on.
Page 157
If the procedure is being done from an IBM i partition, the system prevents the procedure from being done from a console or terminal session connected through the expansion unit being powered off. A different console or terminal session not connected through the expansion unit being powered off must be found, or the server must be powered off.
3. Select Packaging hardware resources. 4. Select Concurrent maintenance. If the server is not managed by a management console and there is no IBM i service partition, the enclosure cannot be powered off and on with the server powered on.
Page 159
4. Wait for I/O resources in the enclosure to be brought back online to a functional state, and then continue with the next step. It may take between five and ten minutes after the enclosure is powered on for IBM i resources in the enclosure to return to a operating state. Using the Packaging Hardware Resources and Logical...
– Work with the customer to configure all logical resources that were unconfigured when the enclosure was powered off. Use the cfgmgr command. v In IBM i partitions: – Work with the customer to vary on all configuration descriptions (ctl, device, line, network server, etc.) that were varied off when the enclosure was powered off.
“Verifying the repair in AIX” v “Verifying the repair in Linux” on page 152 v “Verifying a repair by using an IBM i system or logical partition” on page 151 Verifying the repair in AIX You can use this procedure to verify that a repair is complete using the AIX operating system.
Page 162
Did you encounter any problems? Reinstall the operating system and continue with step 5. If the original problem still exists, replace the field-replaceable unit (FRU) or perform the isolation procedure that is next in the FRU list. If you have reached the end of the FRU list, contact your next level of support.
Page 163
1. Enter diag at the command prompt. 2. Press Enter. 3. Select the Diagnostics Routines option. 4. When the Diagnostic Mode Selection menu is displayed, select System verification. 5. When the Diagnostic Selection menu is displayed, select the All Resources option or test the FRUs you exchanged and any devices that are attached to the FRUs you exchanged by selecting the diagnostics for the individual FRU.
Page 164
When a test is run on a resource in system verification mode, that resource has an entry in the AIX error log. If the test on the resource was successful, the Resource Repair Action menu displays. After replacing that FRU, you must select the resource for that FRU from the Resource Repair Action menu.
Verifying a repair by using an IBM i system or logical partition Use this procedure to verify a repair by using the IBM i operating system. 1. Was the system powered off during the repair? Yes: Continue with the next step.
Yes: Continue with the next step. No: Go to step 14. 12. Is the new reference code the same as the original reference code? Yes: Continue with the next step. No: A new symptom might have occurred. Go to the Start of call procedure to determine the cause of the problem.
Verifying the repair from the management console Perform these procedures to close problem numbers, clear hardware messages, and prepare the server to return to the customer by using the management console. Follow this checklist before performing the procedures: v You return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned.
You can do this task from the Hardware Management Console (HMC) or the IBM Systems Director Management Console (SDMC). If you want to be alerted if another problem occurs, you must deactivate the system attention LED so that it can be activated again if another problem occurs.
You can activate or deactivate an identify LED for components attached to the system from the Hardware Management Console (HMC) or from the IBM Systems Director Management Console (SDMC). The system provides several LEDs that help identify various components in the system, such as enclosures or field-replaceable units (FRUs).
5. Select one or more FRUs from the table, and then click either Activate LED or Deactivate LED. The associated LED is either turned on or off. Deactivating a system attention LED or logical partition LED by using the Advanced System Management Interface You can deactivate a system attention LED or a logical partition LED by using the Advanced System Management Interface (ASMI).
1. Ping the remote server to ensure network connectivity. ping testcase.software.ibm.com Note: Where testcase.software.ibm.com represents the remote location, which can be accessed by the support center organization. 2. Set the directory that contains the .pax file to be the working directory. Use the ftp command to connect to the remote server.
8. If the support center representative directs you to send the .pax file as an email attachment, send it to shift@us.ibm.com. Note: Do not send a .pax file unless directed to do so by support. This email address is not monitored.
Page 173
5. From the list of serviceable events recorded in step 4 on page 158, complete the following steps 6 - step 32 on page 161 for each open service action event. 6. Determine the error class of the serviceable event. Record for future use. 7.
Page 174
15. Add the partition list obtained in step 11 on page 159 to the existing list of Axx partitions obtained from processing previous service action events in this maintenance analysis procedure (MAP). 16. Remove all entries in the list of all partitions you recorded in step 11 on page 159. If you are referred to the list of partitions obtained in step 11 on page 159 in future steps, the list is empty.
Page 175
v Yes: Continue with the next step. v No: Go to step 32. 27. Perform all the following steps for each entry in the list of all partitions you recorded in step 11 on page 159, except for the partition you were using to debug the original problem. 28.
35. From the list of Axx partitions, open the management console virtual terminal window of a partition , and then type diag at the AIX command prompt. 36. When the diagnostic operating instructions are displayed, complete the following steps: 1. Press Enter. 2.
Page 177
1. Did you use a hot-swap operation by using an AIX diagnostics service aid to change the FRU? v Yes: Go to step 4 v No: Continue with the next step. 2. Do you have any field-replaceable units (FRUs) (for example cards, adapters, cables, or devices) that were removed during problem analysis that you want to put back into the system? Note: If the system backplane or battery has been replaced and you are loading diagnostics from a server over a network, it might be necessary for the customer to set the network boot information...
Page 178
If the system attention LED is still on, turn off the LED as described in “Activating and deactivating LEDs” on page 154. This completes the repair. Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair.
Page 179
v Yes: Go to step 13. v No: Continue with the next step. 10. Did the TESTING COMPLETE, no trouble was found menu (801010) display? v Yes: Continue with the next step. v No: There is still a problem. Go to Problem analysis. 11.
To select the resource for the replaced FRU, complete the following steps: 1. Select the resource associated with the repair action: v If the repair action was reseating a cable or adapter, select the resource associated with that repair action. v If the resource associated with your repair action is not displayed on the Resource List, select sysplanar0.
Page 181
Follow this checklist before performing the procedure: v Return the server to the state that the customer normally uses, such as IPL type, IPL mode, and the way the system is configured or partitioned. Attention: Before returning the system to the customer, remove the system from service mode. If the system is left in service mode, it automatically places a call for service every two hours.
Page 182
9. Is the FRU list identical that is, the same FRUs, same number of FRUs, and same order of FRUs) to the FRU list of the error code recorded in step 1 on page 167? v Yes: Go to step 11. v No: Continue with the next step.
Page 183
20. Add or update the FRU information: Did you replace, add, or modify a FRU of the open service action event? v Yes: Continue with the next step. v No: Go to step 22. 21. Click OK to close the service action event. 22.
Page 184
v Yes: Continue with the next step. v No: Go to step 25 on page 169 to process the next partition in the list you recorded in step 11 on page 168. 30. Have all the serviceable events recorded in step 4 on page 167 been processed? v Yes: Continue with the next step.
Page 185
Note: If, during the processing of the list of open service action events, some service action events remained open, further service actions might be required to complete the repair. Common service procedures...
IBM, the IBM logo, and ibm.com are trademarks or registered trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. A current list of IBM trademarks is available on the web at Copyright and trademark information at www.ibm.com/legal/copytrade.shtml.
Page 189
Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. IBM is not responsible for any radio or television interference caused by using other than recommended cables and connectors or by unauthorized changes or modifications to this equipment.
Page 190
The following is a summary of the EMI Taiwan statement above. Warning: This is a Class A product. In a domestic environment this product may cause radio interference in which case the user will be required to take adequate measures. IBM Taiwan Contact Information:...
Page 191
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw.
Consult an IBM-authorized dealer or service representative for help. Properly shielded and grounded cables and connectors must be used in order to meet FCC emission limits. Proper cables and connectors are available from IBM-authorized dealers. IBM is not responsible for...
Page 193
This product is in conformity with the protection requirements of EU Council Directive 2004/108/EC on the approximation of the laws of the Member States relating to electromagnetic compatibility. IBM cannot accept responsibility for any failure to satisfy the protection requirements resulting from a non-recommended modification of the product, including the fitting of non-IBM option cards.
Page 194
Um dieses sicherzustellen, sind die Geräte wie in den Handbüchern beschrieben zu installieren und zu betreiben. Des Weiteren dürfen auch nur von der IBM empfohlene Kabel angeschlossen werden. IBM übernimmt keine Verantwortung für die Einhaltung der Schutzanforderungen, wenn das Produkt ohne Zustimmung von IBM verändert bzw.
Permissions for the use of these publications are granted subject to the following terms and conditions. Applicability: These terms and conditions are in addition to any terms of use for the IBM website. Personal Use: You may reproduce these publications for your personal, noncommercial use provided that all proprietary notices are preserved.
Need help?
Do you have a question about the Power System POWER7 Series and is the answer not in the manual?
Questions and answers