Summary of Contents for Digital Equipment DEC 10000
Page 1
DEC 10000 AXP System VAX 10000 System Service Manual Order Number EK–1002A–SV.001 This manual tells how to add or replace CPU and memory modules in a DEC 10000 AXP or VAX 10000 system. digital equipment corporation maynard, massachusetts...
Page 2
The information in this document is subject to change without notice and should not be construed as a commitment by Digital Equipment Corporation. Digital Equipment Corporation assumes no responsibility for any errors that may appear in this document. The software, if any, described in this document is furnished under a license and may be used or copied only in accordance with the terms of such license.
FEPROM Recovery—Hardware Requirements ....2-14 FEPROM Recovery—Software Requirements and Setup . 2-16 FEPROM Recovery—Procedure .......... 2-18 Chapter 3 Updating Firmware Booting LFU on a DEC 10000 System ........3-2 Booting LFU on a VAX 10000 System ........3-4 Show ..................3-6 List ..................3-8...
Page 4
Module Barcode ..............1-7 Removing a Module from the LSB Card Cage ...... 1-8 Inserting a Module in the LSB Card Cage ......1-10 Sample Hardware Requirements for Remote FEPROM Recovery ................2-14 Tables DEC 10000/VAX 10000 Documentation ......... ix...
Preface Intended Audience This manual is written for Digital customer service engineers and self- maintenance customers servicing DEC 10000 AXP or VAX 10000 systems. This manual is a follow-on to Basic Troubleshooting and Advanced Trou- bleshooting. Document Structure This manual uses a structured documentation design. Topics are organ- ized into small sections for efficient on-line and printed reference.
Conventions Used in This Document Terminology. Unless specified otherwise, the use of "system" refers to either a DEC 10000 AXP or VAX 10000 system. The DEC 10000 AXP sys- tems use the Alpha AXP architecture. References in text use DEC 10000 to refer to DEC 10000 AXP systems.
Page 9
Table 1 DEC 10000/VAX 10000 Documentation Title Order Number Installation Kit EK–1000B–DK EK–1000B–SP Site Preparation Guide EK–100EB–IN Installation Guide EK–1001B–DK Hardware User Information Kit EK–1000B–OP Operations Manual EK–1000B–TS Basic Troubleshooting EK–1002A–DK Service Information Kit—VAX 10000 EK–1000A–SV Platform Service Manual EK–1002A–SV System Service Manual EK–1000A–PG...
Page 10
Table 1 DEC 10000/VAX 10000 Documentation (Continued) Title Order Number Reference Manuals EK–70C0B–TM Console Reference Manual EK–KA7AA–TM KA7AA CPU Technical Manual EK–KN7AA–TM KN7AA CPU Technical Manual EK–MS7AA–TM MS7AA Memory Technical Manual EK–70I0A–TM I/O System Technical Manual EK–7000A–TM Platform Technical Manual Upgrade Manuals EK–KA7AA–IN...
Page 11
Table 2 Related Documents Title Order Number General Site Preparation EK–CSEPG–MA Site Environmental Preparation Guide System I/O Options EK–BA350–CG BA350 Modular Storage Shelf Subsystem Configuration Guide EK–BA350–UG BA350 Modular Storage Shelf Subsystem User’s Guide EK–350LA–UG BA350-LA Modular Storage Shelf User’s Guide EK–CIXCD–UG CIXCD Interface User Guide EK–DEMFA–IP...
Page 12
Table 2 Related Documents (Continued) Title Order Number Operating System Manuals EY–L520E–DP Alpha Architecture Reference Manual AA–PJU7A–TE DEC OSF/1 Guide to System Administration AA–PQYAA–TK DECnet for OpenVMS Network Management Utilities AA–PS2DA–TE Guide to Installing DEC OSF/1 AA–PQYSA–TE OpenVMS Alpha Version 1.0 Upgrade and Installation Manual AA–PRAHA–TE VMS Upgrade and Installation Supplement:...
Chapter 1 Adding or Replacing CPUs and Memories This chapter provides information on how to remove and install processor and memory modules in DEC 10000 and VAX 10000 systems. Sections in- clude: • What Is Required • LSB Configuration Rules •...
1.1 What Is Required Adding or replacing processor or memory modules is a simple op- eration. Afterward you must verify that the new modules are rec- ognized in the system. You may need to set system parameters. Figure 1-1 LSB Card Cage Slots System Cabinet...
Page 15
Processor and memory modules reside in the LSB card cage, a centerplane card cage in the system cabinet with nine slots for modules. The LSB card cage always contains an IOP module, a clock module, and at least one proc- essor and one memory module (see Figure 1-1).
1.2 LSB Configuration Rules The first CPU module is node 0, and the first memory module is at node 7. The LSB bus requires that an IOP module be at node 8. See Figure 1-2. Figure 1-2 LSB Configuration Rules Front Rear Power Filter...
The LSB card cage (see Figure1-2) has nine slots. Slot numbers are equivalent to node numbers. Four slots are at the front of the cabinet (nodes 0 through 3, right to left), and five slots are at the rear (nodes 4 through 8, right to left).
1.3 Identifying the Kernel FRUs Table 1-1 lists the field-replaceable units (FRUs) for DEC 10000 and VAX 10000 systems that are discussed in this book. Table 1-1 Field-Replaceable Units Option No. Part No. Description E2040-AB DEC 10000 CPU module KN7AA-AB...
Each memory or processor board is enclosed in the module case, protect- ing the module electronics from static discharge. A barcode label gives in- formation about the module, including the module part number, revision level, and the module serial number (see Figure 1-4). Figure 1-4 Module Barcode E2043-AA...
1.4 Removing a Module from the LSB Card Cage Use the following procedure to remove a module from the LSB card cage for replacement or reconfiguration. Figure 1-5 Removing a Module from the LSB Card Cage BXB-0090B-92 1-8 Adding or Replacing CPUs and Memories...
Page 21
Perform an orderly shutdown of the system. Turn the keyswitch on the front control panel to the Disable position and wait for the control panel yellow Fault LED to stop flashing. When the Fault LED stops flashing, power has been removed from the LSB backplane and you may safely proceed.
1.5 Inserting a Module in the LSB Card Cage Use the following procedure when replacing or adding a module in the system card cage during maintenance or upgrade. Figure 1-6 Inserting a Module in the LSB Card Cage BXB-0091A-92 1-10 Adding or Replacing CPUs and Memories...
Page 23
Follow Steps 1 through 6 in Section 1.4 and then: If you are adding a module, remove the filler module from the slot where you will install the new module. Hold the filler module firmly on the vertical piece closest to you and gently pull it out toward you. Place it aside for return.
1.6 Verifying the System Power up the system and check that all processor and memory modules appear in the self-test display. Example 1-1 Self-Test Display NODE # C0 XMI + C1 XMI + . A1 A0 . .128 128 . 256Mb Firmware Rev = SROM Rev =...
Page 25
Power up the system by turning the keyswitch from Disable to either the Enable or Restart position. Power sequencing begins and the system runs self-test. Check the self-test display to make sure that the system recog- nizes the newly installed modules. Example 1-1 shows the self-test display of a system in which one processor and one memory module were added.
Chapter 2 Servicing the CPU This chapter describes how to service a CPU in a DEC 10000 or a VAX 10000 system should it break or should new CPUs be added to a system. Some CPU firmware problems are covered in this chapter; others are cov- ered in Chapter 3.
Off or On Resets system and displays self-test results boot_reset during booting. Default is off for VAX 10000 and on for DEC 10000. The default device or device list from which bootdef_dev booting is attempted when no device name is specified by the boot command.
Page 29
Table 2-1 shows the permanent environment variables stored in EEPROM. Some of these variables must be set when either adding a CPU or replac- ing a broken one. You may view these variables by typing show * at the console prompt. Volatile environment variables are initialized by a system reset;...
Page 30
Applies only when using the test com- mand. dump_dev Null Device to which dump file is written if system crashes (DEC 10000 only). enable_audit If set to on, enables the generation of audit trail messages used to track boot to determine location of boot failure.
Page 31
Another important environment variable, not shown in Table 2-1, because it is not a default variable but may be defined by the customer, is a nick- name. Should the customer want to have a default boot path for a cluster and a different local one, a nickname variable may be used for that pur- pose.
2.2 How to Replace the Only Processor When replacing the only processor in a system, you must store the system ID and customized boot paths. If the customer changed console environment variables from the default values, you will want to set them as the customer wishes. Example 2-1 Replacing a Single Processor >>>...
Page 33
After you have removed and replaced the defective module, following the instructions in Chapter 1, take the following steps: Power up the system. Self-test is run and you need to decide whether the new CPU module is functioning properly. If it is not, try reseating the new CPU and/or refer to the Advanced Troubleshooting manual;...
2.3 How to Replace the Boot Processor In cases where the boot processor in a multiprocessing system is the CPU that is in need of repair, you need to manipulate which CPU receives data from the console terminal. Example 2-2 Replacing the Boot Processor NODE # C0 XMI + C1 XMI +...
Page 35
There are at least two factors to consider when replacing a primary CPU: The desire to retain the system environment. The possibility that the new CPU is at a higher or lower firmware re- vision than other CPUs in the system. shows the mismatch mes- sage should the firmware differ between CPUs.
2.4 How to Add a New Processor or Replace a Secondary Processor Add a new secondary in the slot to the left of the boot processor or other secondary processors. Example 2-3 Adding or Replacing a Secondary Processor NODE # C0 XMI + C1 XMI + .
Page 37
There are at least two factors to consider when adding or replacing a CPU: The desire to retain the system environment. The possibility that the new CPU is at a higher firmware revision than other CPUs in the system. shows the mismatch message should the firmware differ between CPUs.
2.5 Build EEPROM Command Should the EEPROM become corrupted, you can use the build eeprom command to recover. The build eeprom command is the proper response to the console error messages shown in Example 2-4. If the build eeprom command fails, return the module for re- pair.
Page 39
Most of the module unified 2-5-2.4 part number can also be found on the barcode. The first number is 80, the second for the DEC 10000 is E2040 and for the VAX 10000 is E2045, the third is a two-letter mod- ule variant, like AA, and the fourth is the hardware revision.
ROM forces a prompt, AXP- or VAX- 7000/10000-FRRC>, on the console terminal. Figure 2-1 Sample Hardware Requirements for Remote FEPROM Recovery InfoServer Source VAX Ethernet DEMNA DMB32 10000 System RRD42 (DEC 10000 only) RS232 Cable Console Port BXB-0005H-92 2-14 Servicing the CPU...
Page 41
• The AXP/VAX 10000 Console CD-ROM, with the console/diagnostic code on it. The file name for the console/diagnostic code is AXP7000_10000_CONSOLE_IMAGE.GROM for the DEC 10000 sys- tem and VAX7000_10000_CONSOLE_IMAGE.GROM for the VAX 10000 system. The serial line receive program in the serial ROM of the CPU is called FRRC (FEPROM recovery code).
2.7 FEPROM Recovery—Software Requirements and Setup On the source system you need to "bind" the RRD42 or InfoServer to a virtual disk container, mount it, and set the terminal speed to that of the target console. Example 2-5 Setting Up the Source System $ set term/speed=9600/perm txa3: # Set DMB32 port to # 9600 baud.
Page 43
Example 2-5 illustrates the steps needed to prepare Kermit with Open- VMS VAX. What you do is: First make sure that you have the hardware necessary to perform the task. Make sure you have the correct CD-ROM for the damaged system. Set the terminal speed on the source system to 9600.
2.8 FEPROM Recovery—Procedure After Kermit has been set up and you are ready to downline load the file, AXP or VAX7000_10000_CONSOLE_IMAGE.GROM, connect to the target system, prepare it to receive the file and then load it. The final steps are to copy the file into the FEPROMs and boot the system.
Page 45
Example 2-6 illustrates a VAX 10000 recovery. Follow the same steps for the DEC 10000, using a AXP7000_10000_console_image.grom file. At the Kermit prompt, connect to the target system. Here are two ex-...
Both the LFU program and the firmware microcode images it writes are supplied on a CD-ROM. You start LFU on DEC 10000 systems by booting the RRD42. On VAX 10000 systems you start LFU by booting the Info- Server on your Ethernet.
3.1 Booting LFU on a DEC 10000 System LFU is supplied on the DEC 7000/10000 AXP Console CD- ROM (Part Number AG-PQW3*-RE, where * is the letter 10000 that denotes the disk revision). Make sure this CD-ROM is mounted in the RRD42 in-cabinet CD drive. Boot LFU from the CD-ROM.
Page 49
Use the show device command to find the name of the RRD42 CD drive. Enter the boot command to boot from the RRD42. The RRD42 has a device name of dka100. LFU starts, displays a summary of its commands, and issues its prompt (Function?).
3.2 Booting LFU on a VAX 10000 System LFU is supplied on the VAX 7000/10000 Console CD-ROM (Part Number AG-PQW1*-RE, where * is the letter that de- 10000 notes the disk revision). Make sure this CD-ROM is mounted in one of the system’s InfoServers. Boot the In- itial System Load (ISL) program, and select the service cor- responding to the console CD-ROM.
Page 51
Copyright Digital Equipment Corporation 1992 All Rights Reserved. Loadable Environment Rev: V1.0-1625 Jul 12 1992 10:50:56 ***** Loadable Firmware Update Utility ***** Version 2.1 16-jun-1992 ------------------------------------------------------------------- Function Description ------------------------------------------------------------------- Display Displays the system’s configuration table. Exit Return to loadable offline operating environment.
3.3 Show The show command shows the current revision of firmware and hardware for every module in the system that contains microcode. In the display, each module that needs to be updated is indicated by a plus sign (+) following the device mnemonic. Example 3-3 Show Command Function? show Device Mnemonic(s)? ?
Page 53
If you type just the command show without a device mnemonic, LFU prompts for the device mnemonic. All the commands that require de- vice mnemonics will prompt. If you enter ? (or help) for the device, a table displays the syntax for specifying devices.
3.4 List The list command displays the inventory of update firmware on the CD-ROM. Only the devices listed at your terminal are sup- ported for firmware updates. Example 3-4 List Command Function? l Loadable Firmware Update Utility Version 2.1 Name Mnemonic Update Firmware Update Hardware...
Page 55
The list command shows the revisions of firmware corresponding to the revisions of hardware for each device. (There may be several hardware revisions for a particular device, but only one firmware re- vision corresponds to any hardware revision.) Comparing the output of the list and show commands helps you understand which devices should receive firmware updates.
3.5 Update The update command writes new firmware from the CD-ROM to the module. Then LFU automatically verifies the update by read- ing the new firmware image from the module back into memory and comparing it with the CD-ROM image. Example 3-5 Update Command Function? update kn7aa0 cixcd0...
Page 57
Continue? [Y/(N)] y WARNING: updates may take several minutes to complete for each device. DO NOT ABORT! demna0 Updating to 6.6... Reading Device... Verifying 6.6... PASSED. Function? update demna* Update all demna? [Y/(N)] n Function? This command specifically requests firmware updates for the CPU and CIXCD modules.
3.6 Exit The exit command terminates the LFU program, causes system in- itialization and self-test, and returns to the system console prompt. Example 3-6 Exit Command Function? show Device Mnemonic(s)? exit Function? exit Initializing... NODE # C0 XMI + C1 XMI + .
Page 59
From within the "Device Mnemonic(s)?" prompt, exit returns to the Function prompt. At the Function prompt, exit causes the system to be initialized. The console prompt appears. Updating Firmware 3-13...
3.7 Display and Verify Commands Display and verify commands are used in special situations. Display shows the physical configuration. Verify repeats the veri- fication process performed by the update command. Example 3-7 Display and Verify Commands Function? disp Name Type Mnemonic FW Rev HW Rev...
Page 61
Display shows the system physical configuration. Display is equivalent to issuing the console command show configuration. Because it shows the LSB slot for each module, display can help you identify unknown devices. Verify reads the firmware from the module into memory and com- pares it with the update firmware on the CD-ROM.
3.8 How to Update Corrupted Firmware If LFU identifies a device as unknown, the firmware on the mod- ule is corrupted. The update command allows you to specify the correct device type so that new firmware can be written to the module.
3.9 How to Modify Device Attributes The modify command can change parameters stored in EEPROM on the following devices: KZMSA (DEC 10000 system), KFMSA (VAX 10000), DEC LANcontroller 400 (DEMNA), and CIXCD. The attributes are specific to each device. Example 3-9 Modify Command...
Page 65
Enter new value (HEX) or <CR> to keep present value: Secondary Lock Retries? [(0020)] Enter new value (HEX) or <CR> to keep present value: Modify DSSI Timeouts? [Y/(N)]n Modify DSSI Retries? [Y/(N)]n Modify XMI Timeouts? [Y/(N)]n Finished display/modify parameters? [(Y)/N]y Function? m demna0 demna0 Remote Boot:...
Page 66
The CIXCD has only one parameter: the hardware revision. You would need to modify the value only if the EEPROM had become cor- rupted. When you modify the KFMSA, LFU first displays all the parameters for both ports. You select which port to modify. LFU prompts for parameters by category.
Appendix A Kermit Parameters To transmit a file using Kermit, the following parameters must be set: Kermit-32> show all VMS Kermit-32 version 3.3.111 Block check type One character checksum Debugging Delay 5 (sec) Server sends NAKs every 75 seconds while waiting for a command Escape character 035 (octal)
Page 68
End of line character 015 (octal) Quoting character 043 (octal) 8-bit quoting character 046 (octal) Start of packet 001 (octal) Transmit parameters Delay 0.0 (sec) Echo Repeat quoting character 176 (octal) Kermit Parameters...
Need help?
Do you have a question about the DEC 10000 and is the answer not in the manual?
Questions and answers