Sun Microsystems Sun Fire T1000 Service Manual

Sun Fire
T1000 Server

Service Manual

Sun Microsystems, Inc.
www.sun.com
Part No. 819-3248-13
January 2007, Revision A
Submit comments about this document at:
http://www.sun.com/hwdocs/feedback

Summary of Contents for Sun Microsystems Sun Fire T1000

  • Page 1: Service Manual

    Sun Fire™ T1000 Server Service Manual Sun Microsystems, Inc. www.sun.com Part No. 819-3248-13 January 2007, Revision A Submit comments about this document at: http://www.sun.com/hwdocs/feedback...
  • Page 2 Fujitsu Limited or Sun Microsystems, Inc., or any affiliate of either of them.
  • Page 3 No part of this product, these technologies, or this document may be reproduced in any form, by any means, without the prior written authorization of Fujitsu Limited and Sun Microsystems, Inc., and their licensors, if any.
  • Page 5: Table Of Contents

    Contents
      Preface xiii
      Safety Information 1–1
        Safety Information 1–1
        Safety Symbols 1–1
        Electrostatic Discharge Safety 1–2
          1.3.1 Using an Antistatic Wrist Strap 1–2
          1.3.2 Using an Antistatic Mat 1–2
      Server Overview 2–1
        Server Overview 2–1
        Obtaining the Chassis Serial Number 2–3
      Server Diagnostics 3–1
        Overview of Server Diagnostics 3–1
          3.1.1...
  • Page 6 Using the fmdump Command to Identify Faults 3–41 3.5.2 Clearing PSH Detected Faults 3–44 Collecting Information From Solaris OS Files and Commands 3–45 3.6.1 Checking the Message Buffer 3–45 3.6.2 Viewing System Message Log Files 3–46 Sun Fire T1000 Server Service Manual • January 2007...
  • Page 7 Managing Components With Automatic System Recovery Commands 3– 3.7.1 Displaying System Components 3–47 3.7.2 Disabling Components 3–48 3.7.3 Enabling Disabled Components 3–49 Exercising the System With SunVTS 3–49 3.8.1 Checking Whether SunVTS Software Is Installed 3–49 3.8.2 Exercising the System Using SunVTS Software 3–50 3.8.3 Using SunVTS Software 3–51 Preparing for Servicing 4–1...
  • Page 8 Final Service Procedures 6–1 6.1.1 Replacing the Top Cover 6–1 6.1.2 Reinstalling the Server Chassis in the Rack 6–1 6.1.3 Applying Power to the Server 6–2 A. Field-Replaceable Units A–1 Index Index–1
  • Page 9 Figures: FIGURE 2-1 Server 2–1; FIGURE 2-2 Server Components 2–2; FIGURE 2-3 Server Front Panel 2–2; FIGURE 2-4 Server Rear Panel 2–3; FIGURE 3-1 Diagnostic Flowchart 3–3; FIGURE 3-2 LEDs on the Server Front Panel 3–8; FIGURE 3-3 LEDs on the Server Rear Panel 3–9; FIGURE 3-4 ALOM CMT Fault Management 3–12...
  • Page 10 ...FIGURE 5-10; FIGURE 5-11 DIMM Locations 5–15; FIGURE 5-12 Removing the Clock Battery From the Motherboard 5–22; FIGURE 5-13 Installing the Clock Battery on the Motherboard 5–23; FIGURE A-1 Field-Replaceable Units A–2
  • Page 11 Tables: TABLE 3-1 Diagnostic Flowchart Actions 3–4; TABLE 3-2 Front and Rear Panel LEDs 3–10; TABLE 3-3 Power Supply LEDs 3–11; TABLE 3-4 Service-Related ALOM CMT Commands 3–14; TABLE 3-5 ALOM CMT Parameters Used for POST Configuration 3–23; TABLE 3-6 ALOM CMT Parameters and POST Modes 3–26; TABLE 3-7 ASR Commands 3–47...
  • Page 13: Preface

    Preface
    The Sun Fire T1000 Server Service Manual provides information to aid in troubleshooting problems with and replacing components within the Sun Fire™ T1000 server. This manual is written for technicians, service personnel, and system administrators who service and repair computer systems. The person qualified to use this manual:
    ■ Can open a system chassis, identify, and replace internal components
    ■...
  • Page 14: Typographic Conventions

    * The settings on your browser might differ from these settings.
    Shell Prompts
    Shell                                    Prompt
    C shell                                  machine-name%
    C shell superuser                        machine-name#
    Bourne shell and Korn shell              $
    Bourne shell and Korn shell superuser    #
  • Page 15: Additional Service Related Information

    In addition to this service manual, the following resources are available to help you keep your server running optimally:
    ■ Product Notes – The Sun Fire T1000 Server Product Notes (819-3246) contain late-breaking information about the system including required software patches, updated hardware and compatibility information, and solutions to known issues.
  • Page 17: Sun Welcomes Your Comments

    Sun is interested in improving its documentation and welcomes your comments and suggestions. You can submit your comments by going to: http://www.sun.com/hwdocs/feedback
    Please include the title and part number of your document with your feedback: Sun Fire T1000 Server Service Manual, part number 819-3248-13
  • Page 19: Safety Information

    CHAPTER Safety Information
    This chapter provides important safety information for servicing the server. The following topics are covered:
    ■ Section 1.1, “Safety Information” on page 1-1
    ■ Section 1.2, “Safety Symbols” on page 1-1
    ■ Section 1.3, “Electrostatic Discharge Safety”...
  • Page 20: Electrostatic Discharge Safety

    1.3.2 Using an Antistatic Mat
    Place ESD-sensitive components such as the motherboard, memory, and other PCB cards on an antistatic mat.
  • Page 21: Server Overview

    CHAPTER Server Overview
    This chapter provides an overview of the server. Topics include:
    ■ Section 2.1, “Server Overview” on page 2-1
    ■ Section 2.2, “Obtaining the Chassis Serial Number” on page 2-3
    Server Overview
    The server is a high-performance, entry-level server that is highly scalable and very reliable (FIGURE 2-1...
  • Page 22: Figure 2-2 Server Components

    [FIGURE 2-2 Server Components. Callouts: DIMMs, fan tray assembly, power supply, hard drive.]
    [FIGURE 2-3 Server Front Panel. Callouts: Locator LED/button, Service Required LED, Power OK LED and Power On/Off button.]
  • Page 23: Obtaining The Chassis Serial Number

    [FIGURE 2-4 Server Rear Panel. Callouts: power supply LEDs, Ethernet ports, PCI-E slot, Locator LED/button, SC network management port, Power OK LED, SC serial management port, Service Required LED, DB9 serial port.]
    Obtaining the Chassis Serial Number
    To obtain support for your system, you need your chassis serial number. On the server, the chassis serial number is located on a sticker that is on the front of the server and another sticker at the rear of the server, below the AC power connector.
  • Page 25: Server Diagnostics

    CHAPTER Server Diagnostics
    This chapter describes the diagnostics that are available for monitoring and troubleshooting the server. This chapter does not provide detailed troubleshooting procedures, but instead describes the server diagnostics facilities and how to use them.
  • Page 26 The flow chart assumes that you have already performed some troubleshooting such as verification of proper installation and visual inspection of cables and power, and possibly performed a reset of the server (refer to the Sun Fire T1000 Server Installation Guide and Sun Fire T1000 Server Administration Guide for details).
  • Page 27: Figure 3-1 Diagnostic Flowchart

    [FIGURE 3-1 Diagnostic Flowchart. Numbers in this flow chart correspond to the Action numbers in TABLE 3-1. Recoverable steps: (1) Faulty hardware suspected: are the Power OK and AC OK LEDs off? If so, check the power source and connections. (2) Are any faults reported by the ALOM showfaults command? ...]
  • Page 28: Table 3-1 Diagnostic Flowchart Actions

    Solaris OS. [See: ... on page 3-49]
    • If SunVTS reports a faulty device, replace the FRU. [See: Chapter 5]
    • If SunVTS does not report a faulty device, go to Action No. ...
  • Page 29 TABLE 3-1 Diagnostic Flowchart Actions (Continued)
    Columns: Action; Diagnostic Action; Resulting Action; For more information, see these sections
    Diagnostic Action: Run POST.
    Resulting Action: POST performs basic tests of the server components and reports faulty FRUs. Note - diag_level=min is the default ALOM CMT setting, which tests devices required to boot [references: TABLE 3-5, TABLE 3-6]...
    For more information: Section 3.4, “Running POST” on page 3-22
  • Page 30: Memory Configuration And Fault Handling

    Understanding the underlying features helps you identify and repair memory problems. This section describes how the memory is configured and how the server deals with memory faults.
  • Page 31: Memory Configuration

    3.1.1.1 Memory Configuration
    In the server memory, there are eight slots that hold DDR-2 memory DIMMs in the following DIMM sizes:
    ■ 512 MB (maximum of 4 GB)
    ■ 1 GB (maximum of 8 GB)
    ■ 2 GB (maximum of 16 GB)
    ■...
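The maximums listed above follow from filling all eight slots with DIMMs of one size. A minimal Python sketch of that arithmetic follows; the function name is illustrative only (it is not part of any Sun tool), and it assumes every slot holds the same DIMM size. The truncated text may list further population rules that this sketch does not model.

```python
SLOT_COUNT = 8  # the server has eight DIMM slots


def max_memory_gb(dimm_size_mb, slots=SLOT_COUNT):
    """Total capacity in GB when every slot holds a DIMM of the given size."""
    return dimm_size_mb * slots / 1024


if __name__ == "__main__":
    for size_mb in (512, 1024, 2048):
        print(f"{size_mb} MB DIMMs: {max_memory_gb(size_mb):g} GB maximum")
```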
  • Page 32: Troubleshooting Memory Faults

    These LEDs provide a quick visual check of the state of the system.
    [FIGURE 3-2 LEDs on the Server Front Panel. Callouts: Locator LED/button, Service Required LED, Power OK LED and Power On/Off button.]
  • Page 33: Figure 3-3 Leds On The Server Rear Panel

    [FIGURE 3-3 LEDs on the Server Rear Panel. Callouts: Activity LED, Activity LED, Fault LED, Link LED, Link LED, DC OK LED, Power OK LED, AC OK LED, Service Required LED, Locator LED/button.]
  • Page 34: Front And Rear Panel Leds

    ...Activity LED: Indicates that there is activity on the SC Network Management port.
    SC Network Management Link LED (rear panel, green): Indicates that the server is linked to the SC network management port.
  • Page 35: Power Supply Leds

    3.2.2 Power Supply LEDs
    The power supply LEDs (TABLE 3-3) are located on the back of the power supply.
    TABLE 3-3 Power Supply LEDs
    Name: Fault. Color: Amber. Description:
    • On – Power supply has detected a failure.
    •...
  • Page 36: Figure 3-4 Alom Cmt Fault Management

    FRU replacement or if ALOM CMT was unable to automatically detect the FRU replacement.
    Note – ALOM CMT does not automatically detect hard drive replacement.
  • Page 37: Running Alom Cmt Service-Related Commands

    Many environmental faults can automatically recover. A temperature that is exceeding a threshold might return to normal limits. An unplugged power supply can be plugged in, and so on. Recovery of environmental faults is automatically detected. Recovery events are reported using one of two forms: fru at location is OK.
  • Page 38: Switching Between The System Console And Alom

    • bootscript=string enables the passing of a string to the boot command.
    powercycle [-f]: Performs a poweroff followed by poweron. The -f option forces an immediate poweroff; otherwise, the command attempts a graceful shutdown.
  • Page 39 TABLE 3-4 Service-Related ALOM CMT Commands (Continued)
    poweroff [-y] [-f]: Powers off the host server. The -y option enables you to skip the confirmation question. The -f option forces an immediate shutdown.
    poweron [-c]: Powers on the host server. Using the -c option executes a console command after completion of the poweron command.
  • Page 40: Running The Showfaults Command

    sc> showfaults -v
    Last POST run: TUE FEB 07 18:51:02 2006
    POST status: Passed all devices
    ID FRU Fault
    0 IOBD VOLTAGE_SENSOR at IOBD/V_+1V has exceeded low warning threshold.
  • Page 41: Running The Showenvironment Command

    ■ Example showing a fault that was detected by POST. These kinds of faults are identified by the message “deemed faulty and disabled” and by a FRU name.
    sc> showfaults -v
    ID Time Fault
    1 OCT 13 12:47:27 MB/CMP0/CH0/R1/D0 MB/CMP0/CH0/R1/D0 deemed faulty and disabled
    ■ Example showing a fault that was detected by the PSH technology.
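The distinction above can be sketched as a small text classifier. This is a hedged illustration, not an ALOM CMT feature: the function name and the returned labels are hypothetical, and the "Host detected fault" marker for PSH faults is taken from the showfaults examples elsewhere in this manual. Anything else (for example, the voltage sensor threshold message) is simply reported as "other".

```python
def classify_fault(fault_text):
    """Guess which facility reported a showfaults entry from its message text.

    Returns "post", "psh", or "other". The labels are illustrative only.
    """
    text = fault_text.lower()
    if "deemed faulty and disabled" in text:
        return "post"   # POST disabled the FRU
    if "host detected fault" in text:
        return "psh"    # Solaris Predictive Self-Healing reported it
    return "other"      # for example, an environmental fault


if __name__ == "__main__":
    sample = "MB/CMP0/CH0/R1/D0 deemed faulty and disabled"
    print(classify_fault(sample))
```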
  • Page 42 [showenvironment output, table rules removed]
    MB/I_VCORE 20.560 80.000 88.000
    MB/I_VMEM 8.160 60.000 66.000
    Current sensors:
    Sensor Status
    MB/BAT/V_BAT
    Power Supplies:
    Supply Status Underspeed Overtemp Overvolt Undervolt Overcurrent
  • Page 43: Running The Showfru Command

    sc> Note – Some environmental information might not be available when the server is in Standby mode. 3.3.4 Running the showfru Command The showfru command displays information about the FRUs in the server. Use this command to see information about an individual FRU, or for all the FRUs. Note –...
  • Page 44
    /SPD/Vendor Serial No: d03eb26
    FRU_PROM at MB/CMP0/CH3/R0/D0/SEEPROM
    /SPD/Timestamp: MON OCT 03 12:00:00 2005
    /SPD/Description: DDR2 SDRAM, 2048 MB
    /SPD/Manufacture Location:
    /SPD/Vendor: Infineon (formerly Siemens)
    /SPD/Vendor Part No: 72T256220HR3.7A
    /SPD/Vendor Serial No: d03e620
  • Page 45
    FRU_PROM at MB/CMP0/CH3/R0/D1/SEEPROM
    /SPD/Timestamp: MON OCT 03 12:00:00 2005
    /SPD/Description: DDR2 SDRAM, 2048 MB
    /SPD/Manufacture Location:
    /SPD/Vendor: Infineon (formerly Siemens)
    /SPD/Vendor Part No: 72T256220HR3.7A
    /SPD/Vendor Serial No: d040920
    FRU_PROM at MB/CMP0/CH3/R1/D0/SEEPROM
    /SPD/Timestamp: MON OCT 03 12:00:00 2005
    /SPD/Description: DDR2 SDRAM, 2048 MB
    /SPD/Manufacture Location:
    /SPD/Vendor: Infineon (formerly Siemens)
    /SPD/Vendor Part No:...
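The showfru listing above is a series of FRU_PROM records, each followed by /SPD/ fields. When such output is captured to a file, it can be grouped per FRU with a few lines of Python. The sketch below is an assumption-laden illustration: the function name is hypothetical, and it assumes the console prints one "key: value" field per line as in the sample.

```python
SAMPLE = """FRU_PROM at MB/CMP0/CH3/R0/D1/SEEPROM
/SPD/Timestamp: MON OCT 03 12:00:00 2005
/SPD/Description: DDR2 SDRAM, 2048 MB
/SPD/Manufacture Location:
/SPD/Vendor: Infineon (formerly Siemens)
/SPD/Vendor Part No: 72T256220HR3.7A
/SPD/Vendor Serial No: d040920
"""


def parse_showfru(text):
    """Group '/SPD/...: value' fields under the FRU_PROM record they follow."""
    records = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("FRU_PROM at "):
            current = line[len("FRU_PROM at "):]
            records[current] = {}
        elif current is not None and line.startswith("/") and ":" in line:
            # Split on the first colon only, so timestamps keep their colons.
            key, _, value = line.partition(":")
            records[current][key.strip()] = value.strip()
    return records


if __name__ == "__main__":
    for fru, fields in parse_showfru(SAMPLE).items():
        print(fru, fields["/SPD/Vendor Serial No"])
```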
  • Page 46: Running Post

    The server can be configured for normal, extensive, or no POST execution. You can also control the level of tests that run, the amount of POST output that is displayed, and which reset events trigger POST by using ALOM CMT variables.
  • Page 47: Table 3-5 Alom Cmt Parameters Used For Post Configuration

    TABLE 3-5 lists the ALOM CMT variables used to configure POST and FIGURE 3-5 shows how the variables work together.
    Note – Use the ALOM CMT setsc command to set all the parameters in TABLE 3-5 except setkeyswitch.
    TABLE 3-5 ALOM CMT Parameters Used for POST Configuration
    Parameter Values...
  • Page 48 Values Description
    • POST output displays functional tests with a banner and pinwheel.
    • normal: POST output displays all test and informational messages.
    • POST displays all test, informational, and some debugging messages.
  • Page 49: Figure 3-5 Flowchart Of Alom Cmt Variables For Post Configuration

    [FIGURE 3-5 Flowchart of ALOM CMT Variables for POST Configuration]
  • Page 50: Changing Post Parameters

    The setkeyswitch parameter sets the virtual keyswitch, so it does not use the setsc command. For example, to change the POST parameters using the setkeyswitch command, enter the following:
    sc> setkeyswitch diag
  • Page 51: Reasons To Run Post

    To change the POST parameters using the setsc command, you must first set the setkeyswitch parameter to normal. You can then change the POST parameters using the setsc command:
    sc> setkeyswitch normal
    sc> setsc value
    Example:
    sc> setkeyswitch normal
    sc> setsc diag_mode service
    3.4.3 Reasons to Run POST
    You can use POST for basic hardware verification and diagnosis, and for...
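The ordering rule above (set the virtual keyswitch to normal first, then use setsc) can be captured in a small helper that only builds the command strings to type at the sc> prompt. This is a sketch under stated assumptions: the function name is hypothetical, it does not talk to a system controller, and it does not validate parameter names or values against TABLE 3-5.

```python
def post_config_commands(parameter, value):
    """Return the sc> commands, in order, for changing one POST setting."""
    if parameter == "setkeyswitch":
        # The virtual keyswitch has its own command and does not use setsc.
        return [f"setkeyswitch {value}"]
    # Other parameters: put the keyswitch in normal first, then use setsc.
    return ["setkeyswitch normal", f"setsc {parameter} {value}"]


if __name__ == "__main__":
    for command in post_config_commands("diag_mode", "service"):
        print("sc>", command)
```

Keeping the helper free of any I/O makes it easy to review the exact commands before entering them on a live system controller.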
  • Page 52: Diagnosing The System Hardware

    3. Reset the system so that POST runs. There are several ways to initiate a reset. The following example uses the powercycle command. For other methods, refer to the Sun Fire T1000 Server Administration Guide. sc> powercycle...
  • Page 53
    0:0>
    0:0>@(#) ERIE Integrated POST 4.x.0.build_17 2005/08/30 11:25
    /export/common-source/firmware_re/ontario-fireball_fio/build_17/post/Niagara/erie/integrated (firmware_re)
    0:0>Copyright © 2005 Sun Microsystems, Inc. All rights reserved
    SUN PROPRIETARY/CONFIDENTIAL. Use is subject to license terms.
    0:0>VBSC selecting POST IO Testing.
    0:0>VBSC enabling threads: 1
    0:0>VBSC setting verbosity level 3
    0:0>Start Selftest..
  • Page 54
    Testing Memory Channel 0 Rank 0 Stack 1
    0:0> Testing Memory Channel 3 Rank 0 Stack 1
    0:0>L2 Directory clear
    0:0>L2 Scrub VD & UA
    0:0>L2 Scrub Tags
    0:0>L2 Disable
  • Page 55
    0:0>Address Bitwalk
    0:0> Testing Memory Channel 0 Rank 0 Stack 0
    0:0> Testing Memory Channel 3 Rank 0 Stack 0
    0:0> Testing Memory Channel 0 Rank 0 Stack 1
    0:0> Testing Memory Channel 3 Rank 0 Stack 1
    0:0>Test Slave Threads Basic..
    0:0>Set Mailbox
    0:0>Setup Final DMMU Entries
    0:0>Post Image Region Scrub...
  • Page 56
    0:0>IO-Bridge Quick Read
    0:0>
    0:0>--------------------------------------------------------------
    0:0>--------- IO-Bridge Quick Read Only of CSR and ID ---------------
    0:0>--------------------------------------------------------------
    0:0>fire 1 JBUSID 00000080.0f000000 =
    0:0> fc000002.e03dda23
    0:0>--------------------------------------------------------------
    0:0>fire 1 JBUSCSR 00000080.0f410000 =
    0:0> 00000ff5.13cb7000
    0:0>--------------------------------------------------------------
  • Page 57
    0:0>IO-Bridge unit 1 jbus perf test
    0:0>IO-Bridge unit 1 int init test
    0:0>IO-Bridge unit 1 msi init test
    0:0>IO-Bridge unit 1 ilu init test
    0:0>IO-Bridge unit 1 tlu init test
    0:0>IO-Bridge unit 1 lpu init test
    0:0>IO-Bridge unit 1 link train port B
    0:0>IO-Bridge unit 1 interrupt test
    0:0>IO-Bridge unit 1 Config MB bridges
    0:0>Config port B, bus 2 dev 0 func 0, tag 5714 BRIDGE...
  • Page 58
    0:0>MSG = Pin 3 failed on MB/CMP0/CH0/R1/D0/S0 (J0701)
    0:0>END_ERROR
    0:0>Testing Memory Channel 3 Rank 1 Stack 0
    In this example, POST is reporting a memory error at DIMM location MB/CMP0/CH0/R1/D0 (J0701).
  • Page 59: Correctable Errors Detected By Post

    b. Run the showfaults command to obtain additional fault information. The fault is captured by ALOM, where the fault is logged, the Service Required LED is lit, and the faulty component is disabled. Example:
    ok #.
    sc> showfaults -v
    ID Time Fault
    1 APR 24 12:47:27 MB/CMP0/CH0/R1/D0...
  • Page 60: Correctable Errors For Single Dimms

    3. Reset the system so that POST runs. There are several ways to initiate a reset. The following example uses the powercycle command. For other methods, refer to the Sun Fire T1000 Server Administration Guide. sc> powercycle...
  • Page 61: Determining When To Replace Detected Devices

    3.4.5.2 Determining When to Replace Detected Devices
    Note – This section assumes faults are detected by POST in maximum mode.
    If a detected device is part of a hardware upgrade or repair, or if POST detects multiple DIMMs (CODE EXAMPLE 3-2), replace the detected devices.
    CODE EXAMPLE 3-2 POST Fault for Multiple DIMMs...
  • Page 62: Clearing Post Detected Faults

    ■ If no fault is reported, you do not need to do anything else. Do not perform the subsequent steps.
    ■ If a fault is reported, perform Step 2 through Step ...
  • Page 63: Using The Solaris Predictive Self-Healing Feature

    2. Use the enablecomponent command to clear the fault and remove the component from the ASR blacklist. Use the FRU name that was reported in the fault in the previous step. Example: sc> enablecomponent MB/CMP0/CH0/R1/D0 The fault is cleared and should not appear when you run the showfaults command.
  • Page 64: Identifying Psh Detected Faults

    IMPACT: Total system memory capacity will be reduced as pages are retired.
    REC-ACTION: Schedule a repair procedure to replace the affected memory module. Use fmdump -v -u <EVENT_ID> to identify the module.
  • Page 65: Using The Fmdump Command To Identify Faults

    The following is an example of the ALOM CMT alert for the same PSH diagnosed fault:
    SC Alert: Host detected fault, MSGID: SUN4V-8000-DX
    Note – The Service Required LED also turns on for PSH diagnosed faults.
    3.5.1.1 Using the fmdump Command to Identify Faults
    The fmdump command displays the list of faults detected by the Solaris PSH facility and identifies the faulty FRU for a particular EVENT_ID (UUID).
  • Page 66 2. Use the Sun message ID to obtain more information about this type of fault. a. In a browser, go to the Predictive Self-Healing Knowledge Article web site: http://www.sun.com/msg
  • Page 67 b. Obtain the message ID from the console output or the ALOM CMT showfaults command. c. Enter the message ID in the SUNW-MSG-ID field, and click Lookup. In this example, the message ID SUN4V-8000-DX returns the following information for corrective action: Article for Message ID: SUN4V-8000-DX Correctable memory errors exceeded acceptable levels...
  • Page 68: Clearing Psh Detected Faults

    ■ If no fault is reported, you do not need to do anything else. Do not perform the subsequent steps.
    ■ If a fault is reported, perform Step 2 through Step ...
  • Page 69: Collecting Information From Solaris Os Files And Commands

    3. Run the clearfault command with the UUID provided in the showfaults output:
    sc> clearfault 7ee0e46b-ea64-6565-e684-e996963f7b86
    Clearing fault from all indicted FRUs...
    Fault cleared.
    4. Clear the fault from all persistent fault records. In some cases, even though the fault is cleared, some persistent fault information remains and results in erroneous fault messages at boot time.
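Because clearfault takes the UUID exactly as showfaults prints it, copying it by hand is error-prone. The following Python sketch pulls each UUID out of captured showfaults -v text and builds the matching clearfault command string. The helper name is hypothetical, the sample text is the host-detected fault example from this manual, and the regular expression assumes the "UUID: <value>" layout shown there.

```python
import re

# Matches "UUID: xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx" (hexadecimal digits).
UUID_RE = re.compile(
    r"UUID:\s*([0-9a-f]{8}(?:-[0-9a-f]{4}){3}-[0-9a-f]{12})", re.IGNORECASE
)

SAMPLE = (
    "0 SEP 09 11:09:26 MB/CMP0/CH0/R0/D0 Host detected fault "
    "MSGID: SUN4V-8000-DX UUID: f92e9fbe-735e-c218-cf87-9e1720a28004"
)


def clearfault_commands(showfaults_output):
    """Return one 'clearfault <UUID>' string per distinct UUID, in order."""
    seen = []
    for uuid in UUID_RE.findall(showfaults_output):
        if uuid not in seen:
            seen.append(uuid)
    return [f"clearfault {uuid}" for uuid in seen]


if __name__ == "__main__":
    for command in clearfault_commands(SAMPLE):
        print("sc>", command)
```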
  • Page 70: Viewing System Message Log Files

    In the server, the following components are managed by the ASR feature:
    ■ UltraSPARC T1 processor strands
    ■ Memory DIMMs
    ■ I/O bus
  • Page 71: Displaying System Components

    The database that contains the list of disabled components is called the ASR blacklist (asr-db). In most cases, POST automatically disables a faulty component. After the cause of the fault is repaired (FRU replacement, loose connector reseated, and so on), you must remove the component from the ASR blacklist.
  • Page 72: Disabling Components

    1. At the sc> prompt, enter the disablecomponent command.
    sc> disablecomponent MB/CMP0/CH3/R1/D1
    SC Alert:MB/CMP0/CH3/R1/D1 disabled
    2. After receiving confirmation that the disablecomponent command is complete, reset the server so that the ASR command takes effect.
    sc> reset
  • Page 73: Enabling Disabled Components

    3.7.3 Enabling Disabled Components The enablecomponent command enables a disabled component by removing it from the ASR blacklist. 1. At the sc> prompt, enter the enablecomponent command. sc> enablecomponent MB/CMP0/CH3/R1/D1 SC Alert:MB/CMP0/CH3/R1/D1 reenabled 2. After receiving confirmation that the enablecomponent command is complete, reset the server so that the ASR command takes effect.
  • Page 74: Exercising The System Using Sunvts Software

    Common Desktop Environment (CDE). For more information about the character-based SunVTS TTY interface, and specifically for instructions on accessing it by tip or telnet commands, refer to the SunVTS User’s Guide.
  • Page 75: Using Sunvts Software

    SunVTS software can be run in several modes. This procedure assumes that you are using the default mode. This procedure also assumes that the server is headless, that is, it is not equipped with a monitor capable of displaying bitmap graphics. In this case, you access the SunVTS GUI by logging in remotely from a machine that has a graphics display.
  • Page 76: Figure 3-6 Sunvts Gui

    The test selection area lists tests in categories, such as Network, as shown in FIGURE 3-7. To expand a category, left-click the icon (expand category icon) to the left of the category name.
  • Page 77: Figure 3-7 Sunvts Test Selection Panel

    [FIGURE 3-7 SunVTS Test Selection Panel]
    6. (Optional) Select the tests you want to run. Certain tests are enabled by default, and you can choose to accept these. Alternatively, you can enable and disable individual tests or blocks of tests by clicking the checkbox next to the test name or test category name.
  • Page 78
    ■ Solaris OS Messages (/var/adm/messages) – A file containing messages generated by the operating system and various applications.
    ■ Log Files (/var/opt/SUNWvts/logs) – A directory containing the log files.
  • Page 79: Preparing For Servicing

    CHAPTER Preparing for Servicing
    This chapter describes how to prepare the server for servicing. The following topics are covered:
    ■ Section 4.1, “Common Procedures for Parts Replacement” on page 4-1
    For a list of FRUs, see Appendix A.
    Note –...
  • Page 80: Required Tools

    This command is described in the Solaris system administration documentation.
    5. Switch from the system console prompt to the SC console prompt by issuing the #. (Hash-Period) escape sequence.
    ok #.
    sc>
  • Page 81: Removing The Server From A Rack

    Note – You can also use the Power On/Off button on the front of the server to initiate a graceful system shutdown. Refer to the Sun Fire T1000 Server Administration Guide for more information about the ALOM poweroff command. 4.1.3...
  • Page 82: Figure 4-1 Unlocking A Mounting Bracket

    (FIGURE 4-2). The mounting brackets slide approximately 4 in. (10 cm) farther before disengaging.
    [FIGURE 4-2 Location of the Mounting Bracket Release Buttons]
    7. Set the chassis on a sturdy work surface.
  • Page 83: Performing Electrostatic Discharge (Esd) Prevention Measures

    4.1.4 Performing Electrostatic Discharge (ESD) Prevention Measures
    1. Prepare an antistatic surface to set parts on during removal and installation. Place ESD-sensitive components, such as the printed circuit boards, on an antistatic mat. The following items can be used as an antistatic mat:
    ■ Antistatic bag used to wrap a Sun replacement part
    ■...
  • Page 84: Figure 4-3 Location Of Top Cover Release Button

    [FIGURE 4-3 Location of Top Cover Release Button. Callouts: cover release button, top cover.]
  • Page 85: Replacing Field-Replaceable Units

    CHAPTER Replacing Field-Replaceable Units
    This chapter describes how to remove and replace customer-replaceable field-replaceable units (FRUs) in the server. The following topics are covered:
    ■ Section 5.1, “Replacing the Optional PCI-Express Card” on page 5-2
    ■...
  • Page 86: Replacing The Optional Pci-Express Card

    3. On the rear of the chassis, pull the release lever that secures the PCI-Express card to the chassis (FIGURE 5-1).
    [FIGURE 5-1 Releasing the PCI-Express Card Release Lever. Callouts: PCI-E card, release lever.]
  • Page 87: Installing The Optional Pci-Express Card

    4. Carefully pull the PCI-Express card out of the connector on the PCI-Express card riser board and the note slot (FIGURE 5-2).
    [FIGURE 5-2 Removing and Installing the PCI-Express Card. Callouts: note slot, connector, PCI-E riser board.]
    5. Place the PCI-Express card on an antistatic mat.
    5.1.2 Installing the Optional PCI-Express Card
    Use this procedure to replace the PCI-Express cards.
  • Page 88: Replacing The Fan Tray Assembly

    3. Push in on the clasps on both sides of the fan assembly (FIGURE 5-3).
    [FIGURE 5-3 Removing the Fan Tray Assembly. Callout: fan tray assembly.]
    4. Remove the fan assembly from the sheet metal mounting brackets.
  • Page 89: Installing The Fan Tray Assembly

    5.2.2 Installing the Fan Tray Assembly 1. Unpack the replacement fan tray assembly and place it on an antistatic mat. 2. Align the fan tray assembly with the sheet metal mounting brackets and slide it into place until the clasps on each side lock it into place. 3.
  • Page 90: Installing The Power Supply

    3. Push the fastener down on the front of the power supply to lock it into place in the chassis (FIGURE 5-5).
  • Page 91: Replacing A Hard Drive

    [FIGURE 5-5 Installing the Power Supply. Callouts: power supply, fastener.]
    4. Redress the power cable through the midwall in the chassis and connect the cable to the motherboard.
    5. Perform the procedures described in Chapter 6. At the sc> prompt, issue the showenvironment command to verify the status of the power supply.
  • Page 92: Replacing A Hard Drive In A Single-Drive Assembly

    (FIGURE 5-6)
    3. Pull the fasteners up on the rear of the single-drive assembly and remove the assembly from the chassis (FIGURE 5-6).
    [FIGURE 5-6 Removing the Single-Drive Assembly]
  • Page 93: Figure 5-7 Installing The Single-Drive Assembly

    5.4.1.2 Installing the Hard Drive in a Single-Drive Assembly
    1. Unpack the replacement single-drive assembly.
    2. Slide the single-drive assembly into the chassis until it mates with the front of the chassis (FIGURE 5-7).
    [FIGURE 5-7 Installing the Single-Drive Assembly]
    3.
  • Page 94: Replacing A Hard Drive In A Dual-Drive Assembly

    2. Disconnect the drive cable from the data and power connectors on the motherboard (FIGURE 5-8).
    [FIGURE 5-8 Location of Drive Power and Data Connectors on the Motherboard. Callouts: data connector (J5002), data connector (J5003), power connector.]
  • Page 95: Figure 5-9 Removing The Dual-Drive Assembly

    3. Pull the fasteners up on the rear of the dual-drive assembly and remove the dual-drive assembly from the chassis (FIGURE 5-9).
    [FIGURE 5-9 Removing the Dual-Drive Assembly. Callout: fasteners.]
    4. Determine which of the two hard drives you want to remove. The upper drive (drive 1) is typically the data drive or mirror drive.
  • Page 96: Installing The Hard Drive In A Dual-Drive Assembly

    Ensure that the connector is correctly oriented before plugging it into the data/power connector on the drive.
    3. Slide the drive assembly into the chassis until it mates with the front of the chassis (FIGURE 5-10).
  • Page 97: Figure 5-10 Installing The Dual-Drive Assembly

    [FIGURE 5-10 Installing the Dual-Drive Assembly. Callout: fasteners.]
    4. Push the fasteners down to lock the drive assembly into place in the chassis (FIGURE 5-10).
    5. Redress the cable through the midwall in the chassis.
    6. Route the drive data cables underneath the power supply cable.
    7.
  • Page 98: Replacing Dimms

    1. Perform the procedures described in Chapter
    2. Locate the DIMM that you want to remove.
    Use FIGURE 5-11 and TABLE 5-1 to identify the DIMM that you want to remove.
  • Page 99
    [FIGURE 5-11 DIMM Locations]
    TABLE 5-1 maps the DIMM names that are displayed in faults to the socket numbers that identify the location of the DIMM on the motherboard. The Channel/Rank/DIMM locations (for example, CH0/R0/D0) are silkscreened on the board and on a label near the board.
    TABLE 5-1 DIMM Names and Socket Numbers
    Socket Number...
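A DIMM name reported in a fault, such as MB/CMP0/CH0/R1/D0, encodes the channel, rank, and DIMM numbers that are silkscreened on the board. The Python sketch below splits such a name into its parts and rebuilds the silkscreen label. The class and function names are hypothetical, and the pattern covers only the MB/CMP0/CHx/Ry/Dz form seen in this manual's examples. It does not reproduce the socket-number column of TABLE 5-1, which is truncated here.

```python
import re
from typing import NamedTuple, Optional


class DimmLocation(NamedTuple):
    channel: int
    rank: int
    dimm: int

    def silkscreen(self):
        """Label in the form printed on the board, for example CH0/R0/D0."""
        return f"CH{self.channel}/R{self.rank}/D{self.dimm}"


# Pattern assumed from the fault examples in this manual.
DIMM_RE = re.compile(r"^MB/CMP0/CH(\d+)/R(\d+)/D(\d+)$")


def parse_dimm_name(fru_name: str) -> Optional[DimmLocation]:
    """Return the channel/rank/DIMM numbers, or None if the name is not a DIMM."""
    match = DIMM_RE.match(fru_name.strip())
    if match is None:
        return None
    return DimmLocation(*(int(group) for group in match.groups()))


if __name__ == "__main__":
    location = parse_dimm_name("MB/CMP0/CH0/R1/D0")
    print(location, location.silkscreen())
```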
  • Page 100: Installing Dimms

    7. Run the showfaults -v command to determine how to clear the fault. The method you use to clear a fault depends on how the fault is identified by the showfaults command.
  • Page 101
    ■ If the fault is a host-detected fault (displays a UUID), continue to Step 8. For example:
        sc> showfaults -v
        ID Time Fault
        0 SEP 09 11:09:26 MB/CMP0/CH0/R0/D0 Host detected fault MSGID: SUN4V-8000-DX UUID: f92e9fbe-735e-c218-cf87-9e1720a28004
    ■ If the fault resulted in the FRU being disabled, such as the following,...
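The distinction above (a host-detected fault displays a UUID) can be checked mechanically on captured `showfaults -v` output. The sketch below is illustrative only: `classify_fault` is a hypothetical helper, and it assumes the captured text looks like the example in this manual, with `MSGID:` and `UUID:` labels on the fault line.

```python
import re

# A UUID in the 8-4-4-4-12 hexadecimal form shown in the manual's example.
UUID_RE = re.compile(
    r"UUID:\s*([0-9a-f]{8}(?:-[0-9a-f]{4}){3}-[0-9a-f]{12})", re.IGNORECASE
)
MSGID_RE = re.compile(r"MSGID:\s*(\S+)")


def classify_fault(showfaults_text: str) -> dict:
    """Report whether captured showfaults -v text contains a UUID."""
    uuid_match = UUID_RE.search(showfaults_text)
    msgid_match = MSGID_RE.search(showfaults_text)
    return {
        # Per the manual, a displayed UUID marks a host-detected fault.
        "host_detected": uuid_match is not None,
        "uuid": uuid_match.group(1) if uuid_match else None,
        "msgid": msgid_match.group(1) if msgid_match else None,
    }


# Sample text copied from the example in this manual.
SAMPLE = """sc> showfaults -v
ID Time Fault
0 SEP 09 11:09:26 MB/CMP0/CH0/R0/D0 Host detected fault MSGID: SUN4V-8000-DX UUID: f92e9fbe-735e-c218-cf87-9e1720a28004
"""

if __name__ == "__main__":
    print(classify_fault(SAMPLE))
```

A result with `host_detected` set to `True` corresponds to the first case in the procedure, where you continue to Step 8.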
  • Page 102
        # fmadm faulty
    No memory or DIMM faults should be displayed. If faults are reported, refer to the diagnostics flow chart in FIGURE 3-1 for an approach to diagnose the fault.
  • Page 103 9. Obtain the ALOM CMT sc> prompt. 10. Run the showfaults command. If the fault was detected by the host and the fault information persists, the output will be similar to the following example:
        sc> showfaults -v
        ID Time Fault
        0 SEP 09 11:09:26 MB/CMP0/CH0/R0/D0 Host detected fault MSGID: SUN4V-8000-DX UUID: f92e9fbe-735e-c218-cf87-9e1720a28004...
  • Page 104: Replacing The Motherboard And Chassis

    The location of this SEEPROM is shown in the appendix. 5.6.2 Installing the Motherboard and Chassis: 1. Replace the PCI-Express card. See Section 5.1, “Replacing the Optional PCI-Express Card” on page 5-2.
  • Page 105 2. Replace the fan tray assembly and cable. See Section 5.2, “Replacing the Fan Tray Assembly” on page 5-4. 3. Replace the power supply and cable. See Section 5.3, “Replacing the Power Supply” on page 5-5. 4. Replace the hard drive and cable. See Section 5.4, “Replacing a Hard Drive”...
  • Page 106: Replacing The Clock Battery

    FIGURE 5-12. 5.7.2 Installing the Clock Battery on the Motherboard: 1. Unpack the replacement battery. 2. Press the new battery into the motherboard with the + facing upward (FIGURE 5-13).
  • Page 107: Figure 5-13 Installing The Clock Battery On The Motherboard

    FIGURE 5-13: Installing the Clock Battery on the Motherboard. 3. Perform the procedures described in Chapter [...]. 4. Use the ALOM setdate command to set the day and time. Use the setdate command before you power on the host system. For details about this command, refer to the Advanced Lights Out Management (ALOM) CMT Guide.
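Because the date must be typed at the sc> prompt after a battery replacement, it can help to generate the argument string ahead of time. The sketch below assumes the `mmddHHMMccyy.SS` argument form for setdate; that form is an assumption here and should be confirmed against the ALOM CMT Guide before use. The helper name `format_setdate_argument` is hypothetical.

```python
from datetime import datetime


def format_setdate_argument(moment: datetime) -> str:
    """Format a datetime as mmddHHMMccyy.SS (assumed setdate argument form)."""
    return moment.strftime("%m%d%H%M%Y.%S")


if __name__ == "__main__":
    # Example: 15 January 2007, 14:30:05.
    example = datetime(2007, 1, 15, 14, 30, 5)
    print("sc> setdate " + format_setdate_argument(example))
```

Check in the ALOM CMT Guide whether the service processor expects local time or UTC before choosing the value to pass in.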
  • Page 109: Finishing Up Servicing

    6.1.2 Reinstalling the Server Chassis in the Rack 1. Refer to the Sun Fire T1000 Server Installation Guide for installation instructions. 2. After you have reinstalled the server chassis in the rack, reconnect all cables that you disconnected when you removed the chassis from the rack.
  • Page 110: Applying Power To The Server

    Reconnect the power cord to the power supply. Note – As soon as the power cord is connected, standby power is applied. Depending on the configuration of the firmware, the system might boot.
  • Page 111: Field-Replaceable Units

    APPENDIX: Field-Replaceable Units. FIGURE A-1 shows the locations of the field-replaceable units (FRUs) in the server. TABLE A-1 lists the FRUs. Note that item number 4 in TABLE A-1 and FIGURE A-1 is a 3.5-inch SATA drive used in the single-drive configuration.
  • Page 112: Field-Replaceable Units

    FIGURE A-1: Field-Replaceable Units
  • Page 113: Server Fru List

    TABLE A-1: Server FRU List. Columns: Item No., FRU, Replacement Instructions, Description, Location. FRU: Motherboard and chassis assembly. Replacement instructions: Section 5.6, “Replacing the Motherboard and Chassis” on page 5-20. Description: The motherboard and chassis are replaced as a single assembly. The motherboard is provided in different configurations to accommodate the different processor models (6 core and 8 core).
  • Page 115: Index

    Index: removing, 5-22; components, disabled, 3-47, 3-48; AC OK LED, 3-4; components, displaying the state of, 3-47; Advanced ECC technology, 3-7; connecting to ALOM CMT, 3-13; Advanced Lights Out Management (ALOM) CMT: connecting to, 3-13; console, 3-14; diagnosis and repair of server, 3-11; console command, 3-14, 3-29; POST, and, 3-23; consolehistory command, 3-14...
  • Page 116 5-6; how to run, 3-28; top cover, 6-1; parameters, changing, 3-26; reasons to run, 3-27; installing the server in the rack, 6-1; troubleshooting with, 3-6; Predictive Self-Healing (PSH)
  • Page 117 about, 3-39; exercising the system with, 3-50; clearing faults, 3-44; running, 3-51; memory faults, and, 3-8; tests, 3-53; Sun URL, 3-40; user interfaces, 3-50; PSH detected faults, 3-16; support, obtaining, 3-5; PSH, see also Predictive Self-Healing (PSH), 3-39; syslogd daemon, 3-46; system console, switching to, 3-14; system temperatures, displaying, 3-17; removing...
