Summary of Contents for Sun Microsystems Sun Fire V445 Server
Page 1
Sun Fire V445 Server ™ Administration Guide Sun Microsystems, Inc. www.sun.com Part No. 819-3741-13 September 2007, Revision A Submit comments about this document at: http://www.sun.com/hwdocs/feedback...
Page 2
Copyright 2007 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, California 95054, Etats-Unis. Tous droits réservés. Sun Microsystems, Inc. a les droits de propriété intellectuels relatants à la technologie qui est décrit dans ce document. En particulier, et sans la limitation, ces droits de propriété...
Contents Preface xxvii System Overview 1 Sun Fire V445 Server Overview 1 Processors and Memory 3 External Ports 3 Gigabit Ethernet Ports 3 10BASE-T Network Management Port 4 Serial Management and DB-9 Ports 4 USB Ports 4 RAID 0,1 Internal Hard Drives 5...
Page 4
Network Management Ports 29 Access Through the Network Management Port 30 ALOM 30 Alternative System Console Configuration 31 Accessing the System Console Through a Graphics Monitor 32 About the sc> Prompt 32 Sun Fire V445 Server Administration Guide • September 2007...
Page 5
Access Through Multiple Controller Sessions 34 Ways of Reaching the sc> Prompt 34 About the ok Prompt 35 Entering the ok Prompt 35 Graceful Shutdown 36 ALOM System Controller break or console Command 36 L1-A (Stop-A) Keys or Break Key 37 Externally Initiated Reset (XIR) 37 Manual System Reset 37 About Switching Between the ALOM System Controller and the System...
Page 6
To Power Off the Server Locally 66 Initiating a Reconfiguration Boot 66 To Initiate a Reconfiguration Boot 67 Selecting a Boot Device 69 To Select a Boot Device 70 Configuring Hardware 73 About the CPU/Memory Modules 73 Sun Fire V445 Server Administration Guide • September 2007...
Page 7
DIMMs 74 Memory Interleaving 76 Independent Memory Subsystems 76 DIMM Configuration Rules 77 About the ALOM System Controller Card 77 Configuration Rules 80 About the PCI Cards and Buses 81 Configuration Rules 84 About the SAS Controller 84 About the SAS Backplane 85 Configuration Rules 85 About Hot-Pluggable and Hot-Swappable Components 85 Hard Disk Drives 86...
Page 8
Stop-D Function 111 About Automatic System Restoration 111 Unconfiguring a Device Manually 112 To Unconfigure a Device Manually 112 Reconfiguring a Device Manually 114 To Reconfigure a Device Manually 114 viii Sun Fire V445 Server Administration Guide • September 2007...
Page 9
Enabling the Hardware Watchdog Mechanism and Its Options 114 To Enable the Hardware Watchdog Mechanism and Its Options 115 About Multipathing Software 115 Managing Disk Volumes 117 About Disk Volumes 118 About Volume Management Software 118 VERITAS Dynamic Multipathing 119 Sun StorEdge Traffic Manager 119 About RAID Technology 120 Disk Concatenation 120...
Page 10
What’s New in Diagnostic Operation 158 About the New and Redefined Configuration Variables 158 About the Default Configuration 159 About Service Mode 162 About Initiating Service Mode 163 About Overriding Service Mode Settings 164 Sun Fire V445 Server Administration Guide • September 2007...
Page 11
About Normal Mode 164 About Initiating Normal Mode 165 About the post Command 165 To Initiate Service Mode 167 To Initiate Normal Mode 167 Reference for Estimating System Boot Time (to the ok Prompt) 168 Boot Time Estimates for Typical Configurations 169 Estimating Boot Time for Your System 169 Reference for Sample Outputs 170 Reference for Determining Diagnostic Mode 172...
Page 12
Reset Scenarios 211 Automatic System Restoration User Commands 212 Enabling Automatic System Restoration 212 Disabling Automatic System Restoration 212 To Disable Automatic System Restoration 212 Displaying Automatic System Restoration Information 213 Sun Fire V445 Server Administration Guide • September 2007...
Page 13
About SunVTS 214 SunVTS Software and Security 214 Using SunVTS 215 To Find Out Whether SunVTS Is Installed 216 Installing SunVTS 216 Viewing SunVTS Documentation 216 About Sun Management Center 217 How Sun Management Center Works 218 Using Sun Management Center 219 Other Sun Management Center Features 219 Informal Tracking 219 Hardware Diagnostic Suite 220...
Page 14
Reference for the USB Connectors 239 USB Connector Diagram 239 USB Connector Signals 239 Reference for the Gigabit Ethernet Connectors 240 Gigabit Ethernet Connector Diagram 240 Gigabit Ethernet Connector Signals 241 Sun Fire V445 Server Administration Guide • September 2007...
Page 15
System Specifications 243 Reference for Physical Specifications 244 Reference for Electrical Specifications 244 Reference for Environmental Specifications 245 Reference for Agency Compliance Specifications 246 Reference for Clearance and Service Access Specifications 247 OpenBoot Configuration Variables 249 Index 253 Contents...
Page 16
Sun Fire V445 Server Administration Guide • September 2007...
Page 17
FIGURE 2-3 Separate System Console and System Controller Channels 39 FIGURE 2-4 Patch Panel Connection Between a Terminal Server and a Sun Fire V445 Server 45 FIGURE 2-7 Tip Connection Between a Sun Fire V445 Server and Another Sun System 48...
Page 18
Serial Management Connector Diagram 236 FIGURE A-2 Network Management Connector Diagram 237 FIGURE A-3 Serial Port Connector Diagram 238 FIGURE A-4 USB Connector Diagram 239 FIGURE A-5 Gigabit Ethernet Connector Diagram 241 xviii Sun Fire V445 Server Administration Guide • September 2007...
Page 19
Tables TABLE 1-1 Sun Fire V445 Server Features at a Glance 2 TABLE 1-2 System Status Indicators 11 TABLE 1-3 System Diagnostic Indicators 11 TABLE 1-4 Network Management Port Indicator 19 TABLE 1-5 Ethernet Indicators 21 TABLE 2-1 Ways of Communicating With the System 26...
Page 20
PCI Slot Device Names and Paths TABLE 4-4 Hard Disk Drive Status Indicators 88 TABLE 4-5 Power Supply Status Indicators TABLE 4-6 Fan Tray Status Indicators 93 TABLE 5-1 TABLE 5-2 Sun Fire V445 Server Administration Guide • September 2007...
Page 23
TABLE 6-38 TABLE 6-39 TABLE 8-1 Summary of Diagnostic Tools 152 TABLE 8-2 What ALOM Monitors 154 TABLE 8-3 TABLE 8-4 TABLE 8-5 TABLE 8-6 TABLE 8-7 OpenBoot Configuration Variables That Control Diagnostic Testing and Automatic System Restoration 160 TABLE 8-8 Service Mode Overrides 163 TABLE 8-9 Scenarios for Overriding Service Mode Settings 164...
Page 24
TABLE 8-39 Sun Management Center Features 218 TABLE 9-1 OpenBoot Configuration Variable Settings to Enable Automatic System Restoration 228 TABLE 9-2 TABLE 9-3 TABLE 9-4 TABLE 9-5 TABLE 9-6 TABLE 9-7 xxiv Sun Fire V445 Server Administration Guide • September 2007...
Page 25
TABLE A-1 Serial Management Connector Signals 236 TABLE A-2 Network Management Connector Signals 237 TABLE A-3 Serial Port Connector Signals 238 TABLE A-4 USB Connector Signals 239 TABLE A-5 Gigabit Ethernet Connector Signals 241 TABLE B-1 Dimensions and Weight 244 TABLE B-2 Electrical Specifications 244 TABLE B-3...
Page 26
Sun Fire V445 Server Administration Guide • September 2007...
Solaris™ Operating System (OS). How This Book Is Organized The Sun Fire V445 Server Administration Guide is divided into the following chapters: Chapter 1 presents an illustrated overview of the system and a description of the system’s reliability, availability, and serviceability (RAS) features, as well as new...
Page 28
See one or more of the following for this information: Online documentation for the Solaris OS at docs.sun.com Other software documentation that you received with your system xxviii Sun Fire V445 Server Administration Guide • September 2007...
Typographic Conventions TABLE P-1 Typeface Meaning Examples The names of commands, files, Edit your.login file. AaBbCc123 and directories; on-screen Use ls -a to list all files. computer output % You have mail. What you type, when contrasted AaBbCc123 with on-screen computer output Password: AaBbCc123 Book titles, new words or terms,...
Sun Fire V445 Server Site 819-3745 Printed Shipping kit planning Planning Guide data sheet Online Sun Advanced Lights Out 817-1960 Online Advanced Manager (ALOM) 1.6 Online Lights Out Help Manager (ALOM) system controller xxx Sun Fire V445 Server Administration Guide • September 2007...
Sun is interested in improving its documentation and welcomes your comments and suggestions. You can submit your comments by going to: http://www.sun.com/hwdocs/feedback Please include the title and part number of your document with your feedback: Sun Fire V445 Server Administration Guide, part number 819-3741 Preface xxxi...
Page 32
Sun Fire V445 Server Administration Guide • September 2007...
C H A P T E R System Overview This chapter introduces you to the Sun Fire V445 server and describes its features. The following sections are included: “Sun Fire V445 Server Overview” on page 1 “New Features” on page 7 “Locating Front Panel Features”...
Advanced Lights Out Manager (ALOM) software, which also controls powering on/off and diagnostics. The system also meets ROHS requirements. provides a brief description of the Sun Fire V445 server features. More TABLE 1-1 details on these features are provided in the following subsections.
“DIMMs” on page External Ports The Sun Fire V445 server provides four Gigabit Ethernet ports, one 10BASE-T network management port, two Serial ports, and four USB ports. Gigabit Ethernet Ports The four on-board Gigabit Ethernet ports located on the back panel support several modes of operations at 10, 100, and 1000 megabits per second (Mbps).
Sun Type-6 USB keyboard and mouse. The USB ports are USB 2.0 compliant, and support 480 Mbps, 12 Mbps, and 1.5 Mbps speeds. For additional details, see “About the USB Ports” on page Sun Fire V445 Server Administration Guide • September 2007...
RAID 0,1 Internal Hard Drives Internal disk storage is provided by up to eight 2.5 inch (5.1 cm) high, hot-pluggable, SAS disk drives. The basic system includes a SAS disk backplane that accommodates eight disks capable of data transfer rates of up to 320 megabytes per second. See “About the Internal Disk Drives”...
ALOM System Controller Card The Sun ALOM system controller card enables system management and administration for the Sun Fire V445 server over a serial line or an Ethernet network. The ALOM system controller provides remote system administration for geographically distributed or physically inaccessible systems. These features include powering on/off the system and enabling diagnostics.
Sun systems to accurately predict component failures and mitigate many serious problems before they actually occur. This technology is incorporated into both the hardware and software of the Sun Fire V445 server. At the heart of the Predictive Self-Healing capabilities is the Solaris™ Fault Manager, a service that receives data relating to hardware and software errors, and automatically and silently diagnoses the underlying problem.
Page 40
Automated diagnosis of CPU, memory, and I/O fault events during runtime (Solaris 10 and subsequent compatible versions of Solaris OS) Dynamic FRUID support of environmental events Software readable chassis serial number for asset management Sun Fire V445 Server Administration Guide • September 2007...
Locating Front Panel Features The illustration below shows the system features that you can access from the front panel. SAS disk drives (8) Status Indicators/control panel USB ports Removable media drive Front Panel Features FIGURE 1-1 For information about front panel controls and indicators, see “Front Panel Indicators”...
Front Panel System Status Indicators FIGURE 1-2 Each system status indicator has a corresponding indicator on the back panel. Sun Fire V445 Server Administration Guide • September 2007...
Listed from left to right, the system status indicators operate as described in the following table. System Status Indicators TABLE 1-2 Icon Name Description Locator This white indicator is lit by a Solaris command, Sun Management Center command, or ALOM commands to help you locate the system.
Power Button Location FIGURE 1-3 USB Ports The Sun Fire V445 server has four Universal Serial Bus (USB) ports: two on the front panel, and two on the back panel. All four USB ports comply with the USB 2.0 specification.
Internal Disk Drives” on page Removable Media Drive The Sun Fire V445 server has a DVD-ROM drive in a removable media bay. This drive also has DVD-RW and CD-RW capabilities. Sun Fire V445 Server Administration Guide • September 2007...
Removable media drive Removable Media Drive Location FIGURE 1-6 For more information about servicing the DVD-ROM drive, see the Sun Fire V445 Server Service Manual. Chapter 1 System Overview...
Locating Back Panel Features The illustration below shows the system features that are accessible from the back panel. Power PCIe card slots supplies PCI-X card slots System status External indicators ports Sun Fire V445 Server Administration Guide • September 2007...
“About the Power Supplies” on page PCI Slots The Sun Fire V445 server has four PCIe slots and four PCI-X slots. (One of the PCI-X slots is occupied by the LSI Logic 1068X SAS controller.) These are labeled on the back panel.
PCI4 PCI2 PCI Slot Locations FIGURE 1-8 For more information about how to install a PCI card, see the Sun Fire V445 Server Service Manual. For more information about PCI cards, see “About the PCI Cards and Buses” on page...
System Controller Ports There are two system controller ports. Both use an RJ-45 connector. Network management port Serial management port (NET MGT) (SER MGT) Network and Serial Management Port Locations FIGURE 1-9 Network Management Port This port provides direct network access to the ALOM system controller, when configured, and can access the ALOM prompt and system console output.
There are two USB ports on the back panel. These comply with the USB 2.0 specification. For more information about the USB ports, see “About the USB Ports” on page Gigabit Ethernet Ports The Sun Fire V445 server has four Gigabit Ethernet ports. Sun Fire V445 Server Administration Guide • September 2007...
NET2 NET3 NET0 NET1 Gigabit Ethernet Port Locations FIGURE 1-11 Each Gigabit Ethernet port has a corresponding status indicator, described in TABLE 1-5 Ethernet Indicators TABLE 1-5 Color Description (None) No connection present. Green This indicates a 10/100 Megabit Ethernet connection. The indicator blinks to indicate network activity.
Reliability, Availability, and Serviceability (RAS) Features The Sun Fire V445 server provides the following RAS features: Hot-pluggable disk drives Redundant, hot-swappable power supplies, fan trays, and USB components Sun ALOM system controller with SSH connections for all remote monitoring and...
With Sun Cluster software installed, other nodes in the cluster will automatically take over and assume the workload when a node goes down. The software delivers predictability and fast recovery capabilities through features such as local application restart, individual application failover, and local network adapter failover.
Page 56
Sun Fire V445 Server Administration Guide • September 2007...
Configuring the System Console This chapter explains what the system console is, describes the different ways of configuring it on a Sun Fire V445 server, and helps you understand its relation to the system controller. Tasks covered in this chapter include: “Entering the ok Prompt”...
“Accessing the System Console With an Alphanumeric Terminal” on page 53 • “To Verify Serial Port Settings on TTYB” on page 55 • “Reference for System Console OpenBoot Configuration Variable Settings” on page 59 Sun Fire V445 Server Administration Guide • September 2007...
Ways of Communicating With the System (Continued) TABLE 2-1 During After Devices Available for Accessing the System Console Installation Installation A tip line attached to the serial management port (SERIAL MGT) or TTYB. See: • “Using the Serial Management Port” on page 41 •...
You also must ensure that the system console is directed to the appropriate port on the Sun Fire V445 server’s back panel – generally, the one to which your hardware console device is attached. (See .) You do this by setting the input- FIGURE 2-1 device and output-device OpenBoot configuration variables.
Using a Tip line might be preferable to connecting an alphanumeric terminal, since the tip command allows you to use windowing and OS features on the machine being used to connect to the Sun Fire V445 server. Although the Solaris OS sees the serial management port as TTYA, the serial management port is not a general-purpose serial port.
USB overcurrent monitoring and alerts Hot-plug configuration change monitoring and alerts Dynamic FRU ID data transactions For more information about ALOM software, see “About the ALOM System Controller Card” on page Sun Fire V445 Server Administration Guide • September 2007...
Alternative System Console Configuration In the default configuration, system controller alerts and system console output appear interspersed in the same window. After initial system installation, you can redirect the system console to take its input from and send its output to the serial port TTYB, or to a graphics card’s port.
About the sc> Prompt The ALOM system controller runs independently of the Sun Fire V445 server and regardless of system power state. When you connect a Sun Fire V445 server to AC power, the ALOM system controller immediately starts up, and begins monitoring the system.
Page 65
Note – When you access the ALOM system controller for the first time, it forces you to create a user name and password for subsequent access. After this initial configuration, you will be prompted to enter a user name and password every time you access the ALOM system controller.
“Using the Serial Management Port” on page You can log in directly to the ALOM system controller using a connection through the network management port. See “Activating the Network Management Port” on page Sun Fire V445 Server Administration Guide • September 2007...
A synopsis of run levels follows. For a full description, see the Solaris system administration documentation. Most of the time, you operate a Sun Fire V445 server at run level 2 or run level 3, which are multiuser states with access to full system and network resources.
“Powering Off the System Remotely” on page 64 ALOM System Controller break or console Command Typing break from the sc> prompt forces a running Sun Fire V445 server to move to OpenBoot firmware control. If the OS is already halted, you can use the console command instead of break to reach the ok prompt.
When it is impossible or impractical to shut down the system gracefully, you can get to the ok prompt by typing the L1-A (Stop-A) key sequence from a Sun keyboard, or, if you have an alphanumeric terminal attached to the Sun Fire V445 server, by pressing the Break key.
Caution – When you access the ok prompt from a functioning Sun Fire V445 server, you are suspending the Solaris OS and placing the system under firmware control.
System Console NET MGT or SERIAL MGT Port console sc> ALOM System Controller Separate System Console and System Controller Channels FIGURE 2-3 If the system console is configured to be accessible from the serial management and network management ports, when you connect through one of these ports you can access either the ALOM command-line interface or the system console.
“About the ok Prompt” on page Caution – Dropping the Sun Fire V445 server to the ok prompt suspends all application and OS software. After you issue firmware commands and run firmware-based tests from the ok prompt, the system might not be able to resume where it left off.
4. Refer to for instructions. TABLE 2-3 Ways of Accessing the ok Prompt TABLE 2-3 Access Method What to Do Graceful shutdown of • From a shell or command tool window, issue an appropriate the Solaris OS command (for example, the shutdown or init command) as described in Solaris system administration documentation.
IP address using the Dynamic Host Configuration Protocol (DHCP) from another server. Data centers frequently devote a separate subnet to system management. If your data center has such a configuration, connect the network management port to this subnet. Sun Fire V445 Server Administration Guide • September 2007...
Note – The network management port is a 10BASE-T port. The IP address assigned to the network management port is a unique IP address, separate from the main Sun Fire V445 server IP address, and is dedicated for use only with the ALOM system controller.
1. Complete the physical connection from the serial management port to your terminal server. The serial management port on the Sun Fire V445 server is a data terminal equipment (DTE) port. The pinouts for the serial management port correspond with the pinouts for the RJ-45 ports on the Serial Interface Breakout Cable supplied by Cisco for use with the Cisco AS2511-RJ terminal server.
If the pinouts for the serial management port do not correspond with the pinouts for the RJ-45 ports on the terminal server, you need to make a crossover cable that takes each pin on the Sun Fire V445 server serial management port to the corresponding pin in the terminal server’s serial port.
2. Open a terminal session on the connecting device, and type: TABLE 5 % telnet IP-address-of-terminal-server port-number For example, for a Sun Fire V445 server connected to port 10000 on a terminal server whose IP address is 192.20.30.10, you would type: TABLE 6 % telnet 192.20.30.10 10000...
Accessing the System Console With a Tip Connection This procedure assumes that you are accessing the Sun Fire V445 server system console by connecting the serial port of another Sun system to the serial management port (SERIAL MGT) of the Sun Fire V445 server (...
(typically TTYB) and the serial management port on the back panel of the Sun Fire V445 server. Pinouts, part numbers, and other details about the serial cable and adapter are provided in the Sun Fire V445 Server Parts Installation and Removal Guide.
Table 2-12 connected The shell tool is now a Tip window directed to the Sun Fire V445 server through the Sun system’s serial port. This connection is established and maintained even when the Sun Fire V445 server is completely powered off or just starting up.
Page 82
If you have redirected the system console to TTYB and want to change the system console settings back to use the serial management and network management ports, see: “Reference for System Console OpenBoot Configuration Variable Settings” on page 59 Sun Fire V445 Server Administration Guide • September 2007...
Modifying the /etc/remote File This procedure might be necessary if you are accessing the Sun Fire V445 server using a Tip connection from a Sun system running an older version of the Solaris OS software. You might also need to perform this procedure if the /etc/remote file on the Sun system has been altered and no longer contains an appropriate hardwire entry.
Page 84
/dev/ttyb with /dev/ttya. The /etc/remote file is now properly configured. Continue establishing a Tip connection to the Sun Fire V445 server system console. See: “Accessing the System Console With a Tip Connection” on page 47...
Accessing the System Console With an Alphanumeric Terminal This procedure assumes that you are accessing the Sun Fire V445 server system console by connecting the serial port of an alphanumeric terminal to the serial management port (SERIAL MGT) of the Sun Fire V445 server.
You can issue system commands and view system messages using the alphanumeric terminal. Continue with your installation or diagnostic procedure, as needed. When you are finished, type the alphanumeric terminal’s escape sequence. Sun Fire V445 Server Administration Guide • September 2007...
Note – The serial management port always operates at 9600 baud, 8 bits, with no parity and 1 stop bit. You must be logged in to the Sun Fire V445 server, and the server must be running Solaris OS software.
1. Install the graphics card into an appropriate PCI slot. Installation must be performed by a qualified service provider. For further information, see the Sun Fire V445 Server Installation Guide or contact your qualified service provider. 2. Attach the monitor’s video cable to the graphics card’s video port.
Page 89
4. Connect the USB keyboard cable to any USB port on the Sun Fire V445 server front panel. 5. Connect the USB mouse cable to any USB port on the Sun Fire V445 server front panel. Chapter 2 Configuring the System Console...
Page 90
Continue with your installation or diagnostic procedure, as needed. If you want to redirect the system console back to the serial management and network management ports, see: “Reference for System Console OpenBoot Configuration Variable Settings” on page Sun Fire V445 Server Administration Guide • September 2007...
Reference for System Console OpenBoot Configuration Variable Settings The Sun Fire V445 system console is directed to the serial management and network management ports (SERIAL MGT and NET MGT) by default. However, you can redirect the system console to the serial DB-9 port (TTYB), or to a local graphics monitor, keyboard, and mouse.
Page 92
Sun Fire V445 Server Administration Guide • September 2007...
C H A P T E R Powering On and Powering Off the System This chapter describes how to power on and power off the system, and how to initiate a reconfiguration boot. This chapter explains the following tasks: “Powering On the Server Remotely” on page 62 “Powering On the Server Locally”...
Tip connection to the Sun Fire V445 server. See Chapter 2 for more information about connecting the Sun Fire V445 server to a terminal or similar device. Do not use this power-on procedure if you have just added any new internal option or external storage device, or if you have removed a storage device without replacing it.
Powering On the Server Locally Do not use this power-on procedure if you have just added any new internal option or external storage device, or if you have removed a storage device without replacing it. To power on the system under those circumstances, you must initiate a reconfiguration boot.
Tip connection to the Sun Fire V445 server. See Chapter 2 for more information about connecting the Sun Fire V445 server to a terminal or similar device. You can power off the system remotely either from the ok prompt or from the ALOM system controller sc>...
Caution – Applications running on the Solaris OS can be adversely affected by a poorly executed system shutdown. ensure that you stop and exit applications, and shut down the OS before powering off the server. For more information, see: “About Communicating With the System” on page 26 “About the ok Prompt”...
OS is able to recognize newly installed devices. In addition, if you remove any device and do not install a replacement device prior to rebooting the system, you must perform a reconfiguration boot for Sun Fire V445 Server Administration Guide • September 2007...
Tip connection to the Sun Fire V445 server. See Chapter 2 for more information about connecting the Sun Fire V445 server to a terminal or similar device. This procedure assumes that you are accessing the system console using the serial management or network management port.
Page 100
POST and OpenBoot Diagnostics tests being performed. For more information about OpenBoot configuration variables, see Appendix The system front panel LED indicators provide power-on status information. For information about the system indicators, see: Sun Fire V445 Server Administration Guide • September 2007...
Gigabit Ethernet interface. Before you can select a boot device, you must complete system installation according to the instructions in the Sun Fire V445 Server Installation Guide. This procedure assumes that you are familiar with the OpenBoot firmware and that you know how to enter the OpenBoot environment.
Page 102
PCI device. For more information about using the OpenBoot firmware, refer to the OpenBoot 4.x Command Reference Manual in the OpenBoot Collection AnswerBook for your specific Solaris release. Sun Fire V445 Server Administration Guide • September 2007...
Page 103
Chapter 3 Powering On and Powering Off the System...
Page 104
Sun Fire V445 Server Administration Guide • September 2007...
For instructions on preparing the system for servicing and procedures to install and remove the server components described in this chapter, refer to the Sun Fire V445 Server Service Manual. Topics in this chapter include: “About the CPU/Memory Modules” on page 73 “About the ALOM System Controller Card”...
Note – CPU/Memory modules on a Sun Fire V445 server are not hot-pluggable or hot-swappable. The UltraSPARC IIIi processor is a high-performance, highly integrated superscalar processor implementing the SPARC V9 64-bit architecture. The UltraSPARC IIIi processor can support both 2D and 3D graphics, as well as image processing, video compression and decompression, and video effects through the sophisticated Visual Instruction Set extension (Sun VIS software).
DIMM group 1 DIMM group 0 Memory Module Groups 0 and 1 FIGURE 4-1 lists the DIMMs on the CPU/Memory module, and to which group each TABLE 4-1 DIMM belongs. Memory Module Groups 0 and 1 TABLE 4-1 Label Group Physical Group B1/D1 1 (must be installed as a pair)
V445 Server Installation Guide and the Sun Fire V445 Server Service Manual. For guidelines and complete instructions on how to install and identify DIMMs in a CPU/Memory module, refer to the Sun Fire V445 Server Service Manual and the Sun Fire V445 Server Installation Guide.
Card The Sun Advanced Lights Out Manager (ALOM) system controller card enables access, monitoring, and control of the Sun Fire V445 server from a remote location. It is a fully independent processor card with its own resident firmware, self- diagnostics, and OS.
The ALOM system controller card features serial and 10BASE-T Ethernet interfaces that provide multiple ALOM system controller software users with simultaneous access to the Sun Fire V445 server. ALOM system controller software users are provided secure password-protected access to the system’s Solaris and OpenBoot console functions.
Page 111
Caution – Although access to the ALOM system controller through the network management port is secure, access through the serial management port is not secure. Therefore, avoid connecting a serial modem to the serial management port. Note – The ALOM system controller serial management port (labeled SERIAL MGT) and network management port (labeled NET MGT) are present in the Solaris OS device tree as /dev/ttya, and in the OpenBoot configuration variables as ttya.
The serial management port on the ALOM system controller cannot be used as a conventional serial port. If your configuration requires a standard serial connection, use the DB-9 port labeled “TTYB” instead. Sun Fire V445 Server Administration Guide • September 2007...
PCI bus characteristics and maps each bus to its associated TABLE 4-2 bridge chip, integrated devices, and PCI card slots. All slots comply with PCI Local Bus Specification Revision 2.2. Note – PCI cards in a Sun Fire V445 server are not hot-pluggable or hot-swappable. Chapter 4 Configuring Hardware...
Page 114
** Internal SAS Controller Card Expansion Connector not in use at time of this release *** Slot Consumed by the SAS1068 Disk Controller shows the PCI card slots on the motherboard. FIGURE 4-4 Sun Fire V445 Server Administration Guide • September 2007...
PCI0 PCI6 PCI7 PCI1 PCI3 PCI5 PCI4 PCI2 PCI Slots FIGURE 4-4 lists the device name and path for the eight PCI slots. TABLE 4-3 PCI Slot Device Names and Paths TABLE 4-3 PCI Slot PCIe Bus Device Name an d Base Path (not full path) PCIe Slot 0 /pci@1e,600000/pci@0 PCIe Slot 1...
3) they each run at 100 MHz. 133-MHz operation is only possible when only one slot is populated with one PCI-X 133-MHz capable card. For information about installing or removing PCI cards, see the Sun Fire V445 Server Service Manual.
About the SAS Backplane The Sun Fire V445 server includes a single SAS backplane with connections for up to eight internal hard disk drives, all of which are hot-pluggable. The SAS disk backplane accepts eight, low-profile (2.5-inch), SAS disk drives. Each hard disk drive is connected to the backplane with a standard SAS hot-plug disk connector, which makes it easy to add or remove hard disk drives from the system.
For more information about the disk drives, see “About the Internal Disk Drives” on page 87. For general hard disk hot-plug procedures, refer to the Sun Fire V445 Server Service Manual. For procedures to perform a hard disk hot-plug operation on mirrored and nonmirrored disks, see “Performing a Mirrored Disk Hot-Plug...
Server Service Manual. System Fan Trays For procedues on removing and installing fan trays, refer to the Sun Fire V445 Server Service Manual. Caution – At least one fan must remain operational in each of the three pairs of fan trays to maintain adequate system cooling.
The hot-plug feature of the system’s internal hard disk drives enables you to add, remove, or replace disks while the system continues to operate. This capability significantly reduces system downtime associated with hard disk drive replacement. Sun Fire V445 Server Administration Guide • September 2007...
AC power is brought to each power supply with a power cord (total of four power cords). The Sun Fire V445 server’s power supplies are modular, hot-swappable units. They are customer replacable units (CRUs) designed for fast, easy installation or removal, even while the system is fully operational.
AC power is available, regardless of system power state. Note – If a power supply is faulty, the system Service Required indicator is also lit. “Front Panel Indicators” on page 10 for more information. Sun Fire V445 Server Administration Guide • September 2007...
Check the Service Required indicators to verify which power supply has failed. The failed power supply causes the amber system Service Required indicator and power supply Service Required indicator to light. To complete this procedure, refer to the Sun Fire V445 Server Service Manual. Chapter 4 Configuring Hardware...
For FIGURE 4-7 each fan in the system, the environmental monitoring subsystem monitors fan speed in revolutions per minute. Sun Fire V445 Server Administration Guide • September 2007...
System Fan Trays and Fan Indicators FIGURE 4-7 Refer to these indicators to determine which fan tray needs to be replaced. provides a description of the fan tray indicators. TABLE 4-6 Fan Tray Status Indicators TABLE 4-6 Indicator Color Notes Power/OK Green This indicator is lit when the system is running and the...
The minimum system configuration requires at least one fan operating per redundant pair. Note – For instructions on how to remove and install fan trays, refer to the Sun Fire V445 Server Service Manual. Sun Fire V445 Server Administration Guide • September 2007...
About the USB Ports The system front and back panels provide two external Universal Serial Bus (USB) ports on two independent controllers to connect USB peripheral devices such as: Sun Type-6 USB keyboard Sun opto-mechanical three-button USB mouse Modems Printers Scanners Digital cameras The USB ports are compliant with the Open Host Controller Interface (Open HCI)
About the Serial Ports The default console connection to the Sun Fire V445 server is through the RJ-45 serial management port (labeled SERIAL MGT) on the back panel of the ALOM system controller card. This port operates only at 9600 baud.
C H A P T E R Managing RAS Features and System Firmware This chapter describes how to manage reliability, availability, and serviceability (RAS) features and system firmware, including Sun Advanced Lights Out Manager (ALOM) system controller, automatic system restoration (ASR), and the hardware watchdog mechanism.
Sun Fire V445 hardware is designed to support hot-plugging of internal disk drives. By using the proper software commands, you can install or remove these components while the system is running. The server also supports hot-swapping of Sun Fire V445 Server Administration Guide • September 2007...
Sun Advanced Lights Out Manager (ALOM) system controller is a secure server management tool that comes preinstalled on the Sun Fire V445 server, in the form of a module with preinstalled firmware. It lets you monitor and control your server over a serial line or over a network.
Environmental Monitoring and Control The Sun Fire V445 server features an environmental monitoring subsystem that protects the server and its components against: Extreme temperatures Lack of adequate airflow through the system Operating with missing or misconfigured components Power supply failures...
detects the failure and generates an error message to the system console, logs the message in the /var/adm/messages file, and lights the Service Required indicators. The power subsystem is monitored in a similar fashion. Polling the power supply status periodically, the monitoring subsystem indicates the status of each supply’s DC outputs, AC inputs, and presence.
Chapter 9 Support for RAID Storage Configurations By attaching one or more external storage devices to the Sun Fire V445 server, you can use a redundant array of independent disks (RAID) software application such as Solstice DiskSuite™ to configure system disk storage in a variety of different RAID levels.
(striping with interleaved parity). You choose the appropriate RAID configuration based on the price, performance, reliability, and availability goals for your system. You can also configure one or more disk drives to serve as “hot spares” to fill in automatically in the event of a disk drive failure. In addition to software RAID configurations, you can set up a hardware RAID 1 (mirroring) configuration for any pair of internal disk drives using the SAS controller, providing a high-performance solution for disk drive mirroring.
The ALOM system controller command prompt (sc>) provides you with a way of interacting with the system controller. For more information about the sc> prompt, see “About the sc> Prompt” on page 32 Sun Fire V445 Server Administration Guide • September 2007...
Note – There is no default password. You must assign a password during initial system configuration. For more information, see your Sun Fire V445 Server Installation Guide and Sun Advanced Lights Out Manager (ALOM) Online Help. Caution – In order to provide optimum system security, best practice is to change the default system login name and password during initial setup.
The scadm utility sends its output to stdout. You can also use scadm in scripts to manage and configure ALOM from the host system. For more information about the scadm utility, refer to the following: scadm man page Sun Advanced Lights Out Manager (ALOM) Online Help Sun Fire V445 Server Administration Guide • September 2007...
Viewing Environmental Information Use the showenvironment command to view environment information. To View Environmental Information 1. Log in to the ALOM system controller. 2. Use the showenvironment command to display a snapshot of the server’s environmental status. TABLE 5-5 sc> showenvironment =============== Environmental Status =============== ------------------------------------------------------------------------------ System Temperatures (Temperatures in Celsius):...
Locator LED is on. 2. To turn off the Locator indicator, do one of the following: In Solaris, log in as root and type: TABLE 5-8 # /usr/sbin/locator -f Locator LED is off. Sun Fire V445 Server Administration Guide • September 2007...
Specifically, the Stop-N, Stop-D, and Stop-F commands that were available on systems with non-USB keyboards are not supported on systems that use USB keyboards, such as the Sun Fire V445 server. If you are familiar with the earlier (non-USB) keyboard functionality, this section describes the analogous OpenBoot emergency procedures available in newer systems that use USB keyboards.
Expires TUE FEB 18 18:44:41 2003 This command resets the default OpenBoot configuration variables. 3. To reset the system, type: TABLE 5-13 sc> reset Are you sure you want to reset the system [y/n]? sc> console Sun Fire V445 Server Administration Guide • September 2007...
4. To view console output as the system boots with default OpenBoot configuration variables, switch to console mode. TABLE 5-14 sc> console 5. Type set-defaults to discard any customized IDPROM values and to restore the default settings for all OpenBoot configuration variables. Stop-F Function The Stop-F function is not available on systems with USB keyboards.
On-board Ethernet controllers net0, net1,net2,net3 SAS controller ob-scsi pci0, ... pci7 PCI slots 0 – 7 All PCI slots pci-slot* All on-board PCI devices (on-board pci* Ethernet, SAS) and all PCI slots Sun Fire V445 Server Administration Guide • September 2007...
Page 145
Device Identifiers and Devices (Continued) TABLE 5-15 Device Identifiers Devices PCI bridge chips 0 and 1, respecti hba8, hba9 USB devices usb0, ..., usb4 All devices You can determine full physical device paths by typing: ok show-devs The show-devs command lists the system devices and displays the full path name of each device.
Enabling the Hardware Watchdog Mechanism and Its Options For background information about the hardware watchdog mechanism and related externally initiated reset (XIR) functionality, see: “Hardware Watchdog Mechanism and XIR” on page 102 Sun Fire V445 Server Administration Guide • September 2007...
To Enable the Hardware Watchdog Mechanism and Its Options 1. Edit the /etc/system file to include the following entry: set watchdog_enable = 1 2. To obtain the ok prompt, type: TABLE 5-16 # init 0 3. Reboot the system so that the changes can take effect. 4.
Page 148
For the Sun Fire V445 server, three different types of multipathing software are available: Solaris IP Network Multipathing software provides multipathing and load-balancing capabilities for IP network interfaces.
C H A P T E R Managing Disk Volumes This chapter describes redundant array of independent disks (RAID) concepts, how to manage disk volumes, and how to configure hardware mirroring using the SAS controller. This chapter contains the following sections: “About Disk Volumes”...
About Volume Management Software Volume management software lets you create disk volumes. Sun Microsystems offers two different volume management applications for use on the Sun Fire V445 server: Solaris Volume Manager software VERITAS Volume Manager software Sun’s volume management applications offer the following features:...
Sun StorEdge Traffic Manager A newer alternative to DMP that is also supported by the Sun Fire V445 server is Sun StorEdge Traffic Manager software. Sun StorEdge Traffic Manager is a server- based dynamic path failover software solution, used to improve the overall availability of business applications.
RAID 0: Disk Striping or Intergated Stripe (IS) Disk striping, Integrated Stripe (IS), or RAID 0 is a technique for increasing system throughput by using several disk drives in parallel. In nonstriped disks the OS writes a single block to a single disk. In a striped arrangement, each block is divided and portions of the data are written to different disks simultaneously.
Member disk devices are invisible to the operating system, and are accessed only by the SAS controller. Note – The Sun Fire V445 server’s on-board controller can configure as many as two RAID sets. Prior to volume creation, ensure that the member disks are available and that there are not two sets already created.
Caution – Creating RAID volumes using the on-board controller destroys all data on the member disks. The disk controller’s volume initialization procedure reserves a portion of each physical disk for metadata and other internal information used by the controller. Once the volume initialization is complete, you can configure the volume and label it using format(1M).
Names” on page 123 To Create a Hardware Disk Mirror 1. To verify that a hardware disk mirror does not already exist, type: TABLE 6-2 # raidctl No RAID volumes found. Sun Fire V445 Server Administration Guide • September 2007...
Page 157
The example indicates that no RAID volume exists. In another case: TABLE 6-3 # raidctl RAID Volume RAID RAID Disk Volume Type Status Disk Status ------------------------------------------------------ c0t4d0 c0t5d0 c0t4d0 The example indicates a hardware mirror has degraded at disk c1t2d0. Note –...
One alternate medium is a network installation image in single-user mode (refer to the Solaris 10 Installation Guide for information about configuring and using network-based installations). Sun Fire V445 Server Administration Guide • September 2007...
To Create a Hardware Mirrored Volume of the Default Boot Device 1. Determine which disk is the default boot device From the OpenBoot ok prompt, type the printenv command, and if necessary the devalias command, to identify the default boot device. For example: TABLE 6-8 ok printenv boot-device boot-device =...
Creating RAID volume c0t1d0 will destroy all data on member disks, proceed (yes/no)? yes Volume ’c0t1d0’ created When you create a RAID striped volume, the other member drives (in this case, c0t2d0 and c0t3d0) disappear from the Solaris device tree. Sun Fire V445 Server Administration Guide • September 2007...
As an alternative, you can use the –f option to force the creation if you are sure of the member disks, and sure that the data on all other member disks can be lost. For example: TABLE 6-14 # raidctl -f -c -r 0 c0t1d0 c0t2d0 c0t3d0 Volume ’c0t1d0’...
Page 162
- read and display labels save - save new disk/partition definitions inquiry - show vendor, product and revision volname - set 8-character volume name !<cmd> - execute <cmd>, then return quit Sun Fire V445 Server Administration Guide • September 2007...
Page 163
Caution – If a RAID Volume is created using the on-board controller and a disk drive in the volume set is removed without deleting the RAID Volume, the disk will not be useable in the Solaris Operating System unless special procedures are followed.
Verify which disk drive corresponds with which logical device name and physical device name. See: “About Physical Disk Slot Numbers, Physical Device Names, and Logical Device Names” on page 123 Sun Fire V445 Server Administration Guide • September 2007...
To Delete a Hardware Disk Mirror 1. Determine the name of the mirrored volume. Type: TABLE 6-21 # raidctl RAID RAID RAID Disk Volume Status Disk Status ------------------------------------ c1t0d0 c1t0d0 c1t1d0 In this example, the mirrored volume is c1t0d0 Note – The logical device names might appear differently on your system, depending on the number and type of add-on disk controllers installed.
# raidctl RAID RAID RAID Disk Volume Status Disk Status ---------------------------------------- c1t1d0 DEGRADED c1t1d0 c1t2d0 DEGRADED This example indicates that the disk mirror has degraded due to a failure in disk c1t2d0. Sun Fire V445 Server Administration Guide • September 2007...
Page 167
There is no need to issue a software command to bring the drive offline when the drive has failed and the OK-to-Remove indicator is lit. 3. Install a new disk drive, as described in the Sun Fire V445 Server Service Manual. The RAID utility automatically restores the data to the disk.
Sun Fire V445 Server Administration Guide • September 2007...
Page 169
This warning is issued because these commands attempt to quiesce the SAS bus, but the Sun Fire V445 server firmware prevents it. This warning message can be safely ignored in the Sun Fire V445 server, but the following procedure avoids this warning message altogether.
The corresponding disk drive OK-to-Remove indicator is lit. 3. Remove the disk drive, as described in the Sun Fire V445 Server Parts Installation and Removal Guide. The blue OK-to-Remove indicator goes out when you remove the disk drive.
Page 171
4. Install a new disk drive, as described in the Sun Fire V445 Server Parts Installation and Removal Guide. 5. To configure the new disk drive, type: TABLE 6-37 # cfgadm -c configure Ap-Id For example: TABLE 6-38 # cfgadm -c configure c1::dsk/c1t3d0 The green Activity indicator flashes as the new disk at c1t3d0 is added to the device tree.
Page 172
Sun Fire V445 Server Administration Guide • September 2007...
“Configuring Additional Network Interfaces” on page 145 About the Network Interfaces The Sun Fire V445 server provides four on-board Sun Gigabit Ethernet interfaces, which reside on the system motherboard and conform to the IEEE 802.3z Ethernet standard. For an illustration of the Ethernet ports, see .
PCI buses, which are supported by separate PCI bridges. For additional details, see “About the PCI Cards and Buses” on page Sun Fire V445 Server Administration Guide • September 2007...
To Attach a Twisted-Pair Ethernet Cable 1. Install the server into the rack. Refer to the Sun Fire V445 Server Installation Guide. 2. Locate the RJ-45 twisted-pair Ethernet (TPE) connector for the appropriate Ethernet interface – the left top (net0), left bottom (net1), right top (net2, right bottom (net3).
You need to furnish the address in a later step. An IP address must be assigned by the network administrator. Each network device or interface must have a unique IP address. Sun Fire V445 Server Administration Guide • September 2007...
145. Note – The Sun Fire V445 server conforms to the Ethernet 10/100BASE-T standard, which states that the Ethernet 10BASE-T link integrity test function should always be enabled on both the host system and the Ethernet hub. If you have problems establishing a connection between this system and your hub, verify that the Ethernet hub also has the link test function enabled.
Install the Sun Fire V445 server as described in the Sun Fire V445 Server Installation Guide. If you are setting up a redundant network interface, see “About Redundant Network Interfaces” on page 142. If you need to install a PCI network interface card, follow the installation instructions in the Sun Fire V445 Server Parts Installation and Removal Guide.
Page 179
The name of the file you create should be of the form /etc/hostname.typenum, where type is the network interface type identifier (some common types are ce, le, hme, eri, and ge) and num is the device instance number of the interface according to the order in which it was installed in the system.
Page 180
# ifconfig e1000g0 plumb inet ip-address netmask ip-netmask ..up For more information, see the ifconfig(1M) man page. Note – The Sun Fire V445 server conforms to the Ethernet 10/100BASE-T standard, which states that the Ethernet 10BASE-T link integrity test function should always be enabled on both the host system and the Ethernet hub.
C H A P T E R Diagnostics This chapter describes the diagnostic tools available for the Sun Fire V445 server. Topics in this chapter include: “Diagnostic Tools Overview” on page 152 “About Sun Advanced Lights-Out Manager 1.0 (ALOM)” on page 154 “About Status Indicators”...
Diagnostic Tools Overview Sun provides a range of diagnostic tools for use with the Sun Fire V445 server. The diagnostic tools are summarized in TABLE 8-1 Summary of Diagnostic Tools TABLE 8-1 Diagnostic Tool Type What It Does Accessibility and Availability...
Page 185
Summary of Diagnostic Tools (Continued) TABLE 8-1 Diagnostic Tool Type What It Does Accessibility and Availability Remote Capab SunVTS Software Exercises and stresses the Requires OS. Optional View and co system, running tests in package that needs to be over network parallel installed separately Software...
About Sun Advanced Lights-Out Manager 1.0 (ALOM) The Sun Fire V445 server ships with Sun Advanced Lights Out Manager (ALOM) 1.0 installed. The system console is directed to ALOM by default and is configured to show server console information on startup.
Another serial port – labeled TTYB – is available for general purpose serial data transfer. This port uses a DB-9 connector. For information on pinouts, refer to the Sun Fire V445 Server Installation Guide. In addition, the server has one 10BASE-T Ethernet management domain interface, labelled NET MGT.
If another user is logged on and has write capability, you will see the message below after issuing the console command: TABLE 8-5 sc> Console session already in use. [view mode] To take console write capability away from another user, type: TABLE 8-6 sc> console -f Sun Fire V445 Server Administration Guide • September 2007...
About Status Indicators For a summary of the server’s LED status indicators, see “Front Panel Indicators” on page 10 “Back Panel Indicators” on page About POST Diagnostics POST is a firmware program that is useful in determining if a portion of the system has failed.
– Diagnostics are executed at a Sun-prescribed level. diag-trigger – Replaces and consolidates the functions of post-trigger and obdiag-trigger. verbosity – Controls the amount and detail of firmware output. Redefined variable: Sun Fire V445 Server Administration Guide • September 2007...
diag-switch? parameter has modified behaviors for controlling diagnostic execution in normal mode on Sun UltraSPARC based volume servers. Behavior of the diag-switch? parameter is unchanged on Sun workstations. Default value changes: auto-boot-on-error? – New default value is true. diag-level – New default value is max. error-reset-recovery –...
Page 192
OpenBoot Diagnostics (obdiag) menu. Default is 1. Note: diag-passes applies only to systems with firmware that contains OpenBoot Diagnostics and has no effect outside the OpenBoot Diagnostics menu. Sun Fire V445 Server Administration Guide • September 2007...
Page 193
OpenBoot Configuration Variables That Control Diagnostic Testing and Automatic System TABLE 8-7 Restoration (Continued) OpenBoot Configuration Variable Description and Keywords Determines which devices are tested by OpenBoot Diagnostics. Default is diag-script normal. – • none OpenBoot Diagnostics do not run. –...
OpenBoot PROM configuration variables revert to the user- defined settings. In this way, you or your service provider can quickly invoke a known and maximum level of diagnostics and still preserve your normal mode settings. Sun Fire V445 Server Administration Guide • September 2007...
lists the OpenBoot configuration variables that are affected by service TABLE 8-8 mode and the overrides that are applied when you select service mode. Service Mode Overrides TABLE 8-8 OpenBoot Configuration Variable Service Mode Override auto-boot? false diag-level diag-trigger power-on-reset error-reset user-reset Factory default input-device Factory default...
TABLE 8-7 control diagnostic testing. Note – The standard (default) configuration is recommended for improved fault isolation and system restoration, and for increased system availability. Sun Fire V445 Server Administration Guide • September 2007...
When you are deciding whether to enable diagnostic testing in your normal environment, remember that you always should run diagnostics to troubleshoot an existing problem or after the following events: Initial system installation New hardware installation and replacement of defective hardware Hardware configuration modification Hardware relocation Firmware upgrade...
Page 198
If you specify a setting for level only, the post command uses the normal mode value for verbosity with the following exception: If the normal mode value of verbosity = none, post uses verbosity = min. Sun Fire V445 Server Administration Guide • September 2007...
If you specify settings for neither level nor verbosity, the post command uses the normal mode values you specified for the configuration variables, diag-level and verbosity, with two exceptions: If the normal mode value of diag-level = off, post uses level = min. If the normal mode value of verbosity = none, post uses verbosity = min.
CPU. Because the competition for system resources makes CPU testing a less linear process than memory testing, CPU test time will depend on the number of CPUs. Sun Fire V445 Server Administration Guide • September 2007...
If you need to know the approximate boot time of your new system before you power on for the first time, the following sections describe two methods you can use to estimate boot time: If your system configuration matches one of the three typical configurations cited “Boot Time Estimates for Typical Configurations”...
Note – The diag-level configuration variable also affects how much output the system generates. The following samples were produced with diag-level set to max, the default setting. Sun Fire V445 Server Administration Guide • September 2007...
Page 203
OpenBoot Diagnostics. TABLE 5 Executing POST w/%o0 = 0000.0400.0101.2041 Sun Fire V445, Keyboard Present Copyright 1998-2006 Sun Microsystems, Inc. All rights reserved. OpenBoot 4.15.0, 4096 MB memory installed, Serial #12980804. Ethernet address 8:0:20:c6:12:44, Host ID: 80c61244. Running diagnostic script obdiag/normal...
OpenBoot variables affect whether a system boots in normal or service mode, as well as whether any overrides occur. CODE EXAMPLE 8-1 {3} ok post SC Alert: Host System has Reset Executing Power On Self Test Q#0> Sun Fire V445 Server Administration Guide • September 2007...
Page 205
CODE EXAMPLE 8-1 0>@(#)Sun Fire[TM] V445 POST 4.22.11 2006/06/12 15:10 /export/delivery/delivery/4.22/4.22.11/post4.22.x/Fiesta/boston/ integrated (root) 0>Copyright ? 2006 Sun Microsystems, Inc. All rights reserved SUN PROPRIETARY/CONFIDENTIAL. Use is subject to license terms. 0>OBP->POST Call with %o0=00000800.01012000. 0>Diag level set to MIN. 0>Verbosity level set to NORMAL.
Page 206
CODE EXAMPLE 8-1 Copyright 2006 Sun Microsystems, Inc. All rights reserved. OpenBoot 4.22.11, 24576 MB memory installed, Serial #64548465. Ethernet address 0:3:ba:d8:ee:71, Host ID: 83d8ee71. Sun Fire V445 Server Administration Guide • September 2007...
Page 207
System Reset Normal Mode skip_diag diag System Controller one-shot execution bootmode with some overrides normal true service-mode? variable false Service Mode Sun-prescribed level of diagnostics diag System Control normal user-reset error-reset power-on-reset false diag-switch? diag-trigger variable variable true none Normal Mode Power-On Reset? full user control OpenBoot...
Summary of Diagnostic Operation (Continued) TABLE 8-10 User Action Sets Configuration Variables And Initiates Issue bootmode diag command Overrides service mode settings and uses Normal mode diagnostics normal mode settings with the following with the exceptions in the exceptions: preceding column. •...
A summary of the tests is available. At the obdiag> prompt, type: TABLE 8-15 obdiag> help Sun Fire V445 Server Administration Guide • September 2007...
Page 211
4. You can also run all tests, type: TABLE 8-16 obdiag> test-all Hit the spacebar to interrupt testing Testing /pci@1f,700000/pci@0/pci@2/pci@0/pci@8/LSILogic,sas@1 ..passed Testing /ebus@1f,464000/flashprom@0,0 ......... passed Testing /pci@1f,700000/pci@0/pci@2/pci@0/pci@8/pci@2/network@0 Internal loopback test -- succeeded. Link is -- up ..passed Testing /ebus@1f,464000/rmc-comm@0,c28000 ......
You can also run OpenBoot Diagnostics tests directly from the ok prompt. To do this, type the test command, followed by the full hardware path of the device (or set of devices) to be tested. For example: TABLE 8-19 ok test /pci@x,y/SUNW,qlc@2 Sun Fire V445 Server Administration Guide • September 2007...
Page 213
Note – Knowing how to construct an appropriate hardware device path requires precise knowledge of the hardware architecture of the Sun Fire V445 system. To customize an individual test, you can use test-args as follows: TABLE 8-20 ok test /usb@1,3:test-args={verbose,debug} This affects only the current test without changing the value of the test-args OpenBoot configuration variable.
The probe-scsi-all command diagnoses problems with the SAS devices. Caution – If you used the halt command or the Stop-A key sequence to reach the ok prompt, then issuing the probe-scsi-all command can hang the system. Sun Fire V445 Server Administration Guide • September 2007...
Page 215
The probe-scsi-all command communicates with all SAS devices connected to on-board SAS controllers and accesses devices connected to any host adapters installed in PCI slots. For any SAS device that is connected and active, the probe-scsi-all command displays its loop ID, host adapter, logical unit number, unique World Wide Name (WWN), and a device description that includes type and manufacturer.
About Predictive Self-Healing In Solaris 10 systems, the Solaris Predictive Self-Healing (PSH) technology enables Sun Fire V445 server to diagnose problems while the Solaris OS is running, and mitigate many problems before they negatively affect operations. The Solaris OS uses the fault manager daemon, fmd(1M), which starts at boot time and runs in the background to monitor the system.
Use the following web site to interpret faults and obtain information on a fault: http://www.sun.com/msg/ This web site directs you to provide the message ID that your system displayed. The web site then provides knowledge articles about the fault and corrective action to resolve the fault.
The fmdump command displays the contents of any log files associated with the Solaris Fault Manager. The fmdump command produces output similar to . This example assumes TABLE 8-23 there is only one fault. Sun Fire V445 Server Administration Guide • September 2007...
DE (also seen in the console output) has received an event which it accepted. A case is opened for that event and a diagnosis is performed to solve the cause for the failure. Sun Fire V445 Server Administration Guide • September 2007...
TABLE 8-29 # fmstat module ev_recv ev_acpt wait svc_t open solve memsz bufsz cpumem-diagnosis 3.0K cpumem-retire 713K fmd-self-diagnosis io-retire snmp-trapgen sysevent-transport 0.0 6704.4 syslog-msgs zfs-diagnosis About Traditional Solaris OS Diagnostic Tools If a system passes OpenBoot Diagnostics tests, it normally attempts to boot its multiuser OS.
Sun Fire V445 server: prtconf prtdiag prtfru psrinfo showrev This section describes the information these commands give you. For more information on using these commands, refer to the Solaris man pages. Sun Fire V445 Server Administration Guide • September 2007...
Page 225
The output of prtconf also includes the total amount of system memory, and shows an excerpt of prtconf output (truncated to save space). prtconf Command Output (Truncated) CODE EXAMPLE 8-6 # prtconf System Configuration: Sun Microsystems sun4u Memory size: 1024 Megabytes System Peripherals (Software Nodes): SUNW,Sun-Fire-V445 packages (driver not attached)
Page 226
The display format used by the prtdiag command can vary depending on what version of the Solaris OS is running on your system. Following is an excerpt of some of the output produced by prtdiag on a Sun Fire V445 server. prtdiag Command Output...
Page 228
=============================== usb Devices =============================== Name Port# ------------ ----- HUB0 bash-3.00# Page 177 Verbose output with fan tach fail ============================ Environmental Status ============================ Fan Status: ------------------------------------------- Location Sensor Status ------------------------------------------- MB/FT0/F0 TACH okay Sun Fire V445 Server Administration Guide • September 2007...
[NO_FAULT] CPU1 0000 [FAULT] Using the prtfru Command The Sun Fire V445 system maintains a hierarchical list of all FRUs in the system, as well as specific information about various FRUs. Sun Fire V445 Server Administration Guide • September 2007...
Page 231
The prtfru command can display this hierarchical list, as well as data contained in the serial electrically erasable programmable read-only memory (SEEPROM) devices located on many FRUs. shows an excerpt of a hierarchical list of CODE EXAMPLE 8-11 FRUs generated by the prtfru command with the -l option. prtfru -l Command Output (Truncated) CODE EXAMPLE 8-11 # prtfru -l...
Page 232
/InstallationR[3] /InstallationR[3]/UNIX_Timestamp32: Fri Apr 21 08:50:32 EDT 2006 /InstallationR[3]/Fru_Path: MB.SEEPROM /InstallationR[3]/Parent_Part_Number: 3753302 /InstallationR[3]/Parent_Serial_Number: 0001 /InstallationR[3]/Parent_Dash_Level: 03 /InstallationR[3]/System_Id: /InstallationR[3]/System_Tz: 0 /InstallationR[3]/Geo_North: 1 /InstallationR[3]/Geo_East: 16531457 /InstallationR[3]/Geo_Alt: 251 /InstallationR[3]/Geo_Location: /Status_EventsR (0 iterations) SEGMENT: PE Sun Fire V445 Server Administration Guide • September 2007...
When used with the -p option, this command displays installed patches. TABLE 8-30 shows a partial sample output from the showrev command with the -p option. showrev -p Command Output TABLE 8-30 Patch: 109729-01 Obsoletes: Requires: Incompatibles: Packages: SUNWcsu Patch: 109783-01 Obsoletes: Requires: Incompatibles: Packages: SUNWcsu...
This section explains how to access and modify OpenBoot configuration variables. For a list of important OpenBoot configuration variables, see TABLE 8-7 Changes to OpenBoot configuration variables usually take effect upon the next reboot. Sun Fire V445 Server Administration Guide • September 2007...
Page 237
To View and Set OpenBoot Configuration Variables 1. Obtain the ok prompt. To display the current values of all OpenBoot configuration variables, use the printenv command. The following example shows a short excerpt of this command’s output. TABLE 8-33 ok printenv Variable Name Value Default Value...
36 Gbyte Hard Disk Drives, both of them active. probe-scsi-all Output Message CODE EXAMPLE 8-16 ok probe-scsi-all /pci@1f,0/pci@1/scsi@8,1 /pci@1f,0/pci@1/scsi@8 Target 0 Unit 0 Disk SEAGATE ST336605LSUN36G 4207 Target 1 Unit 0 Disk SEAGATE ST336605LSUN36G 0136 Sun Fire V445 Server Administration Guide • September 2007...
Using the probe-ide Command To Confirm That the DVD Drive is Connected The probe-ide command transmits an inquiry command to internal and external IDE devices connected to the system’s on-board IDE interface. The following sample output reports a DVD drive installed (as Device 0) and active in a server. probe-ide Output Message CODE EXAMPLE 8-17 ok probe-ide...
About Automatic Server Restart Note – Automatic Server Restart is not the same as Automatic System Restoration (ASR), which the Sun Fire V445 server also supports. Automatic Server Restart is a functional part of ALOM. It monitors the Solaris OS while it is running and, by default, captures cpu register and memory contents to the dump-device using the firmware level sync command.
Page 241
About Automatic System Restoration Note – Automatic System Restoration (ASR) is not the same as Automatic Server Restart, which the Sun Fire V445 server also supports. Automatic System Restoration (ASR) consists of self-test features and an auto- configuring capability to detect failed hardware components and unconfigure them.
Auto-Boot Options The OpenBoot firmware stores configuration variables on a ROM chip called auto- boot? and auto-boot-on-error? The default setting on the Sun Fire V445 server for both of these variables is true. The auto-boot? setting controls whether or not the firmware automatically boots the OS after each reset.
Error Handling Summary Error handling during the power-on sequence falls into one of the following three cases: If no errors are detected by POST or OpenBoot Diagnostics, the system attempts to boot if auto-boot? is true. If only nonfatal errors are detected by POST or OpenBoot Diagnostics, the system attempts to boot if auto-boot? is true and auto-boot-on-error? is true.
In addition, ASR is enabled by default because diag-trigger is set to power-on- reset and error-reset. This default setting remains when the diag-switch? variable is set to false. auto-boot? and auto-boot-on-error? are set to true by default. Sun Fire V445 Server Administration Guide • September 2007...
Automatic System Restoration User Commands The OpenBoot commands .asr, asr-disable, and asr-enable are available for obtaining ASR status information and for manually unconfiguring or reconfiguring system devices. For more information, see “Unconfiguring a Device Manually” on page 112. Enabling Automatic System Restoration The ASR feature is enabled by default.
Page 246
In the .asr command output, any devices marked disabled have been manually unconfigured using the asr-disable command. The .asr command also lists devices that have failed firmware diagnostics and have been automatically unconfigured by the OpenBoot ASR feature. Sun Fire V445 Server Administration Guide • September 2007...
About SunVTS SunVTS is a software suite that performs system and subsystem stress testing. You can view and control a SunVTS session over a network. Using a remote machine, you can view the progress of a testing session, change testing options, and control all testing features of another machine on the network.
PCI cards Performs a loopback test to check that the Ethernet adapter can send netlbtest and receive packets Tests the physical memory (read only) pmemtest Tests the server’s on-board serial ports sutest Sun Fire V445 Server Administration Guide • September 2007...
SunVTS Tests TABLE 8-35 SunVTS Test Description Tests the virtual memory (a combination of the swap partition and vmemtest the physical memory) Tests the environmental devices env6test Tests ALOM hardware devices ssptest Tests I2C devices for correct operation i2c2test To Find Out Whether SunVTS Is Installed Type: TABLE 8-36 # pkginfo -l SUNWvts...
What Sun Management Center Monitors Disk drives Status Fans Status CPUs Temperature and any thermal warning or failure conditions Power supply Status System temperature Temperature and any thermal warning or failure conditions Sun Fire V445 Server Administration Guide • September 2007...
The servers being monitored must be up and running if you want to use Sun Management Center, since this tool relies on the Solaris OS. For instructions on using this tool to monitor a Sun Fire V445 server, see Chapter...
Sun Management Center periodically check whether it is up and running, and notify you if it goes out of commission. Sun Fire V445 Server Administration Guide • September 2007...
Hardware Diagnostic Suite The Hardware Diagnostic Suite is a package that you can purchase as an add-on to Sun Management Center. The suite enables you to exercise a system while it is still up and running in a production environment. See “Hardware Diagnostic Suite”...
Page 254
Hardware Diagnostic Suite. Instructions for setting up Sun Management Center, as well as for using the Hardware Diagnostic Suite, can be found in the Sun Management Center Software User’s Guide. Sun Fire V445 Server Administration Guide • September 2007...
C H A P T E R Troubleshooting This chapter describes the diagnostic tools available for the Sun Fire V445 server. Topics in this chapter include: “Troubleshooting Options” on page 223 “About Updated Troubleshooting Information” on page 224 “About Firmware and Software Patch Management” on page 225 “About Sun Install Check Tool”...
V445 Server Product Notes and at Sun web sites. These resources can help you understand and diagnose problems that you might encounter. Product Notes Sun Fire V445 Server Product Notes contain late-breaking information about the system, including the following: Current recommended and required software patches...
When you install the Sun Install Check tool, you also install Sun Explorer Data Collector. The Sun Install Check tool uses Sun Explorer Data Collector to help you confirm that Sun Fire V445 server installation has been completed optimally. Together, they can evaluate your system for the following:...
About Sun Remote Services Net Connect Sun Remote Services (SRS) Net Connect is a collection of system management services designed to help you better control your computing environment. These Web-delivered services enable you to monitor systems, to create performance and trend reports, and to receive automatic notification of system events.
For more information about how ASR works, and complete instructions for enabling ASR capability, see “About Automatic System Restoration” on page 209. Sun Fire V445 Server Administration Guide • September 2007...
Remote Troubleshooting Capabilities You can use the Sun Advanced Lights Out Manager (ALOM) system controller to troubleshoot and diagnose the system remotely. The ALOM system controller enables you to do the following: Turn system power on and off Control the Locator indicator Change OpenBoot configuration variables View system environmental status information View system event logs...
core dump directory to another locally mounted location so that you can better manage any system core dumps. In certain testing and pre-production environments, this is recommended since core dump files can take up a large amount of file system space. Swap space is used to save the dump of system memory.
Page 264
System dump time: Wed Apr 23 17:03:48 2003 savecore: not enough space in /var/crash/sf440-a (216 MB avail, 246 MB needed) Perform Steps 4 and 5 if there is not enough space. Sun Fire V445 Server Administration Guide • September 2007...
4. Type the df -k1 command to identify locations with more space. TABLE 9-6 # df -k1 Filesystem kbytes used avail capacity Mounted on /dev/dsk/c1t0d0s0 832109 552314 221548 /proc /proc /dev/fd mnttab /etc/mntab swap 3626264 362624 /var/run swap 3626656 362624 /tmp /dev/dsk/c1t0d0s7 33912732...
Page 266
There should also be a bounds file that contains the next crash number savecore will use. If a core dump is not generated, perform the procedure described in “Enabling the Core Dump Process” on page 231. Sun Fire V445 Server Administration Guide • September 2007...
A P P E N D I X Connector Pinouts This appendix provides reference information about the system back panel ports and pin assignments. Topics covered in this appendix include: “Reference for the Serial Management Port Connector” on page 235 “Reference for the Network Management Port Connector”...
The network management connector (labeled NET MGT) is an RJ-45 connector located on the ALOM card and can be accessed from the back panel. This port needs to be configured prior to use. Sun Fire V445 Server Administration Guide • September 2007...
Serial Port Connector Signals TABLE A-3 Signal Description Signal Description Data Carrier Detect Data Set Ready Receive Data Request to Send Transmit Data Clear to Send Data Terminal Ready Ring Indicate Ground Sun Fire V445 Server Administration Guide • September 2007...
Reference for the USB Connectors Two Universal Serial Bus (USB) ports are located on the motherboard in a double- stacked layout and can be accessed from the back panel. USB Connector Diagram USB3 USB2 USB Connector Diagram FIGURE A-4 USB Connector Signals For USB connector signals, see TABLE A-4 USB Connector Signals...
Four RJ-45 Gigabit Ethernet connectors (NET0, NET1, NET2, NET3) are located on the system motherboard and can be accessed from the back panel. The Ethernet interfaces operate at 10 Mbit/sec, 100 Mbit/sec, and 1000 Mbit/sec. Gigabit Ethernet Connector Diagram Sun Fire V445 Server Administration Guide • September 2007...
A P P E N D I X System Specifications This appendix provides the following specifications for the Sun Fire V445 server: “Reference for Physical Specifications” on page 244 “Reference for Electrical Specifications” on page 244 “Reference for Environmental Specifications” on page 245 “Reference for Agency Compliance Specifications”...
13.2 A @ 100 VAC 11 A @ 120 VAC 6.6 A @ 200 VAC 6.35 A @ 208 VAC 6 A @ 220 VAC 5.74 A @ 230 VAC 5.5A @ 240 VAC Output Sun Fire V445 Server Administration Guide • September 2007...
Electrical Specifications TABLE B-2 Parameter Value +12 VDC 0.5 to 45A -12 VDC 0 to 0.8A +5 VDC 0.5 to 28A -5 VDC 0.5 to 50A Maximum DC Output of Two (2) Power 1100W Max AC power consumption 1320W for Supplies operation @ 100 VAC to 240 VAC Max heat dissipation 4505 BTUs/Hr for operation @ 200...
0.5 inch roll-off side to sideIEC 60068-2-27 Handling Drops 60 mm, 1 drop per corner, 4 cornersIEC 60068-2-31 Threshold Impact 0.85m/s, 3 impacts per caster, all 4 casters, 25 mm step-upETE 1010-01 Sun Fire V445 Server Administration Guide • September 2007...
Page 279
Reference for Agency Compliance Specifications The system complies with the following specifications. Agency Compliance Specifications TABLE B-4 Category Relevant Standards Safety UL/CSA-60950-1, EN60950-1, IEC60950-1 CB Scheme with all country deviations, IEC825-1, 2, CFR21 part 1040, CNS14336 RFI/EMI EN55022 Class A 47 CFR 15B Class A ICES-003 Class A VCCI Class A...
Minimum clearances needed for servicing the system are as follows. Clearance and Service Access Specifications TABLE B-5 Blockage Required Clearance Front of System 36 in (91.4 cm) Back of System 36 in (91.4 cm) Sun Fire V445 Server Administration Guide • September 2007...
A P P E N D I X OpenBoot Configuration Variables describes the OpenBoot firmware configuration variables stored on an TABLE C-1 IDPROM module on a new system controller. The OpenBoot configuration variables are printed here in the order in which they appear when you issue the showenv command.
Page 282
If true, execute commands in NVRAMRC use-nvramrc? true, false false during server start-up. variable-name Command script to execute if use-nvramrc? nvramrc none is true. Firmware security level. security-mode none, command, full none Sun Fire V445 Server Administration Guide • September 2007...
Page 283
OpenBoot Configuration Variables Stored on a ROM Chip (Continued) TABLE C-1 Variable Possible Values Default Value Description variable-name Firmware security password if security-password none security-mode is not none (never displayed) - do not set this directly. variable-name Number of incorrect security password security- none attempts.
Page 284
Sun Fire V445 Server Administration Guide • September 2007...
Page 288
142 link integrity test, 145, 148 network management port (NET MGT) local graphics monitor about, 27 remote power-off, 64, 67 activating, 42 remote power-on, 62 advantages over serial management port, 30 Sun Fire V445 Server Administration Guide • September 2007...
Page 289
configuration rules, 81 test command, 179 configuring IP address, 43 test-all command, 180 configuring using Dynamic Host Configuration OpenBoot emergency procedures Protocol (DHCP), 42 performing, 109 issuing an externally initiated reset (XIR) from, 4 OpenBoot firmware non-mirrored disk hot-plug operation, 136 scenarios for control, 35 selecting a boot device, 69 operating environment software, suspending, 38...
Page 290
(#.), 34 RAID 0 (striping), 121, 128 system console, switching between, 38 RAID 1 (mirroring), 121, 124 ways to access, 34 raidctl (Solaris command), ?? to 135 scadm (Solaris utility), 106 Sun Fire V445 Server Administration Guide • September 2007...
Page 291
SEAM (Sun Enterprise Authentication psrinfo, 201 Mechanism), 215 raidctl, ?? to 135 scadm, 106 serial management port (SERIAL MGT) setlocator, 108 about, 4, 7 showlocator, 109 acceptable console device connections, 29 showrev, 201 as default communication port on initial shutdown, 36, 41 startup, 26 swap -l, 232 as default console connection, 96...
Page 292
USB ports about, 4 temperature sensors, 100 configuration rules, 95 terminal server connecting to, 95 accessing system console from, 29, 44 connection through patch panel, 44 connection through serial management port, 27 Sun Fire V445 Server Administration Guide • September 2007...
Page 293
VERITAS Volume Manager, 118, 119, 120 watchdog, hardware, See hardware watchdog mechanism watch-net all diagnostic output message, 207 watch-net diagnostic output message, 207 World Wide Name (probe-scsi), 181 XIR, See externally initiated reset XIR, See externally initiated reset (XIR) Index...
Page 294
Sun Fire V445 Server Administration Guide • September 2007...
Need help?
Do you have a question about the Sun Fire V445 Server and is the answer not in the manual?
Questions and answers