Reliability, Availability, And Serviceability - IBM System x3850 X5 Installation And User Manual

Types 7145, 7146, 7143, and 7191

• Redundant connection
  The addition of an optional network interface card (NIC) provides a failover
  capability to a redundant Ethernet connection. If a problem occurs with the
  primary Ethernet connection, all Ethernet traffic that is associated with the
  primary connection is automatically switched to the redundant NIC. If the
  applicable device drivers are installed, this switching occurs without data loss
  and without user intervention.
• Redundant cooling and power capabilities
  The redundant cooling of the fans in the server enables continued operation if
  one of the fans fails. The server supports up to two hot-swap power supplies,
  which provide redundant power for many server configurations.
• ServeRAID support
  The server supports ServeRAID controllers to create redundant array of
  independent disks (RAID) configurations.
• Symmetric multiprocessing (SMP)
  The server supports up to four multi-core Intel Xeon microprocessors. One or
  more multi-core microprocessors provide SMP capability.

Reliability, availability, and serviceability

Three important server design features are reliability, availability, and serviceability
(RAS). The RAS features help to ensure the integrity of the data that is stored in
the server, the availability of the server when you need it, and the ease with which
you can diagnose and correct problems.
The server has the following RAS features:
• Advanced memory features:
  – Single-bit memory error detection
  – Single-bit memory error hardware correction
  – Multiple single-bit memory error recovery and correction
  – Uncorrectable error (UE) detection
  – Full array memory mirroring (FAMM) redundancy
  – Automatic failover recovery for UEs when FAMM is configured
  – Automated logical removal of failed DIMMs on reboots prior to replacement
  – Automatic address parity checking during writes and reads
• Automatic BIOS recovery (ABR) for UEFI
• Automatic error retry and recovery
• Automatic restart after a power failure
• Availability of microcode and diagnostic levels
• Integrated management module (service processor)
• Built-in, menu-driven electrically erasable programmable ROM (EEPROM) based
  setup, system configuration, and diagnostic programs
• Built-in monitoring for fan, power, temperature, voltage, and power-supply
  redundancy
• Error codes and messages
• Error correcting code (ECC) L2 cache and system memory
• Fault-resistant startup
• Hot-swap hard disk drives
• IBM Systems Director workgroup-hardware-management tool
• Information and light path diagnostics LED panels
• Integrated management module
• Service processor adapter for remote systems management
• Parity checking on the SAS bus and PCI Express buses
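
The difference between a single-bit error that the hardware can correct and an uncorrectable error (UE) can be illustrated with a small sketch. The Python example below uses a textbook extended Hamming (8,4) code, which corrects any single flipped bit and detects any double-bit flip. This code was chosen for illustration only: this guide does not describe the ECC scheme that the server's memory controller uses, and the function names `encode` and `decode` are hypothetical.

```python
from typing import Optional, Tuple


def encode(nibble: int) -> int:
    """Encode 4 data bits as an 8-bit extended Hamming codeword."""
    bits = [0] * 8  # index 0: overall parity; indexes 1..7: Hamming positions
    # Data bits occupy the positions that are not powers of two.
    bits[3], bits[5], bits[6], bits[7] = [(nibble >> i) & 1 for i in range(4)]
    # Each parity bit covers the positions whose index has that bit set.
    bits[1] = bits[3] ^ bits[5] ^ bits[7]
    bits[2] = bits[3] ^ bits[6] ^ bits[7]
    bits[4] = bits[5] ^ bits[6] ^ bits[7]
    # Overall parity makes the number of 1 bits in the codeword even.
    bits[0] = sum(bits[1:]) % 2
    return sum(b << i for i, b in enumerate(bits))


def decode(word: int) -> Tuple[Optional[int], str]:
    """Return (data, status); status is 'ok', 'corrected', or 'uncorrectable'."""
    bits = [(word >> i) & 1 for i in range(8)]
    # The syndrome is the XOR of the positions that hold a 1 bit.
    syndrome = 0
    for pos in range(1, 8):
        if bits[pos]:
            syndrome ^= pos
    parity_error = sum(bits) % 2 == 1

    if parity_error:
        # Odd number of flips: treat as one flipped bit at index `syndrome`
        # (index 0 means the overall parity bit itself was hit).
        bits[syndrome] ^= 1
        status = "corrected"
    elif syndrome != 0:
        # Parity is even but the syndrome is nonzero: two bits flipped.
        return None, "uncorrectable"
    else:
        status = "ok"

    data = bits[3] | (bits[5] << 1) | (bits[6] << 2) | (bits[7] << 3)
    return data, status


if __name__ == "__main__":
    word = encode(0b1011)
    print(decode(word))          # no error injected
    print(decode(word ^ 0b100))  # one bit flipped
    print(decode(word ^ 0b110))  # two bits flipped
```

In this sketch, a single flipped bit is repaired transparently, whereas a double flip can only be reported. That mirrors, at a much smaller scale, why a feature such as memory mirroring is needed to recover from a UE.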

This manual is also suitable for: System x3950 X5
