Huawei FusionServer Pro CH242 V5 Technical White Paper page 53

Table of Contents

Advertisement

Huawei FusionServer Pro CH242 V5 Compute Node
Technical White Paper
Module
PCIe
UPI
System
Issue 07 (2020-07-31)
Feature
Memory Demand and
Patrol Scrubbing
Memory Mirroring
Single Device Data
Correction
Device Tagging
Data Scrambling
PCIe Advanced Error
Reporting
Intel UPI Link Level Retry
Intel UPI Protocol
Protection via CRC
Core Disable for Fault
Resilient Boot (FRB)
Corrupt Data Containment
Mode
Socket disable for Fault
Resilient Boot (FRB)
Architected Error Records
Copyright © Huawei Technologies Co., Ltd.
Description
Corrects errors upon detection. If these
errors are not corrected promptly,
uncorrectable errors may occur.
Improves system reliability.
Provides a single-device multi-bit error
correction capability to improve
memory reliability.
Degrades and rectifies DIMM device
faults to improve DIMM availability.
Optimizes data stream distribution
and reduces the error possibility to
improve the reliability of data streams
in the memory and the capability to
detect address errors.
Improves server serviceability.
Provides a retry mechanism upon
errors to improve UPI reliability.
Provides cyclic redundancy check
(CRC) protection for UPI packets to
improve system reliability.
Isolates a faulty CPU core during
startup to improve system reliability
and availability.
Identifies the memory storage unit
that contains corrupted data to
minimize the impact on the running
programs and improve system
reliability.
Isolates a faulty socket during the
BIOS startup process to improve
system reliability.
With the enhanced machine check
architecture (eMCA) feature, the BIOS
collects error information from
hardware registers in compliance with
UEFI specifications, sends the error
information to the OS over the APEI of
the Advanced Configuration and
Power Interface (ACPI), and locates
the error unit, improving system
availability.
A Appendix
47

Advertisement

Table of Contents
loading

Table of Contents