Disabling Cmci; Disabling Kdump Restart On Udev Triggered By Logical Processor Offline - NEC Express5800/A1040b User Manual

Machine check monitoring service
Hide thumbs Also See for Express5800/A1040b:
Table of Contents

Advertisement

Setting in mcemonitor.conf
core-ce-action = soft
core-ce-action = account
core-ce-action = off
Table 3-2 mcemonitor configuration file(memory-ce-action)
Setting in mcemonitor.conf
memory-ce-action = soft
memory-ce-action = off
The system must be rebooted if configuration file is modified.
3.3.3

Disabling CMCI

In RHEL6.6 kernel 2.6.32-504.23.4.el6.x86_64, it is reported that the frequent occurrence of
CMCI(Corrected Machine Check Interrupt), which notifies the operating system of the detected
corrrectable error, may cause System panic.
To change the error detecting mode from "interrupt mode" to "polling mode", you need to add
"mce=no_cmci" to the kernel line in the "/boot/efi/EFI/redhat/grub.conf".
The system must be rebooted if configuration file is modified.
title Red Hat Enterprise Linux Server (2.6.32-504.23.4.el6.x86_64)
root (hd0,0)
kernel /vmlinuz-2.6.32-504.23.4.el6.x86_64 ro
root=/dev/mapper/VolGroup00-LogVol00
rd_LVM_LV= VolGroup00/LogVol00 rd_NO_LUKS nomodeset rd_NO_MD rhgb quiet
crashkernel=256M KEYBOARDTYPE=pc KEYTABLE=jp106 LANG=ja_JP.UTF-8 rd_NO_DM
mce=no_cmci
initrd /initramfs-2.6.32-504.23.4.el6.x86_64.img
3.3.4

Disabling kdump restart on udev triggered by logical processor offline

Add # at the top of the following line in /etc/udev/rules.d/98-kexec.rules file to disable the rule.
#SUBSYSTEM=="cpu", ACTION=="offline", PROGRAM="/etc/init.d/kdump restart"
Restart udev after modifying configuration file.
udevadm control --reload-rules
Note
Table 3-1 mcemonitor configuration file(core-ce-action)
Collects log and makes CPU Core Offline if the CPU error count
exceeds the threshold value. (Default)
Collects log but does not make CPU Core Offline even if the CPU
error count exceeds the threshold value.
Does not collect log nor make CPU Core Offline.
Collects log and makes Memory Page Offline if the memory error count
exceeds the threshold value. (Default)
The process running on the relevant memory is transferred to another
memory.
Does not collect log nor make Memory Page Offline.
kdump is restarted when capmonitor executes script upon completion of Core
Offline. You need to place the script file to be used after Core Offline according to
"3.3.5 Script file to be executed after Core Offline".
Description
Description
14

Advertisement

Table of Contents
loading

This manual is also suitable for:

Express5800/a2010bExpress5800/a2020bExpress5800/a2040b

Table of Contents