The new handler can be configured at system run time by reading or writing the
control files in /sys/devices/system/machinecheck/machinecheck0/ 10 Valid fields
are:
• tolerant Tolerance level. The higher this level the more risk the machine check
handler takes to keep the machine running.
Valid levels are:
0 always panic on uncorrected errors.
1 panic if deadlock possible
2 try to avoid panic at slight deadlock risk
3 never panic or exit (for testing only)
Specifying oops=panic on the kernel command line implies zero tolerance.
For a cluster setting tolerant to zero may be best, together with panic=10 to
force an reboot.
• check interval Interval in seconds to check for silent machine check events.
Default 5 minutes. 0 disables background checking.
• bank0ctl … bankNctl Binary mask of errors enabled in bank N. Default is to
enable all errors in each bank. An disabled error will be ignored. For details
on the banks and their sub-errors for AMD and Intel CPUs see [opteron]
https://git.kernel.org/pub/scm/utils/cpu/mce/mcelog.git
linux下提供的MCA SYS 接口
于 2023-02-28 14:51:00 首次发布