Comment 0 for bug 1857413

fan jinke (fanjinke) wrote :

With the Linux kernel, injecting a 1-bit ECC error causes MCE logs to be recorded in dmesg, like:

[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
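For reference, the MC16_STATUS value in the log above can be decoded by hand. The following is only a rough sketch: the bit positions are assumed to match the MCI_STATUS_* definitions in the kernel's arch/x86/include/asm/mce.h, and the names STATUS_BITS and decode_mca_status are made up for illustration, they are not kernel APIs.

```python
# Bit positions are an assumption taken from the MCI_STATUS_* macros in
# Linux's arch/x86/include/asm/mce.h; verify against your kernel tree.
STATUS_BITS = {
    "Val": 63, "Over": 62, "UC": 61, "En": 60, "MiscV": 59, "AddrV": 58,
    "PCC": 57, "TCC": 55, "SyndV": 53, "CECC": 46, "UECC": 45,
    "Deferred": 44, "Poison": 43,
}


def decode_mca_status(status: int) -> dict:
    """Hypothetical helper: split an MCA status value into named fields."""
    flags = [name for name, bit in STATUS_BITS.items() if (status >> bit) & 1]
    return {
        "flags": flags,
        # Extended error code is assumed to sit in bits 21:16 (SMCA layout).
        "extended_error_code": (status >> 16) & 0x3F,
        # Low 16 bits hold the architectural error code.
        "error_code": status & 0xFFFF,
    }


# Value copied from the MC16_STATUS line in the dmesg output above.
result = decode_mca_status(0xDC2040000000011B)
print(result["flags"])
print(hex(result["error_code"]), result["extended_error_code"])
```

The UC bit being clear while CECC is set agrees with the "Corrected error" and "CECC" fields printed by the kernel, and the extended error code of 0 agrees with the "Extended Error Code: 0" line.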

*However, no such log is recorded when using "Ubuntu 18.04.3 LTS".*

The related upstream commit is de0e0624d86ff9fc512dedb297f8978698abf21a.

After merging this commit, dmesg records the MCE log.