Comment 4 for bug 1871965

Revision history for this message
dann frazier (dannf) wrote :

= bionic verification =

== AMD EPYC ==

# dmesg
[ 631.470101] mce: [Hardware Error]: Machine check events logged
[ 631.470104] [Hardware Error]: Deferred error, no action required.
[ 631.470153] [Hardware Error]: CPU:0 (17:31:0) MC0_STATUS[-|-|MiscV|AddrV|-|-|SyndV|UECC|Deferred|-|-]: 0x9c2030000000011b
[ 631.470213] [Hardware Error]: Error Addr: 0x000000035dd8bfc0
[ 631.470245] [Hardware Error]: IPID: 0x000000b000000000, Syndrome: 0x000000030b404000
[ 631.470287] [Hardware Error]: Load Store Unit Ext. Error Code: 0, Load queue parity error.
[ 631.470332] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
# ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No Extlog errors.

MCE events:
1 2020-04-15 18:06:00 +0000 error: Deferred error, no action required., CPU 2, bank Load Store Unit (bank=0), mcg mcgstatus=0, mci UECC, mcgcap=0x0000011c, status=0x9c2030000000011b, addr=0x35dd8bfc0, walltime=0x5e974d09, cpuid=0x00830f10

== Skylake ==
$ sudo ./mce-inject < test/corrected
$ dmesg | tail
[ 18.176600] EXT4-fs (sda2): resizing filesystem from 97545216 to 97546513 blocks
[ 18.176939] EXT4-fs (sda2): resized filesystem to 97546513
[ 19.097678] new mount options do not match the existing superblock, will be ignored
[ 3952.080562] mce: Machine check injector initialized
[ 3960.953025] mce: Starting machine check poll CPU 0
[ 3960.953063] mce: Machine check poll done on CPU 0
[ 3960.953174] mce: [Hardware Error]: Machine check events logged
[ 3960.953328] mce: Starting machine check poll CPU 1
[ 3960.953360] mce: Machine check poll done on CPU 1
[ 3960.953378] mce: [Hardware Error]: Machine check events logged
$ sudo ras-mc-ctl --errors
No Memory errors.

No PCIe AER errors.

No Extlog errors.

MCE events:
1 2020-04-15 19:41:29 +0000 error: No Error, mcg mcgstatus=0, mci Corrected_error Error_enabled, mcgcap=0x0f000814, status=0x9400000000000000, addr=0x0000abcd, walltime=0x5e976369, cpuid=0x00050654, bank=0x00000001
2 2020-04-15 19:41:29 +0000 error: No Error, mcg mcgstatus=0, mci Corrected_error Error_enabled, mcgcap=0x0f000814, status=0x9400000000000000, addr=0x00001234, walltime=0x5e976369, cpu=0x00000001, cpuid=0x00050654, apicid=0x00000002, bank=0x00000002