2019-12-24 07:00:12 |
fan jinke |
bug |
|
|
added bug |
2019-12-24 07:14:47 |
fan jinke |
description |
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, dmesg can record the mce log. |
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well. |
|
2019-12-24 07:30:08 |
Ubuntu Kernel Bot |
linux (Ubuntu): status |
New |
Incomplete |
|
2019-12-24 09:46:45 |
fan jinke |
tags |
|
apport-collected disco |
|
2019-12-24 09:46:46 |
fan jinke |
description |
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well. |
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
---
ProblemType: Bug
AlsaDevices:
total 0
crw-rw----+ 1 root audio 116, 1 Dec 24 17:20 seq
crw-rw----+ 1 root audio 116, 33 Dec 24 17:20 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
ApportVersion: 2.20.10-0ubuntu27
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
DistroRelease: Ubuntu 19.04
InstallationDate: Installed on 2019-12-24 (0 days ago)
InstallationMedia: Ubuntu-Server 19.04 "Disco Dingo" - Release amd64 (20190416.1)
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
MachineType: Sugon HygonH210
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
TERM=linux
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcFB: 0 astdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.0.0-13-generic root=UUID=43f8bc11-d850-4e79-9d14-1232ef50040f ro
ProcVersionSignature: Ubuntu 5.0.0-13.14-generic 5.0.6
RelatedPackageVersions:
linux-restricted-modules-5.0.0-13-generic N/A
linux-backports-modules-5.0.0-13-generic N/A
linux-firmware 1.178
RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
Tags: disco
Uname: Linux 5.0.0-13-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:
_MarkForUpload: True
dmi.bios.date: 03/15/2019
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 210ER119
dmi.board.asset.tag: Default string
dmi.board.name: HygonH210
dmi.board.vendor: Sugon
dmi.board.version: Default string
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 17
dmi.chassis.vendor: Sugon
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr210ER119:bd03/15/2019:svnSugon:pnHygonH210:pvrDefaultstring:rvnSugon:rnHygonH210:rvrDefaultstring:cvnSugon:ct17:cvrDefaultstring:
dmi.product.family: Rack
dmi.product.name: HygonH210
dmi.product.sku: Default string
dmi.product.version: Default string
dmi.sys.vendor: Sugon |
|
2019-12-24 09:46:47 |
fan jinke |
attachment added |
|
CRDA.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315017/+files/CRDA.txt |
|
2019-12-24 09:46:49 |
fan jinke |
attachment added |
|
CurrentDmesg.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315018/+files/CurrentDmesg.txt |
|
2019-12-24 09:46:50 |
fan jinke |
attachment added |
|
Lspci.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315019/+files/Lspci.txt |
|
2019-12-24 09:46:52 |
fan jinke |
attachment added |
|
Lsusb.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315020/+files/Lsusb.txt |
|
2019-12-24 09:46:55 |
fan jinke |
attachment added |
|
ProcCpuinfo.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315021/+files/ProcCpuinfo.txt |
|
2019-12-24 09:46:56 |
fan jinke |
attachment added |
|
ProcCpuinfoMinimal.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315022/+files/ProcCpuinfoMinimal.txt |
|
2019-12-24 09:46:58 |
fan jinke |
attachment added |
|
ProcInterrupts.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315023/+files/ProcInterrupts.txt |
|
2019-12-24 09:46:59 |
fan jinke |
attachment added |
|
ProcModules.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315024/+files/ProcModules.txt |
|
2019-12-24 09:47:01 |
fan jinke |
attachment added |
|
UdevDb.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315025/+files/UdevDb.txt |
|
2019-12-24 09:47:03 |
fan jinke |
attachment added |
|
WifiSyslog.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315026/+files/WifiSyslog.txt |
|
2019-12-26 09:47:46 |
Po-Hsu Lin |
nominated for series |
|
Ubuntu Disco |
|
2019-12-26 09:47:46 |
Po-Hsu Lin |
bug task added |
|
linux (Ubuntu Disco) |
|
2019-12-26 12:06:29 |
Po-Hsu Lin |
bug |
|
|
added subscriber Po-Hsu Lin |
2019-12-31 09:40:17 |
Po-Hsu Lin |
linux (Ubuntu Disco): status |
New |
In Progress |
|
2019-12-31 09:40:18 |
Po-Hsu Lin |
linux (Ubuntu Disco): assignee |
|
Po-Hsu Lin (cypressyew) |
|
2019-12-31 09:45:53 |
Po-Hsu Lin |
linux (Ubuntu): status |
Incomplete |
Fix Released |
|
2019-12-31 09:51:46 |
Po-Hsu Lin |
description |
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
---
ProblemType: Bug
AlsaDevices:
total 0
crw-rw----+ 1 root audio 116, 1 Dec 24 17:20 seq
crw-rw----+ 1 root audio 116, 33 Dec 24 17:20 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
ApportVersion: 2.20.10-0ubuntu27
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
DistroRelease: Ubuntu 19.04
InstallationDate: Installed on 2019-12-24 (0 days ago)
InstallationMedia: Ubuntu-Server 19.04 "Disco Dingo" - Release amd64 (20190416.1)
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
MachineType: Sugon HygonH210
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
TERM=linux
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcFB: 0 astdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.0.0-13-generic root=UUID=43f8bc11-d850-4e79-9d14-1232ef50040f ro
ProcVersionSignature: Ubuntu 5.0.0-13.14-generic 5.0.6
RelatedPackageVersions:
linux-restricted-modules-5.0.0-13-generic N/A
linux-backports-modules-5.0.0-13-generic N/A
linux-firmware 1.178
RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
Tags: disco
Uname: Linux 5.0.0-13-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:
_MarkForUpload: True
dmi.bios.date: 03/15/2019
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 210ER119
dmi.board.asset.tag: Default string
dmi.board.name: HygonH210
dmi.board.vendor: Sugon
dmi.board.version: Default string
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 17
dmi.chassis.vendor: Sugon
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr210ER119:bd03/15/2019:svnSugon:pnHygonH210:pvrDefaultstring:rvnSugon:rnHygonH210:rvrDefaultstring:cvnSugon:ct17:cvrDefaultstring:
dmi.product.family: Rack
dmi.product.name: HygonH210
dmi.product.sku: Default string
dmi.product.version: Default string
dmi.sys.vendor: Sugon |
== SRU Justification ==
With the 5.0 Disco kernel, the kernel cannot record the mce log while
injecting 1bit ecc error.
== Fix ==
* 09cbd219 (RAS/CEC: Increment cec_entered under the mutex lock)
* de0e0624 (RAS/CEC: Check count_threshold unconditionally)
Commit de0e0624 is the real fix for this issue, 09cbd219 is a fix to
avoid race condition, and it can make the latter become a clean
cherry-pick.
These have been landed on newer kernels.
== Test ==
Test kernel could be found here:
https://people.canonical.com/~phlin/kernel/lp-1857413-ras-err-msg/
Verified by the bug reporter, fan jinke, the patched kernel can log
the error correctly.
== Regression Potential ==
Low, changes are limited to the RAS Correctable Errors Collector. And
the fix has been verified as working as expected.
== Original Bug Report ==
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
---
ProblemType: Bug
AlsaDevices:
total 0
crw-rw----+ 1 root audio 116, 1 Dec 24 17:20 seq
crw-rw----+ 1 root audio 116, 33 Dec 24 17:20 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
ApportVersion: 2.20.10-0ubuntu27
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
DistroRelease: Ubuntu 19.04
InstallationDate: Installed on 2019-12-24 (0 days ago)
InstallationMedia: Ubuntu-Server 19.04 "Disco Dingo" - Release amd64 (20190416.1)
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
MachineType: Sugon HygonH210
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
TERM=linux
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcFB: 0 astdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.0.0-13-generic root=UUID=43f8bc11-d850-4e79-9d14-1232ef50040f ro
ProcVersionSignature: Ubuntu 5.0.0-13.14-generic 5.0.6
RelatedPackageVersions:
linux-restricted-modules-5.0.0-13-generic N/A
linux-backports-modules-5.0.0-13-generic N/A
linux-firmware 1.178
RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
Tags: disco
Uname: Linux 5.0.0-13-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:
_MarkForUpload: True
dmi.bios.date: 03/15/2019
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 210ER119
dmi.board.asset.tag: Default string
dmi.board.name: HygonH210
dmi.board.vendor: Sugon
dmi.board.version: Default string
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 17
dmi.chassis.vendor: Sugon
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr210ER119:bd03/15/2019:svnSugon:pnHygonH210:pvrDefaultstring:rvnSugon:rnHygonH210:rvrDefaultstring:cvnSugon:ct17:cvrDefaultstring:
dmi.product.family: Rack
dmi.product.name: HygonH210
dmi.product.sku: Default string
dmi.product.version: Default string
dmi.sys.vendor: Sugon |
|
2020-01-02 07:33:21 |
fan jinke |
attachment removed |
WifiSyslog.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315026/+files/WifiSyslog.txt |
|
|
2020-01-02 07:33:35 |
fan jinke |
attachment removed |
Lspci.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315019/+files/Lspci.txt |
|
|
2020-01-02 07:33:49 |
fan jinke |
attachment removed |
Lsusb.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315020/+files/Lsusb.txt |
|
|
2020-01-02 07:34:00 |
fan jinke |
attachment removed |
ProcCpuinfo.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315021/+files/ProcCpuinfo.txt |
|
|
2020-01-02 07:34:21 |
fan jinke |
attachment removed |
UdevDb.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315025/+files/UdevDb.txt |
|
|
2020-01-02 07:34:45 |
fan jinke |
attachment removed |
CRDA.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315017/+files/CRDA.txt |
|
|
2020-01-02 07:34:58 |
fan jinke |
attachment removed |
CurrentDmesg.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315018/+files/CurrentDmesg.txt |
|
|
2020-01-02 07:35:08 |
fan jinke |
attachment removed |
ProcCpuinfoMinimal.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315022/+files/ProcCpuinfoMinimal.txt |
|
|
2020-01-02 07:35:20 |
fan jinke |
attachment removed |
ProcInterrupts.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315023/+files/ProcInterrupts.txt |
|
|
2020-01-02 07:35:35 |
fan jinke |
attachment removed |
ProcModules.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315024/+files/ProcModules.txt |
|
|
2020-01-06 22:25:58 |
Khaled El Mously |
linux (Ubuntu Disco): status |
In Progress |
Fix Committed |
|
2020-01-10 18:03:37 |
Ubuntu Kernel Bot |
tags |
apport-collected disco |
apport-collected disco verification-needed-disco |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
linux (Ubuntu Disco): status |
Fix Committed |
Fix Released |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2019-14615 |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2019-18885 |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2019-19050 |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2019-19077 |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2019-19078 |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2019-19082 |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2019-19332 |
|
2020-01-27 13:21:23 |
Launchpad Janitor |
cve linked |
|
2020-7053 |
|