Activity log for bug #1857413

Date Who What changed Old value New value Message
2019-12-24 07:00:12 fan jinke bug added bug
2019-12-24 07:14:47 fan jinke description
Old value:
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, dmesg can record the mce log.
New value:
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
2019-12-24 07:30:08 Ubuntu Kernel Bot linux (Ubuntu): status New Incomplete
2019-12-24 09:46:45 fan jinke tags apport-collected disco
2019-12-24 09:46:46 fan jinke description
Old value:
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
New value:
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
---
ProblemType: Bug
AlsaDevices:
 total 0
 crw-rw----+ 1 root audio 116, 1 Dec 24 17:20 seq
 crw-rw----+ 1 root audio 116, 33 Dec 24 17:20 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
ApportVersion: 2.20.10-0ubuntu27
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
DistroRelease: Ubuntu 19.04
InstallationDate: Installed on 2019-12-24 (0 days ago)
InstallationMedia: Ubuntu-Server 19.04 "Disco Dingo" - Release amd64 (20190416.1)
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
MachineType: Sugon HygonH210
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
 TERM=linux
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcFB: 0 astdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.0.0-13-generic root=UUID=43f8bc11-d850-4e79-9d14-1232ef50040f ro
ProcVersionSignature: Ubuntu 5.0.0-13.14-generic 5.0.6
RelatedPackageVersions:
 linux-restricted-modules-5.0.0-13-generic N/A
 linux-backports-modules-5.0.0-13-generic N/A
 linux-firmware 1.178
RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
Tags: disco
Uname: Linux 5.0.0-13-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:
_MarkForUpload: True
dmi.bios.date: 03/15/2019
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 210ER119
dmi.board.asset.tag: Default string
dmi.board.name: HygonH210
dmi.board.vendor: Sugon
dmi.board.version: Default string
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 17
dmi.chassis.vendor: Sugon
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr210ER119:bd03/15/2019:svnSugon:pnHygonH210:pvrDefaultstring:rvnSugon:rnHygonH210:rvrDefaultstring:cvnSugon:ct17:cvrDefaultstring:
dmi.product.family: Rack
dmi.product.name: HygonH210
dmi.product.sku: Default string
dmi.product.version: Default string
dmi.sys.vendor: Sugon
2019-12-24 09:46:47 fan jinke attachment added CRDA.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315017/+files/CRDA.txt
2019-12-24 09:46:49 fan jinke attachment added CurrentDmesg.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315018/+files/CurrentDmesg.txt
2019-12-24 09:46:50 fan jinke attachment added Lspci.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315019/+files/Lspci.txt
2019-12-24 09:46:52 fan jinke attachment added Lsusb.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315020/+files/Lsusb.txt
2019-12-24 09:46:55 fan jinke attachment added ProcCpuinfo.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315021/+files/ProcCpuinfo.txt
2019-12-24 09:46:56 fan jinke attachment added ProcCpuinfoMinimal.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315022/+files/ProcCpuinfoMinimal.txt
2019-12-24 09:46:58 fan jinke attachment added ProcInterrupts.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315023/+files/ProcInterrupts.txt
2019-12-24 09:46:59 fan jinke attachment added ProcModules.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315024/+files/ProcModules.txt
2019-12-24 09:47:01 fan jinke attachment added UdevDb.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315025/+files/UdevDb.txt
2019-12-24 09:47:03 fan jinke attachment added WifiSyslog.txt https://bugs.launchpad.net/bugs/1857413/+attachment/5315026/+files/WifiSyslog.txt
2019-12-26 09:47:46 Po-Hsu Lin nominated for series Ubuntu Disco
2019-12-26 09:47:46 Po-Hsu Lin bug task added linux (Ubuntu Disco)
2019-12-26 12:06:29 Po-Hsu Lin bug added subscriber Po-Hsu Lin
2019-12-31 09:40:17 Po-Hsu Lin linux (Ubuntu Disco): status New In Progress
2019-12-31 09:40:18 Po-Hsu Lin linux (Ubuntu Disco): assignee Po-Hsu Lin (cypressyew)
2019-12-31 09:45:53 Po-Hsu Lin linux (Ubuntu): status Incomplete Fix Released
2019-12-31 09:51:46 Po-Hsu Lin description
Old value:
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
---
ProblemType: Bug
AlsaDevices:
 total 0
 crw-rw----+ 1 root audio 116, 1 Dec 24 17:20 seq
 crw-rw----+ 1 root audio 116, 33 Dec 24 17:20 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
ApportVersion: 2.20.10-0ubuntu27
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
DistroRelease: Ubuntu 19.04
InstallationDate: Installed on 2019-12-24 (0 days ago)
InstallationMedia: Ubuntu-Server 19.04 "Disco Dingo" - Release amd64 (20190416.1)
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
MachineType: Sugon HygonH210
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
 TERM=linux
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcFB: 0 astdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.0.0-13-generic root=UUID=43f8bc11-d850-4e79-9d14-1232ef50040f ro
ProcVersionSignature: Ubuntu 5.0.0-13.14-generic 5.0.6
RelatedPackageVersions:
 linux-restricted-modules-5.0.0-13-generic N/A
 linux-backports-modules-5.0.0-13-generic N/A
 linux-firmware 1.178
RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
Tags: disco
Uname: Linux 5.0.0-13-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:
_MarkForUpload: True
dmi.bios.date: 03/15/2019
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 210ER119
dmi.board.asset.tag: Default string
dmi.board.name: HygonH210
dmi.board.vendor: Sugon
dmi.board.version: Default string
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 17
dmi.chassis.vendor: Sugon
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr210ER119:bd03/15/2019:svnSugon:pnHygonH210:pvrDefaultstring:rvnSugon:rnHygonH210:rvrDefaultstring:cvnSugon:ct17:cvrDefaultstring:
dmi.product.family: Rack
dmi.product.name: HygonH210
dmi.product.sku: Default string
dmi.product.version: Default string
dmi.sys.vendor: Sugon
New value:
== SRU Justification ==
With the 5.0 Disco kernel, the kernel cannot record the mce log while injecting 1bit ecc error.
== Fix ==
* 09cbd219 (RAS/CEC: Increment cec_entered under the mutex lock)
* de0e0624 (RAS/CEC: Check count_threshold unconditionally)
Commit de0e0624 is the real fix for this issue, 09cbd219 is a fix to avoid race condition, and it can make the latter become a clean cherry-pick.
These have been landed on newer kernels.
== Test ==
Test kernel could be found here: https://people.canonical.com/~phlin/kernel/lp-1857413-ras-err-msg/
Verified by the bug reporter, fan jinke, the patched kernel can log the error correctly.
== Regression Potential ==
Low, changes are limited to the RAS Correctable Errors Collector. And the fix has been verified as working as expected.
== Original Bug Report ==
Using Linux kernel, When inject 1bit ecc error, there are some mce log recorded in the dmesg.like:
[ 1561.511210] mce: [Hardware Error]: Machine check events logged
[ 1561.511221] [Hardware Error]: Corrected error, no action required.
[ 1561.511311] [Hardware Error]: CPU:0 (18:0:2) MC16_STATUS[Over|CE|MiscV|-|AddrV|-|-|SyndV|-|CECC]: 0xdc2040000000011b
[ 1561.511388] [Hardware Error]: Error Addr: 0x000000077cd66940
[ 1561.511439] [Hardware Error]: IPID: 0x0000009600150f00, Syndrome: 0x000010ce0a400d01
[ 1561.511499] [Hardware Error]: Unified Memory Controller Extended Error Code: 0
[ 1561.511556] [Hardware Error]: Unified Memory Controller Error: DRAM ECC error.
[ 1561.511646] EDAC MC0: 1 CE on mc#0csrow#1channel#1 (csrow:1 channel:1 page:0x7fcd66 offset:0x940 grain:0 syndrome:0x10ce)
[ 1561.511648] [Hardware Error]: cache level: L3/GEN, tx: GEN, mem-tx: RD
*But, there are no the log when Using "Ubuntu 18.04.3 LTS"*
The upstream related commit is de0e0624d86ff9fc512dedb297f8978698abf21a .
After merged this commit, Ubuntu kernel's dmesg can record the mce log as well.
---
ProblemType: Bug
AlsaDevices:
 total 0
 crw-rw----+ 1 root audio 116, 1 Dec 24 17:20 seq
 crw-rw----+ 1 root audio 116, 33 Dec 24 17:20 timer
AplayDevices: Error: [Errno 2] No such file or directory: 'aplay': 'aplay'
ApportVersion: 2.20.10-0ubuntu27
Architecture: amd64
ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord': 'arecord'
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
DistroRelease: Ubuntu 19.04
InstallationDate: Installed on 2019-12-24 (0 days ago)
InstallationMedia: Ubuntu-Server 19.04 "Disco Dingo" - Release amd64 (20190416.1)
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig': 'iwconfig'
MachineType: Sugon HygonH210
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
 TERM=linux
 PATH=(custom, no user)
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcFB: 0 astdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.0.0-13-generic root=UUID=43f8bc11-d850-4e79-9d14-1232ef50040f ro
ProcVersionSignature: Ubuntu 5.0.0-13.14-generic 5.0.6
RelatedPackageVersions:
 linux-restricted-modules-5.0.0-13-generic N/A
 linux-backports-modules-5.0.0-13-generic N/A
 linux-firmware 1.178
RfKill: Error: [Errno 2] No such file or directory: 'rfkill': 'rfkill'
Tags: disco
Uname: Linux 5.0.0-13-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups:
_MarkForUpload: True
dmi.bios.date: 03/15/2019
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 210ER119
dmi.board.asset.tag: Default string
dmi.board.name: HygonH210
dmi.board.vendor: Sugon
dmi.board.version: Default string
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 17
dmi.chassis.vendor: Sugon
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr210ER119:bd03/15/2019:svnSugon:pnHygonH210:pvrDefaultstring:rvnSugon:rnHygonH210:rvrDefaultstring:cvnSugon:ct17:cvrDefaultstring:
dmi.product.family: Rack
dmi.product.name: HygonH210
dmi.product.sku: Default string
dmi.product.version: Default string
dmi.sys.vendor: Sugon
2020-01-02 07:33:21 fan jinke attachment removed WifiSyslog.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315026/+files/WifiSyslog.txt
2020-01-02 07:33:35 fan jinke attachment removed Lspci.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315019/+files/Lspci.txt
2020-01-02 07:33:49 fan jinke attachment removed Lsusb.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315020/+files/Lsusb.txt
2020-01-02 07:34:00 fan jinke attachment removed ProcCpuinfo.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315021/+files/ProcCpuinfo.txt
2020-01-02 07:34:21 fan jinke attachment removed UdevDb.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315025/+files/UdevDb.txt
2020-01-02 07:34:45 fan jinke attachment removed CRDA.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315017/+files/CRDA.txt
2020-01-02 07:34:58 fan jinke attachment removed CurrentDmesg.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315018/+files/CurrentDmesg.txt
2020-01-02 07:35:08 fan jinke attachment removed ProcCpuinfoMinimal.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315022/+files/ProcCpuinfoMinimal.txt
2020-01-02 07:35:20 fan jinke attachment removed ProcInterrupts.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315023/+files/ProcInterrupts.txt
2020-01-02 07:35:35 fan jinke attachment removed ProcModules.txt https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1857413/+attachment/5315024/+files/ProcModules.txt
2020-01-06 22:25:58 Khaled El Mously linux (Ubuntu Disco): status In Progress Fix Committed
2020-01-10 18:03:37 Ubuntu Kernel Bot tags apport-collected disco apport-collected disco verification-needed-disco
2020-01-27 13:21:23 Launchpad Janitor linux (Ubuntu Disco): status Fix Committed Fix Released
2020-01-27 13:21:23 Launchpad Janitor cve linked 2019-14615
2020-01-27 13:21:23 Launchpad Janitor cve linked 2019-18885
2020-01-27 13:21:23 Launchpad Janitor cve linked 2019-19050
2020-01-27 13:21:23 Launchpad Janitor cve linked 2019-19077
2020-01-27 13:21:23 Launchpad Janitor cve linked 2019-19078
2020-01-27 13:21:23 Launchpad Janitor cve linked 2019-19082
2020-01-27 13:21:23 Launchpad Janitor cve linked 2019-19332
2020-01-27 13:21:23 Launchpad Janitor cve linked 2020-7053
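Note on the MC16_STATUS value quoted in the bug description: the flag list the kernel prints next to 0xdc2040000000011b can be reproduced by testing individual bits of the register. The sketch below is illustrative only and is not taken from the bug or from the kernel source. The helper name decode_status and the FLAGS table are hypothetical, and the bit positions are an assumption based on the commonly documented x86 machine-check status layout (Val 63, Over 62, UC 61, En 60, MiscV 59, AddrV 58, PCC 57) plus the AMD-specific SyndV (53), CECC (46), UECC (45), Deferred (44) and Poison (43) bits.

```python
# Assumed bit positions in the machine-check status register.
# These are labeled assumptions for illustration, not an authoritative table.
FLAGS = {
    63: "Val",
    62: "Over",
    61: "UC",
    60: "En",
    59: "MiscV",
    58: "AddrV",
    57: "PCC",
    53: "SyndV",
    46: "CECC",
    45: "UECC",
    44: "Deferred",
    43: "Poison",
}


def decode_status(status):
    """Return the names of the flag bits set in `status`, highest bit first."""
    return [
        name
        for bit, name in sorted(FLAGS.items(), reverse=True)
        if (status >> bit) & 1
    ]


if __name__ == "__main__":
    # Status value copied from the dmesg excerpt in the bug description.
    print(decode_status(0xDC2040000000011B))
```

For the value in the report, the UC bit is clear and the CECC bit is set, which is consistent with the "Corrected error, no action required" line in the same dmesg excerpt.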