The system hangs after a while of usage

Bug #1868494 reported by Lawrence Ibarria
This bug affects 1 person
Affects         Status     Importance  Assigned to  Milestone
linux (Ubuntu)  Confirmed  Undecided   Unassigned

Bug Description

Since this is a hard hang, I suspect WireGuard. I use WireGuard as the VPN to connect to my office.

ProblemType: KernelCrash
DistroRelease: Ubuntu 20.04
Package: linux-image-5.4.0-18-generic 5.4.0-18.22
ProcVersionSignature: Ubuntu 5.4.0-18.22-generic 5.4.24
Uname: Linux 5.4.0-18-generic x86_64
NonfreeKernelModules: nvidia_modeset nvidia
ApportVersion: 2.20.11-0ubuntu21
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC1: gdm 1316 F.... pulseaudio
                      lawrence 2050 F.... pulseaudio
 /dev/snd/controlC0: gdm 1316 F.... pulseaudio
                      lawrence 2050 F.... pulseaudio
Date: Sun Mar 22 13:20:14 2020
InstallationDate: Installed on 2020-02-06 (45 days ago)
InstallationMedia: Ubuntu 19.10 "Eoan Ermine" - Release amd64 (20191017)
MachineType: Gigabyte Technology Co., Ltd. X299 AORUS Ultra Gaming
ProcFB: 0 EFI VGA
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.4.0-18-generic root=UUID=f3a48b61-47e3-43d3-9d03-b5ad88bf966d ro quiet splash crashkernel=512M-:192M vt.handoff=7
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
 linux-restricted-modules-5.4.0-18-generic N/A
 linux-backports-modules-5.4.0-18-generic N/A
 linux-firmware 1.187
RfKill:
 0: hci0: Bluetooth
  Soft blocked: no
  Hard blocked: no
SourcePackage: linux
StagingDrivers: exfat
UpgradeStatus: Upgraded to focal on 2020-03-22 (0 days ago)
dmi.bios.date: 09/14/2017
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: F3
dmi.board.asset.tag: Default string
dmi.board.name: X299 AORUS Ultra Gaming-CF
dmi.board.vendor: Gigabyte Technology Co., Ltd.
dmi.board.version: x.x
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 3
dmi.chassis.vendor: Default string
dmi.chassis.version: Default string
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvrF3:bd09/14/2017:svnGigabyteTechnologyCo.,Ltd.:pnX299AORUSUltraGaming:pvrDefaultstring:rvnGigabyteTechnologyCo.,Ltd.:rnX299AORUSUltraGaming-CF:rvrx.x:cvnDefaultstring:ct3:cvrDefaultstring:
dmi.product.family: Default string
dmi.product.name: X299 AORUS Ultra Gaming
dmi.product.sku: Default string
dmi.product.version: Default string
dmi.sys.vendor: Gigabyte Technology Co., Ltd.

Lawrence Ibarria (lawrence-verdant) wrote :
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1868494/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Juhani Numminen (jsonic)
affects: ubuntu → linux (Ubuntu)
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
Kai-Heng Feng (kaihengfeng) wrote :

[ 1736.165122] mce: [Hardware Error]: CPU 4: Machine Check Exception: 5 Bank 0: f200000000070005
[ 1736.165125] mce: [Hardware Error]: RIP !INEXACT! 33:<00007fffd19a92e1>
[ 1736.165126] mce: [Hardware Error]: TSC e32f5eeda4d6
[ 1736.165127] mce: [Hardware Error]: PROCESSOR 0:50654 TIME 1584908346 SOCKET 0 APIC 8 microcode 2000064
[ 1736.165128] mce: [Hardware Error]: Run the above through 'mcelog --ascii'
[ 1736.165129] mce: [Hardware Error]: Machine check: Processor context corrupt
[ 1736.165130] Kernel panic - not syncing: Fatal machine check
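
The status word in the first log line can be decoded by hand. The following is a minimal sketch, assuming the value follows the IA32_MCi_STATUS bit layout documented in the Intel SDM; the function name and the returned dictionary keys are illustrative and not part of any kernel or mcelog interface.

```python
# Flag bits of IA32_MCi_STATUS (layout assumed from the Intel SDM).
FLAGS = {
    63: "VAL",    # register contents are valid
    62: "OVER",   # an earlier error was overwritten
    61: "UC",     # uncorrected error
    60: "EN",     # error reporting enabled
    59: "MISCV",  # IA32_MCi_MISC holds valid data
    58: "ADDRV",  # IA32_MCi_ADDR holds valid data
    57: "PCC",    # processor context corrupt
}


def decode_mci_status(status: int) -> dict:
    """Split a 64-bit machine check status word into flags and error codes."""
    return {
        "flags": [name for bit, name in FLAGS.items() if (status >> bit) & 1],
        "mca_error_code": status & 0xFFFF,              # bits 15:0
        "model_specific_code": (status >> 16) & 0xFFFF,  # bits 31:16
    }


# Status value taken from the "Bank 0" line of the log above.
decoded = decode_mci_status(0xF200000000070005)
print(decoded["flags"])
print(hex(decoded["mca_error_code"]), hex(decoded["model_specific_code"]))
```

The PCC flag matches the kernel's "Processor context corrupt" message, and neither ADDRV nor MISCV is set, so the record carries no memory address. In the SDM's table of simple error codes, 0x0005 is listed as an internal parity error, which would point at the CPU rather than at a DIMM or disk.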

Kai-Heng Feng (kaihengfeng) wrote :

Seems to be a hardware error. Do you see this in previous kernel versions?
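
The PROCESSOR and TIME fields of the MCE record identify the CPU and the moment of the crash. A rough sketch, assuming the value after the colon in "PROCESSOR 0:50654" is the hexadecimal CPUID signature (EAX of CPUID leaf 1) and TIME is a Unix timestamp in seconds; the helper name is hypothetical.

```python
from datetime import datetime, timezone


def decode_cpuid_signature(sig: int) -> dict:
    """Extract display family, model and stepping from a CPUID signature."""
    stepping = sig & 0xF
    model = (sig >> 4) & 0xF
    family = (sig >> 8) & 0xF
    ext_model = (sig >> 16) & 0xF
    ext_family = (sig >> 20) & 0xFF
    # The extended model is only combined in for families 6 and 15.
    if family in (0x6, 0xF):
        model |= ext_model << 4
    # The extended family is only added for family 15.
    if family == 0xF:
        family += ext_family
    return {"family": family, "model": model, "stepping": stepping}


# Values taken from the "PROCESSOR 0:50654 TIME 1584908346" log line.
cpu = decode_cpuid_signature(0x50654)
when = datetime.fromtimestamp(1584908346, tz=timezone.utc)
print(cpu)
print(when.isoformat())
```

Family 6, model 0x55 (85), stepping 4 is consistent with a Skylake-X part on the X299 board named in the report, and the timestamp falls on the same day as the report's Date field.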

Lawrence Ibarria (lawrence-verdant) wrote : Re: [Bug 1868494] Re: The system hangs after a while of usage

I started seeing this error when moving to 19.04 and later to 20.04.
I suspected a hardware error, yet memtest (run from an Ubuntu boot) came out fine after a couple of hours.
I suspected hardware because the failure tends to happen under heavy load, although it has also happened during normal operation (Chrome might trigger it).
Is there any indication of which hardware (disk, memory, CPU or some other device) it could be?



Kai-Heng Feng (kaihengfeng) wrote :

The MCE is reported by the CPU, so it is possible for memtest to pass.

Please test the latest mainline kernel:
https://kernel.ubuntu.com/~kernel-ppa/mainline/v5.6-rc7/

Lawrence Ibarria (lawrence-verdant) wrote :

I got the same error with the suggested mainline kernel. I am unable to run the ubuntu-bug linux command; it complains that a package is missing.

The CPU is water-cooled, so either it has broken or there is a problem in some other hardware.
I might have to replace the CPU and thus a large portion of the system.
