NMI watchdog: Watchdog detected hard LOCKUP on cpu

Bug #1849803 reported by David Roberts
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

My system has been intermittently freezing (from once every couple of days up to several times a day; absolutely everything appears to be frozen, only the magic sysrq key works) for the past couple of weeks. Nothing in particular has changed recently other than updating packages. I've finally been able to get linux-crashdump set up so I at least have an error message to work with. I've attached the dmesg log it collected. I also have the kdump file it collected, which I'm hesistant to upload as I imagine it likely contains sensitive information (and it's also quite large), but let me know if there's any analysis it would be helpful for me to perform with it.

ProblemType: Bug
DistroRelease: Ubuntu 18.04
Package: linux-image-4.15.0-66-generic 4.15.0-66.75
ProcVersionSignature: Ubuntu 4.15.0-66.75-generic 4.15.18
Uname: Linux 4.15.0-66-generic x86_64
NonfreeKernelModules: nvidia_modeset nvidia
ApportVersion: 2.20.9-0ubuntu7.7
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC1: david 3067 F.... pulseaudio
 /dev/snd/pcmC0D0p: david 3067 F...m pulseaudio
 /dev/snd/controlC0: david 3067 F.... pulseaudio
CurrentDesktop: KDE
Date: Fri Oct 25 19:22:59 2019
InstallationDate: Installed on 2018-09-16 (404 days ago)
InstallationMedia: Ubuntu 18.04.1 LTS "Bionic Beaver" - Release amd64 (20180725)
IwConfig:
 docker0 no wireless extensions.

 lo no wireless extensions.

 enp3s0 no wireless extensions.
MachineType: System manufacturer System Product Name
ProcFB: 0 EFI VGA
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.15.0-66-generic root=UUID=42ee6282-7d26-455d-a36f-66ba166e1ec8 ro quiet splash crashkernel=512M-:192M vt.handoff=1
RelatedPackageVersions:
 linux-restricted-modules-4.15.0-66-generic N/A
 linux-backports-modules-4.15.0-66-generic N/A
 linux-firmware 1.173.9
RfKill:

SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 03/26/2012
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 0906
dmi.board.asset.tag: To be filled by O.E.M.
dmi.board.name: P8Z77-M
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: Rev 1.xx
dmi.chassis.asset.tag: Asset-1234567890
dmi.chassis.type: 3
dmi.chassis.vendor: Chassis Manufacture
dmi.chassis.version: Chassis Version
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr0906:bd03/26/2012:svnSystemmanufacturer:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnP8Z77-M:rvrRev1.xx:cvnChassisManufacture:ct3:cvrChassisVersion:
dmi.product.family: To be filled by O.E.M.
dmi.product.name: System Product Name
dmi.product.version: System Version
dmi.sys.vendor: System manufacturer

Revision history for this message
David Roberts (david.roberts) wrote :
Revision history for this message
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
David Roberts (david.roberts) wrote :
Download full text (3.8 KiB)

Another freeze, end of dmesg below. Given that zram keeps being mentioned in the error logs, I've now uninstalled zram-config to see if that helps.

[71288.261364] NMI watchdog: Watchdog detected hard LOCKUP on cpu 0
[71288.261364] Modules linked in: cfg80211 xt_conntrack ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype iptable_filter iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack br_netfilter bridge stp llc pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) aufs overlay zram intel_rapl x86_pkg_temp_thermal intel_powerclamp binfmt_misc coretemp kvm_intel nls_iso8859_1 kvm eeepc_wmi asus_wmi irqbypass sparse_keymap wmi_bmof mxm_wmi crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc snd_hda_codec_hdmi aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate snd_seq_midi joydev input_leds intel_rapl_perf snd_seq_midi_event snd_rawmidi snd_seq snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_codec
[71288.261372] snd_hda_core snd_hwdep serio_raw snd_pcm snd_seq_device snd_timer snd mei_me mei lpc_ich mac_hid soundcore wmi shpchp ie31200_edac nvidia_uvm(OE) sch_fq_codel sunrpc parport_pc ppdev lp parport ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear uas usb_storage hid_generic usbhid hid nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) psmouse ahci libahci drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops video drm r8169 ipmi_devintf mii ipmi_msghandler
[71288.261380] CPU: 0 PID: 9823 Comm: code Tainted: P OE 4.15.0-66-generic #75-Ubuntu
[71288.261380] Hardware name: System manufacturer System Product Name/P8Z77-M, BIOS 0906 03/26/2012
[71288.261380] RIP: 0010:zram_slot_free_notify+0x54/0x70 [zram]
[71288.261380] RSP: 0000:ffffb0924a1afc60 EFLAGS: 00000006
[71288.261381] RAX: ffffb092427be8f0 RBX: 00000000003438f0 RCX: ffffb092427be8f8
[71288.261381] RDX: 000000000200036b RSI: 000000000003438f RDI: ffff9fce0e1ec340
[71288.261381] RBP: ffffb0924a1afc70 R08: 00000000000001d5 R09: 0000000102000164
[71288.261382] R10: ffffb0924a1afbe0 R11: ffff9fcc16cf5b00 R12: ffff9fce0a9cc600
[71288.261382] R13: ffffffffc1efea80 R14: 000000000003438f R15: 020000000003438f
[71288.261382] FS: 00007f1109bc2700(0000) GS:ffff9fce1ec00000(0000) knlGS:0000000000000000
[71288.261382] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[71288.261383] CR2: 00001fa6faba3798 CR3: 0000000049f68004 CR4: 00000000001606f0
[71288.261383] Call Trace:
[71288.261383] swap_range_free+0x92/0xe0
[71288.261383] swapcache_free_entries+0x120/0x200
[71288.261384] free_swap_slot+0xd2/0xe0
[71288.261384] swap_free+0x36/0x40
[71288.261384] do_swap_page+0x73b/0x960
[71288.261384] __handle_mm_fault+0x7a3/0x1290
[71288.261384] handle_mm_fault+0xb1/0x210
[71288.261385] __do_page_fault+0x281/0x4b0
[71288.261385] ? SyS_futex+0x13b/0x180
[71288.261385] do_page_fault+0x2e/0xe0
[71288.261385] ? page_fault+0x2f/0x50
[71288.261385] page_fault+0x45/0x50
[71288.261385] RIP: 0033:0x55e07741813d
[71288.261386] RSP: 002b:00007f1...

Read more...

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.