Resume fail after suspend with NVIDIA driver on Ubuntu 22.04
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
nvidia-graphics-drivers-510 (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
After upgrading from 21.10 to 22.04, the suspend/resume stopped working on my laptop with an NVIDIA discrete card.
The computer goes in sleep mode, but when resuming, the screen remain black.
The issue happens with both NVIDIA driver 470 and 510. Previously, I was running the 470 because I use MATE and there was a bug with Xorg/MATE/NVIDIA 510, so I don't know how it behave in Ubuntu 21.10 and NVIDIA driver 510.
I could not try with an older kernel on Ubuntu 22.04, because DKMS refused to build the driver for older kernels.
The dmesg/kern.log contains a backtrace:
Apr 24 11:26:16 thebat kernel: [ 70.354793] ------------[ cut here ]------------
Apr 24 11:26:16 thebat kernel: [ 70.354795] WARNING: CPU: 2 PID: 4127 at /var/lib/
Apr 24 11:26:16 thebat kernel: [ 70.354974] Modules linked in: vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) nvme_fabrics rfcomm ccm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT xt_tcpudp nft_compat nf_nat_tftp nft_objref nf_conntrack_tftp nft_counter nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib bridge stp llc nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nfnetlink cmac algif_hash algif_skcipher af_alg bnep intel_rapl_msr intel_rapl_common intel_tcc_cooling uvcvideo videobuf2_vmalloc videobuf2_memops x86_pkg_
Apr 24 11:26:16 thebat kernel: [ 70.355006] input_leds snd_hda_codec snd_seq snd_hda_core iwlwifi snd_hwdep serio_raw intel_wmi_
Apr 24 11:26:16 thebat kernel: [ 70.355040] CPU: 2 PID: 4127 Comm: nvidia-sleep.sh Tainted: P OE 5.15.0-25-generic #25-Ubuntu
Apr 24 11:26:16 thebat kernel: [ 70.355042] Hardware name: Notebook P7xxDM(-G) /P775DM(-G) , BIOS 1.05.09 12/28/2015
Apr 24 11:26:16 thebat kernel: [ 70.355043] RIP: 0010:nv_
Apr 24 11:26:16 thebat kernel: [ 70.355158] Code: b2 b3 fa be 01 00 00 00 4c 89 ef e8 ec b3 00 00 48 89 df e8 14 b2 b3 fa ba 02 00 00 00 4c 89 ee 4c 89 e7 e8 04 61 94 00 eb 93 <0f> 0b eb c6 41 be 51 00 00 00 eb 9e 66 0f 1f 44 00 00 0f 1f 44 00
Apr 24 11:26:16 thebat kernel: [ 70.355159] RSP: 0018:ffffb5af83
Apr 24 11:26:16 thebat kernel: [ 70.355160] RAX: 0000000000000003 RBX: ffff8c4d1ec00800 RCX: ffffb5af83a5bce0
Apr 24 11:26:16 thebat kernel: [ 70.355161] RDX: 0000000000000087 RSI: 0000000000000246 RDI: ffff8c4d00ee1068
Apr 24 11:26:16 thebat kernel: [ 70.355162] RBP: ffffb5af83a5bd70 R08: 0000000000000000 R09: ffff8c54a65b1040
Apr 24 11:26:16 thebat kernel: [ 70.355163] R10: ffff8c4d03d04660 R11: 0000000000000000 R12: ffff8c4d1146b000
Apr 24 11:26:16 thebat kernel: [ 70.355164] R13: ffff8c4d1ec00800 R14: 0000000000000003 R15: ffff8c4d1ec00d10
Apr 24 11:26:16 thebat kernel: [ 70.355165] FS: 00007f14620a874
Apr 24 11:26:16 thebat kernel: [ 70.355166] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 24 11:26:16 thebat kernel: [ 70.355167] CR2: 000055bb914470c8 CR3: 000000010ee62003 CR4: 00000000003706e0
Apr 24 11:26:16 thebat kernel: [ 70.355168] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Apr 24 11:26:16 thebat kernel: [ 70.355168] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Apr 24 11:26:16 thebat kernel: [ 70.355169] Call Trace:
Apr 24 11:26:16 thebat kernel: [ 70.355170] <TASK>
Apr 24 11:26:16 thebat kernel: [ 70.355172] nv_set_
Apr 24 11:26:16 thebat kernel: [ 70.355286] nv_procfs_
Apr 24 11:26:16 thebat kernel: [ 70.355401] proc_reg_
Apr 24 11:26:16 thebat kernel: [ 70.355404] ? __cond_
Apr 24 11:26:16 thebat kernel: [ 70.355407] vfs_write+
Apr 24 11:26:16 thebat kernel: [ 70.355410] ksys_write+
Apr 24 11:26:16 thebat kernel: [ 70.355411] __x64_sys_
Apr 24 11:26:16 thebat kernel: [ 70.355412] do_syscall_
Apr 24 11:26:16 thebat kernel: [ 70.355414] ? do_user_
Apr 24 11:26:16 thebat kernel: [ 70.355417] ? exit_to_
Apr 24 11:26:16 thebat kernel: [ 70.355419] ? irqentry_
Apr 24 11:26:16 thebat kernel: [ 70.355420] ? irqentry_
Apr 24 11:26:16 thebat kernel: [ 70.355421] ? exc_page_
Apr 24 11:26:16 thebat kernel: [ 70.355422] ? asm_exc_
Apr 24 11:26:16 thebat kernel: [ 70.355424] entry_SYSCALL_
Apr 24 11:26:16 thebat kernel: [ 70.355426] RIP: 0033:0x7f14621bfa37
Apr 24 11:26:16 thebat kernel: [ 70.355427] Code: 10 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
Apr 24 11:26:16 thebat kernel: [ 70.355428] RSP: 002b:00007fff93
Apr 24 11:26:16 thebat kernel: [ 70.355430] RAX: ffffffffffffffda RBX: 0000000000000007 RCX: 00007f14621bfa37
Apr 24 11:26:16 thebat kernel: [ 70.355430] RDX: 0000000000000007 RSI: 0000555dafdaed00 RDI: 0000000000000001
Apr 24 11:26:16 thebat kernel: [ 70.355431] RBP: 0000555dafdaed00 R08: 0000000000000000 R09: 0000555dafdaed00
Apr 24 11:26:16 thebat kernel: [ 70.355432] R10: 0000000000000077 R11: 0000000000000246 R12: 0000000000000007
Apr 24 11:26:16 thebat kernel: [ 70.355433] R13: 00007f14622c5780 R14: 00007f14622c1600 R15: 00007f14622c0a00
Apr 24 11:26:16 thebat kernel: [ 70.355435] </TASK>
Apr 24 11:26:16 thebat kernel: [ 70.355435] ---[ end trace ce9942c23cb7434d ]---
Apr 24 11:26:16 thebat kernel: [ 70.355466] ------------[ cut here ]------------
Apr 24 11:26:16 thebat kernel: [ 70.355467] WARNING: CPU: 6 PID: 4127 at /var/lib/
Apr 24 11:26:16 thebat kernel: [ 70.355593] Modules linked in: vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) nvme_fabrics rfcomm ccm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT xt_tcpudp nft_compat nf_nat_tftp nft_objref nf_conntrack_tftp nft_counter nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib bridge stp llc nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nfnetlink cmac algif_hash algif_skcipher af_alg bnep intel_rapl_msr intel_rapl_common intel_tcc_cooling uvcvideo videobuf2_vmalloc videobuf2_memops x86_pkg_
Apr 24 11:26:16 thebat kernel: [ 70.355620] input_leds snd_hda_codec snd_seq snd_hda_core iwlwifi snd_hwdep serio_raw intel_wmi_
Apr 24 11:26:16 thebat kernel: [ 70.355646] CPU: 6 PID: 4127 Comm: nvidia-sleep.sh Tainted: P W OE 5.15.0-25-generic #25-Ubuntu
Apr 24 11:26:16 thebat kernel: [ 70.355648] Hardware name: Notebook P7xxDM(-G) /P775DM(-G) , BIOS 1.05.09 12/28/2015
Apr 24 11:26:16 thebat kernel: [ 70.355648] RIP: 0010:nv_
Apr 24 11:26:16 thebat kernel: [ 70.355762] Code: ff ff 41 83 fd 02 74 e9 49 8b 84 24 70 02 00 00 ba 02 00 00 00 48 8b 70 78 48 8b 78 60 e8 98 cf ff ff 85 c0 74 cb 0f 0b eb c7 <0f> 0b e9 5c ff ff ff 48 c7 c7 10 ea 95 c2 e8 2d 7f b3 fa e8 08 48
Apr 24 11:26:16 thebat kernel: [ 70.355763] RSP: 0018:ffffb5af83
Apr 24 11:26:16 thebat kernel: [ 70.355764] RAX: 0000000000000003 RBX: 0000000000000002 RCX: ffff8c4d00ee1cc0
Apr 24 11:26:16 thebat kernel: [ 70.355764] RDX: 0000000080020002 RSI: ffffffffc05f18e8 RDI: ffff8c4d204e3800
Apr 24 11:26:16 thebat kernel: [ 70.355765] RBP: ffffb5af83a5bdb0 R08: 0000000000000001 R09: 0000000000000000
Apr 24 11:26:16 thebat kernel: [ 70.355766] R10: 0000000000000001 R11: 0000000000000001 R12: ffff8c4d1ec00800
Apr 24 11:26:16 thebat kernel: [ 70.355767] R13: 0000000000000000 R14: 0000555dafdaed00 R15: ffffb5af83a5be58
Apr 24 11:26:16 thebat kernel: [ 70.355767] FS: 00007f14620a874
Apr 24 11:26:16 thebat kernel: [ 70.355768] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Apr 24 11:26:16 thebat kernel: [ 70.355769] CR2: 0000562ef2ef45e8 CR3: 000000010ee62006 CR4: 00000000003706e0
Apr 24 11:26:16 thebat kernel: [ 70.355770] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Apr 24 11:26:16 thebat kernel: [ 70.355771] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Apr 24 11:26:16 thebat kernel: [ 70.355771] Call Trace:
Apr 24 11:26:16 thebat kernel: [ 70.355772] <TASK>
Apr 24 11:26:16 thebat kernel: [ 70.355773] nv_procfs_
Apr 24 11:26:16 thebat kernel: [ 70.355888] proc_reg_
Apr 24 11:26:16 thebat kernel: [ 70.355890] ? __cond_
Apr 24 11:26:16 thebat kernel: [ 70.355893] vfs_write+
Apr 24 11:26:16 thebat kernel: [ 70.355895] ksys_write+
Apr 24 11:26:16 thebat kernel: [ 70.355896] __x64_sys_
Apr 24 11:26:16 thebat kernel: [ 70.355897] do_syscall_
Apr 24 11:26:16 thebat kernel: [ 70.355899] ? do_user_
Apr 24 11:26:16 thebat kernel: [ 70.355901] ? exit_to_
Apr 24 11:26:16 thebat kernel: [ 70.355902] ? irqentry_
Apr 24 11:26:16 thebat kernel: [ 70.355904] ? irqentry_
Apr 24 11:26:16 thebat kernel: [ 70.355905] ? exc_page_
Apr 24 11:26:16 thebat kernel: [ 70.355906] ? asm_exc_
Apr 24 11:26:16 thebat kernel: [ 70.355907] entry_SYSCALL_
Apr 24 11:26:16 thebat kernel: [ 70.355909] RIP: 0033:0x7f14621bfa37
Apr 24 11:26:16 thebat kernel: [ 70.355910] Code: 10 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b7 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 48 89 54 24 18 48 89 74 24
Apr 24 11:26:16 thebat kernel: [ 70.355911] RSP: 002b:00007fff93
Apr 24 11:26:16 thebat kernel: [ 70.355912] RAX: ffffffffffffffda RBX: 0000000000000007 RCX: 00007f14621bfa37
Apr 24 11:26:16 thebat kernel: [ 70.355913] RDX: 0000000000000007 RSI: 0000555dafdaed00 RDI: 0000000000000001
Apr 24 11:26:16 thebat kernel: [ 70.355913] RBP: 0000555dafdaed00 R08: 0000000000000000 R09: 0000555dafdaed00
Apr 24 11:26:16 thebat kernel: [ 70.355914] R10: 0000000000000077 R11: 0000000000000246 R12: 0000000000000007
Apr 24 11:26:16 thebat kernel: [ 70.355915] R13: 00007f14622c5780 R14: 00007f14622c1600 R15: 00007f14622c0a00
Apr 24 11:26:16 thebat kernel: [ 70.355916] </TASK>
Apr 24 11:26:16 thebat kernel: [ 70.355917] ---[ end trace ce9942c23cb7434e ]---
Apr 24 11:26:19 thebat kernel: [ 73.358424] nvidia-modeset: WARNING: GPU:0: Lost display notification (0:0x00000000); continuing.
Apr 24 11:26:21 thebat kernel: [ 75.613255] nvidia-modeset: ERROR: GPU:0: Idling display engine timed out: 0x0000957d:0:0:407
A workaround I found is to disable nvidia-resume and nvidia-suspend services:
systemctl disable nvidia- hibernate. service nvidia- resume. service nvidia- suspend. service
Suspend/resume works again after that.