Comment 0 for bug 2012908

Revision history for this message
Simon Chopin (schopin) wrote :

I just updated my Lunar install, which brought me the 6.2 kernel, and it failed to start, stalling after enumerating my USB devices, until something times out with a message saying that the udev event queue failed to be drained.

When attempting to move on to the normal graphics boot from rescue mode once the timeout is hit, I simply get a black screen.

I'm blaming this on the nvidia driver because of this appearing in the log:

mars 27 10:33:54 gandalf kernel: INFO: task systemd-udevd:304 blocked for more than 120 seconds.
mars 27 10:33:54 gandalf kernel: Tainted: P OE 6.2.0-18-generic #18-Ubuntu
mars 27 10:33:54 gandalf kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
mars 27 10:33:54 gandalf kernel: task:systemd-udevd state:D stack:0 pid:304 ppid:258 flags:0x00004006
mars 27 10:33:54 gandalf kernel: Call Trace:
mars 27 10:33:54 gandalf kernel: <TASK>
mars 27 10:33:54 gandalf kernel: __schedule+0x2aa/0x610
mars 27 10:33:54 gandalf kernel: schedule+0x63/0x110
mars 27 10:33:54 gandalf kernel: schedule_preempt_disabled+0x15/0x30
mars 27 10:33:54 gandalf kernel: __mutex_lock.constprop.0+0x3f8/0x7a0
mars 27 10:33:54 gandalf kernel: ? __kmem_cache_alloc_node+0x19d/0x340
mars 27 10:33:54 gandalf kernel: ? nv_drm_calloc+0x1e/0x40 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: __mutex_lock_slowpath+0x13/0x20
mars 27 10:33:54 gandalf kernel: mutex_lock+0x3c/0x50
mars 27 10:33:54 gandalf kernel: __nv_drm_connector_detect_internal+0x15c/0x2f0 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: nv_drm_connector_detect+0xe/0x20 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: drm_helper_probe_detect_ctx+0xa3/0x120 [drm_kms_helper]
mars 27 10:33:54 gandalf kernel: check_connector_changed+0x52/0x200 [drm_kms_helper]
mars 27 10:33:54 gandalf kernel: drm_helper_hpd_irq_event+0xbc/0x170 [drm_kms_helper]
mars 27 10:33:54 gandalf kernel: nv_drm_load+0x2e7/0x480 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: ? __pfx_nv_drm_event_callback+0x10/0x10 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: drm_dev_register+0x10e/0x250 [drm]
mars 27 10:33:54 gandalf kernel: nv_drm_probe_devices+0x111/0x200 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: ? __pfx_init_module+0x10/0x10 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: nv_drm_init+0x1e/0x60 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: nv_linux_drm_init+0xe/0xff0 [nvidia_drm]
mars 27 10:33:54 gandalf kernel: do_one_initcall+0x5e/0x250
mars 27 10:33:54 gandalf kernel: do_init_module+0x7b/0x260
mars 27 10:33:54 gandalf kernel: load_module+0xc76/0xd60
mars 27 10:33:54 gandalf kernel: ? kernel_read_file+0x2a4/0x320
mars 27 10:33:54 gandalf kernel: __do_sys_finit_module+0xc4/0x140
mars 27 10:33:54 gandalf kernel: ? __do_sys_finit_module+0xc4/0x140
mars 27 10:33:54 gandalf kernel: __x64_sys_finit_module+0x18/0x30
mars 27 10:33:54 gandalf kernel: do_syscall_64+0x5b/0x90
mars 27 10:33:54 gandalf kernel: ? ksys_mmap_pgoff+0x120/0x260
mars 27 10:33:54 gandalf kernel: ? exit_to_user_mode_prepare+0x30/0xb0
mars 27 10:33:54 gandalf kernel: ? exit_to_user_mode_prepare+0x30/0xb0
mars 27 10:33:54 gandalf kernel: ? syscall_exit_to_user_mode+0x29/0x50
mars 27 10:33:54 gandalf kernel: ? do_syscall_64+0x67/0x90
mars 27 10:33:54 gandalf kernel: ? do_syscall_64+0x67/0x90
mars 27 10:33:54 gandalf kernel: ? exit_to_user_mode_prepare+0x30/0xb0
mars 27 10:33:54 gandalf kernel: ? syscall_exit_to_user_mode+0x29/0x50
mars 27 10:33:54 gandalf kernel: ? do_syscall_64+0x67/0x90
mars 27 10:33:54 gandalf kernel: ? syscall_exit_to_user_mode+0x29/0x50
mars 27 10:33:54 gandalf kernel: ? do_syscall_64+0x67/0x90
mars 27 10:33:54 gandalf kernel: ? do_syscall_64+0x67/0x90
mars 27 10:33:54 gandalf kernel: entry_SYSCALL_64_after_hwframe+0x72/0xdc
mars 27 10:33:54 gandalf kernel: RIP: 0033:0x7fd3dc85d89d
mars 27 10:33:54 gandalf kernel: RSP: 002b:00007ffc801034a8 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
mars 27 10:33:54 gandalf kernel: RAX: ffffffffffffffda RBX: 000055f373907c60 RCX: 00007fd3dc85d89d
mars 27 10:33:54 gandalf kernel: RDX: 0000000000000000 RSI: 000055f373889af0 RDI: 0000000000000012
mars 27 10:33:54 gandalf kernel: RBP: 000055f373889af0 R08: 0000000000000000 R09: 00007ffc801035d0
mars 27 10:33:54 gandalf kernel: R10: 0000000000000012 R11: 0000000000000246 R12: 0000000000020000
mars 27 10:33:54 gandalf kernel: R13: 000055f37389c120 R14: 0000000000000000 R15: 000055f373909080
mars 27 10:33:54 gandalf kernel: </TASK>

Also, the last entry before I had to do a hard shutdown was this:

mars 27 10:34:54 gandalf systemd-udevd[1023]: nvidia: Spawned process '/sbin/modprobe nvidia-drm' [1124] is taking longer than 59s to complete

I'm attaching the kernel logs up to the beginning of userspace logs. I can provide more if necessary.

My driver package:
ii nvidia-driver-515 515.86.01-0ubuntu3 amd64

My graphics card:
0b:00.0 VGA compatible controller: NVIDIA Corporation GA102 [GeForce RTX 3080 Ti] (rev a1)