NVIDIA quad-GPU system crashes starting Wayland session
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
mutter (Ubuntu) |
Expired
|
Low
|
Unassigned |
Bug Description
When logged into a desktop session, I created a new user account and then tried to "switch to" it. A new gnome session started for that user (test1), but after a few seconds, gnome-shell appears to have crashed. This only seems to happen on the first attempt to "switch to" a new user.
ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: xorg 1:7.7+23ubuntu2
ProcVersionSign
Uname: Linux 5.15.0-69-generic x86_64
NonfreeKernelMo
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
.proc.driver.
NVRM version: NVIDIA UNIX x86_64 Kernel Module 525.89.02 Wed Feb 1 23:23:25 UTC 2023
GCC version:
ApportVersion: 2.20.11-0ubuntu82.3
Architecture: amd64
CasperMD5CheckR
Date: Wed Apr 5 17:23:28 2023
DistUpgraded: Fresh install
DistroCodename: jammy
DistroVariant: ubuntu
ExtraDebuggingI
MachineType: NVIDIA DGX Station
ProcEnviron:
TERM=xterm-
PATH=(custom, no user)
LANG=C.UTF-8
SHELL=/bin/bash
ProcKernelCmdLine: BOOT_IMAGE=
SourcePackage: xorg
Symptom: display
Title: Xorg crash
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 08/27/2018
dmi.bios.release: 5.11
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 0406
dmi.board.
dmi.board.name: X99-E-10G WS
dmi.board.vendor: EMPTY
dmi.board.version: Rev 1.xx
dmi.chassis.
dmi.chassis.type: 3
dmi.chassis.vendor: EMPTY
dmi.chassis.
dmi.modalias: dmi:bvnAmerican
dmi.product.family: DGX
dmi.product.name: DGX Station
dmi.product.sku: 920-22587-2510-000
dmi.product.
dmi.sys.vendor: NVIDIA
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.113-
version.
version.
version.
version.
version.
version.
version.
version.
tags: | added: multigpu nvidia wayland wayland-session |
tags: | removed: need-amd64-retrace |
tags: | removed: need-amd64-retrace |
summary: |
- gnome-shell crashes after "switching to" a new user from an existing - login + Multi-NVIDIA-GPU system crashes starting Wayland session |
affects: | gnome-shell (Ubuntu) → mutter (Ubuntu) |
Changed in mutter (Ubuntu): | |
status: | Incomplete → New |
tags: | added: nvidia-wayland |
We need the robots to retrace that properly, but what I can figure so far is:
#0 __pthread_ kill_implementa tion (no_tid=0, signo=11, 139744913655232 ) at ./nptl/ pthread_ kill.c: 44 pthread_ kill.c: No such file or directory. kill_implementa tion (no_tid=0, signo=11, 139744913655232 ) at ./nptl/ pthread_ kill.c: 44 kill_internal (signo=11, threadid= 139744913655232 ) pthread_ kill.c: 78 139744913655232 , signo=signo@ entry=11) pthread_ kill.c: 89 posix/raise. c:26 x86_64/ multiarch/ strlen- avx2.S: 74 x86_64- linux-gnu/ dri/nouveau_ dri.so x86_64- linux-gnu/ dri/nouveau_ dri.so x86_64- linux-gnu/ dri/nouveau_ dri.so x86_64- linux-gnu/ dri/nouveau_ dri.so x86_64- linux-gnu/ dri/nouveau_ dri.so x86_64- linux-gnu/ dri/nouveau_ dri.so x86_64- linux-gnu/ dri/nouveau_ dri.so x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 64-linux- gnu/libglib- 2.0.so. 0 x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 framebuffer () x86_64- linux-gnu/ mutter- 10/libmutter- cogl-10. so.0 stage_view_ before_ swap_buffer () x86_64- linux-gnu/ mutter- 10/libmutter- clutter- 10.so.0 64-linux- gnu/libmutter- 10.so.0 64-linux- gnu/libmutter- 10.so.0 64-linux- gnu/libmutter- 10.so.0 x86_64- linux-gnu/ mutter- 10/libmutter- clutter- 10.so.0. ..
threadid=
44 ./nptl/
[Current thread is 1 (Thread 0x7f18e5f005c0 (LWP 7141))]
(gdb) bt
#0 __pthread_
threadid=
#1 __pthread_
at ./nptl/
#2 __GI___pthread_kill (threadid=
at ./nptl/
#3 0x00007f18eb1c9476 in __GI_raise (sig=11) at ../sysdeps/
#4 0x0000561d62d507aa in ?? ()
#5 <signal handler called>
#6 __strlen_avx2_rtm () at ../sysdeps/
#7 0x00007f18d5f2192a in ?? ()
from /usr/lib/
#8 0x00007f18d5f21567 in ?? ()
from /usr/lib/
#9 0x00007f18d62afed0 in ?? ()
from /usr/lib/
#10 0x00007f18d62ab825 in ?? ()
from /usr/lib/
#11 0x00007f18d62ad4c7 in ?? ()
from /usr/lib/
#12 0x00007f18d61940da in ?? ()
from /usr/lib/
#13 0x00007f18d617523f in ?? ()
from /usr/lib/
#14 0x00007f18eae25cf2 in ?? ()
from /usr/lib/
#15 0x00007f18eae22b3c in ?? ()
from /usr/lib/
#16 0x00007f18eae22fee in ?? ()
from /usr/lib/
#17 0x00007f18eae33c71 in ?? ()
from /usr/lib/
#18 0x00007f18eae1bc76 in ?? ()
from /usr/lib/
#19 0x00007f18eae4aeb2 in ?? ()
from /usr/lib/
#20 0x00007f18eae4b233 in ?? ()
from /usr/lib/
#21 0x00007f18eae4b4c6 in ?? ()
from /usr/lib/
#22 0x00007f18eae4bf4f in ?? ()
from /usr/lib/
#23 0x00007f18eae53343 in ?? ()
from /usr/lib/
#24 0x00007f18ec109f80 in g_list_foreach ()
from /lib/x86_
#25 0x00007f18eae52e7b in ?? ()
from /usr/lib/
#26 0x00007f18eae55e30 in cogl_blit_
from /usr/lib/
#27 0x00007f18eb695b4d in clutter_
from /usr/lib/
#28 0x00007f18eb434803 in ?? () from /lib/x86_
#29 0x00007f18eb438f71 in ?? () from /lib/x86_
#30 0x00007f18eb52482b in ?? () from /lib/x86_
#31 0x00007f18eb696258 in ?? ()
from /usr/lib/