Comment 0 for bug 1772638

Revision history for this message
Thorsten (thorstenr-42) wrote : segfault in libmutter-2.so after suspend/resume using wayland

Hi,

i have the problem when running the gnome wayland session in ubuntu 18.04 that in ~1 out of 5 resume/suspend cycles a segfault in libmutter-2.so occurs which then kills my current session. Here is my journctl from the crash:

Mai 22 12:39:03 x1 systemd[1]: Reached target Sleep.
Mai 22 12:39:03 x1 systemd[1]: Starting Suspend...
Mai 22 12:39:03 x1 systemd-sleep[2349]: Suspending system...
Mai 22 12:39:03 x1 kernel: PM: suspend entry (s2idle)
Mai 22 12:39:03 x1 kernel: PM: Syncing filesystems ... done.
Mai 22 12:39:03 x1 gnome-shell[1483]: Failed to set CRTC mode 2560x1440: Permission denied
Mai 22 12:39:03 x1 wpa_supplicant[976]: nl80211: deinit ifname=wlp2s0 disabled_11b_rates=0
Mai 22 12:39:03 x1 kernel: gnome-shell[1483]: segfault at 20 ip 00007ff704b28b17 sp 00007ffd6c1cd1b0 error 4 in libmutter-2.so.0.0.0[7ff704a3b000+156000]
Mai 22 12:39:03 x1 kernel: [drm] Reducing the compressed framebuffer size. This may lead to less power savings than a non-reduced-size. Try to increase stolen memory size if available in BIOS.
Mai 22 13:11:15 x1 kernel: Freezing user space processes ... (elapsed 0.001 seconds) done.
Mai 22 13:11:15 x1 kernel: OOM killer disabled.
Mai 22 13:11:15 x1 kernel: Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
Mai 22 13:11:15 x1 kernel: Suspending console(s) (use no_console_suspend to debug)
Mai 22 13:11:15 x1 kernel: thinkpad_acpi: unknown possible thermal alarm or keyboard event received
Mai 22 13:11:15 x1 kernel: thinkpad_acpi: unhandled HKEY event 0x6032
Mai 22 13:11:15 x1 kernel: thinkpad_acpi: please report the conditions when this event happened to <email address hidden>
Mai 22 13:11:15 x1 kernel: [drm] GuC: Loaded firmware i915/kbl_guc_ver9_39.bin (version 9.39)
Mai 22 13:11:15 x1 kernel: i915 0000:00:02.0: GuC firmware version 9.39
Mai 22 13:11:15 x1 kernel: i915 0000:00:02.0: GuC submission enabled
Mai 22 13:11:15 x1 kernel: i915 0000:00:02.0: HuC disabled
Mai 22 13:11:15 x1 kernel: [drm] Reducing the compressed framebuffer size. This may lead to less power savings than a non-reduced-size. Try to increase stolen memory size if available in BIOS.
Mai 22 13:11:15 x1 kernel: OOM killer enabled.
Mai 22 13:11:15 x1 kernel: Restarting tasks ... done.
Mai 22 13:11:15 x1 systemd-logind[986]: Lid opened.
Mai 22 13:11:15 x1 kernel: rfkill: input handler enabled
Mai 22 13:11:15 x1 systemd[1]: Started Run anacron jobs.
Mai 22 13:11:15 x1 anacron[2420]: Anacron 2.3 started on 2018-05-22
Mai 22 13:11:15 x1 anacron[2420]: Normal exit (0 jobs run)
Mai 22 13:11:15 x1 kernel: thermal thermal_zone6: failed to read out thermal zone (-61)
Mai 22 13:11:15 x1 tilix[1980]: Error reading events from display: Broken pipe
Mai 22 13:11:15 x1 update-notifier[1901]: Error reading events from display: Broken pipe
Mai 22 13:11:15 x1 systemd-sleep[2349]: System resumed.
Mai 22 13:11:15 x1 kernel: PM: suspend exit
Mai 22 13:11:15 x1 org.gnome.Shell.desktop[1483]: (EE)
Mai 22 13:11:15 x1 org.gnome.Shell.desktop[1483]: Fatal server error:
Mai 22 13:11:15 x1 org.gnome.Shell.desktop[1483]: (EE) failed to read Wayland events: Broken pipe
Mai 22 13:11:15 x1 org.gnome.Shell.desktop[1483]: (EE)
Mai 22 13:11:15 x1 systemd[1]: Started Suspend.
Mai 22 13:11:15 x1 systemd[1]: sleep.target: Unit not needed anymore. Stopping.
Mai 22 13:11:15 x1 systemd[1]: Stopped target Sleep.
Mai 22 13:11:15 x1 systemd[1]: Reached target Suspend.
Mai 22 13:11:15 x1 systemd[1]: suspend.target: Unit not needed anymore. Stopping.
Mai 22 13:11:15 x1 systemd[1]: Stopped target Suspend.
Mai 22 13:11:15 x1 gnome-session[1455]: gnome-session-binary[1455]: WARNING: Application 'org.gnome.Shell.desktop' killed by signal 11
Mai 22 13:11:15 x1 gnome-session-binary[1455]: WARNING: Application 'org.gnome.Shell.desktop' killed by signal 11
Mai 22 13:11:15 x1 gnome-session-binary[1455]: Unrecoverable failure in required component org.gnome.Shell.desktop
Mai 22 13:11:15 x1 kdeconnectd.desktop[1725]: ICE default IO error handler doing an exit(), pid = 1725, errno = 11
Mai 22 13:11:15 x1 org.gnome.SettingsDaemon.Power.desktop[1611]: xcb_connection_has_error() returned true
Mai 22 13:11:15 x1 firefox.desktop[2050]: xcb_connection_has_error() returned true
Mai 22 13:11:15 x1 pulseaudio[2430]: [pulseaudio] client-conf-x11.c: xcb_connection_has_error() returned true
Mai 22 13:11:15 x1 polkitd(authority=local)[1059]: Unregistered Authentication Agent for unix-session:2 (system bus name :1.74, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
Mai 22 13:11:15 x1 rtkit-daemon[1171]: Successfully made thread 2434 of process 2434 (n/a) owned by '1000' high priority at nice level -11.
Mai 22 13:11:15 x1 rtkit-daemon[1171]: Supervising 2 threads of 2 processes of 1 users.
Mai 22 13:11:15 x1 pulseaudio[2434]: [pulseaudio] pid.c: Stale PID file, overwriting.
Mai 22 13:11:15 x1 gdm-password][1430]: pam_unix(gdm-password:session): session closed for user thorsten
Mai 22 13:11:15 x1 gsd-color[1217]: failed to connect to device: Failed to connect to missing device /org/freedesktop/ColorManager/devices/xrandr_AU_Optronics_thorsten_1000
Mai 22 13:11:15 x1 pulseaudio[2434]: [pulseaudio] sink.c: Default and alternate sample rates are the same.
... followup crashes...

Moreover, here is the apport.log:
ERROR: apport (pid 2357) Tue May 22 12:39:03 2018: called for pid 1483, signal 11, core limit 0, dump mode 1
ERROR: apport (pid 2357) Tue May 22 12:39:03 2018: executable: /usr/bin/gnome-shell (command line "/usr/bin/gnome-shell")
ERROR: apport (pid 2357) Tue May 22 12:39:03 2018: debug: session gdbus call: (true,)

ERROR: apport (pid 2357) Tue May 22 13:11:15 2018: wrote report /var/crash/_usr_bin_gnome-shell.1000.crash
ERROR: apport (pid 2429) Tue May 22 13:11:15 2018: called for pid 1502, signal 6, core limit 18446744073709551615, dump mode 1
ERROR: apport (pid 2429) Tue May 22 13:11:15 2018: ignoring implausibly big core limit, treating as unlimited
ERROR: apport (pid 2429) Tue May 22 13:11:15 2018: executable: /usr/bin/Xwayland (command line "/usr/bin/Xwayland :0 -rootless -terminate -accessx -core -listen 4 -listen 5 -displayfd 6")
ERROR: apport (pid 2429) Tue May 22 13:11:15 2018: gdbus call error: Error: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.gnome.SessionManager was not provided by any .service files

ERROR: apport (pid 2429) Tue May 22 13:11:15 2018: debug: session gdbus call:
ERROR: apport (pid 2429) Tue May 22 13:11:19 2018: wrote report /var/crash/_usr_bin_Xwayland.1000.crash
ERROR: apport (pid 2429) Tue May 22 13:11:19 2018: writing core dump to /home/thorsten/core (limit: -1)
ERROR: apport (pid 2429) Tue May 22 13:11:20 2018: writing core dump /home/thorsten/core of size 107855872

Additionally, i have ~100MB "core" file in my home directory.

I tried to report the created _usr_bin_gnome-shell.1000.crash file using the workaround from https://bugs.launchpad.net/ubuntu/+source/apport/+bug/994921 and ubuntu-bug. However, i get the error: "The problem cannot be reported: Invalid core dump: BFD: warning /tmp/apport_core_ersfx7ym is truncated: expected core file size >= 846290944, found: 216288"

reporting the corresponding _usr_bin_Xwayland.1000.crash file forwards me to https://bugs.launchpad.net/ubuntu/+source/xorg-server/+bug/1731911. But this only seems to indicate that Xwayland crashes since mutter has crashed.

This error only occurs when using the wayland and not in the xorg session.

I am running 4.16.10 mainline kernel due to some driver issues with my laptop (thinkpad x1 carbon gen6) but this also occurs when using the 4.15 kernel and with and without loading the guc firmware.

Thanks!