Resuming from suspend on Lenovo ThinkPad X1 Carbon 4th gen does not work reliably

Bug #1570484 reported by Mikko Kortelainen
44
This bug affects 9 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Confirmed
Medium
Unassigned

Bug Description

Sometimes resuming results in a black screen and a blinking caps lock light, and powering off is the only option. Sometimes it does resume. After such an event I found this in dmesg:

[66603.698393] BUG: unable to handle kernel NULL pointer dereference at (null)
[66603.698537] IP: [<ffffffff811cb762>] unlink_anon_vmas+0x92/0x1f0
[66603.698649] PGD 0
[66603.698689] Oops: 0000 [#1] SMP
[66603.698752] Modules linked in: ctr ccm rfcomm bnep binfmt_misc drbg ansi_cprng dm_crypt arc4 intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm acer_wmi sparse_keymap irqbypass snd_soc_skl crct10dif_pclmul snd_soc_skl_ipc snd_hda_ext_core snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_codec_hdmi snd_soc_core crc32_pclmul snd_hda_codec_conexant iwlmvm snd_hda_codec_generic snd_compress mac80211 aesni_intel ac97_bus snd_pcm_dmaengine aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd dw_dmac_core iwlwifi snd_hda_intel snd_hda_codec uvcvideo videobuf2_vmalloc snd_hda_core videobuf2_memops videobuf2_v4l2 snd_hwdep videobuf2_core cdc_mbim v4l2_common input_leds joydev cdc_wdm qcserial serio_raw usb_wwan cdc_ncm videodev snd_pcm usbnet usbserial mii cfg80211 rtsx_pci_ms btusb media btrtl
[66603.700162] btbcm btintel memstick thinkpad_acpi bluetooth nvram snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq mei_me mei snd_seq_device shpchp snd_timer snd soundcore mac_hid parport_pc ppdev lp parport autofs4 rtsx_pci_sdmmc i915_bpo intel_ips i2c_algo_bit e1000e psmouse drm_kms_helper ptp syscopyarea sysfillrect pps_core sysimgblt fb_sys_fops rtsx_pci drm ahci libahci wmi video fjes
[66603.700853] CPU: 3 PID: 26940 Comm: systemd-udevd Not tainted 4.4.0-18-generic #34-Ubuntu
[66603.700981] Hardware name: LENOVO 20FB003PMS/20FB003PMS, BIOS N1FET37W (1.11 ) 03/15/2016
[66603.701114] task: ffff88010e908ec0 ti: ffff88010fad0000 task.ti: ffff88010fad0000
[66603.701219] RIP: 0010:[<ffffffff811cb762>] [<ffffffff811cb762>] unlink_anon_vmas+0x92/0x1f0
[66603.701365] RSP: 0018:ffff88010fad3d58 EFLAGS: 00010246
[66603.701447] RAX: ffff880213cb0010 RBX: ffff880213cb0000 RCX: 0000000000000000
[66603.701559] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff88010fadc000
[66603.701663] RBP: ffff88010fad3d90 R08: 0000000000000000 R09: 000000018040003e
[66603.701777] R10: ffff8801f2220840 R11: ffff88010fad6f30 R12: 00007f827a65a000
[66603.701885] R13: fffffffffffffff0 R14: ffff88010fadc078 R15: 0000000000000000
[66603.701994] FS: 0000000000000000(0000) GS:ffff8802224c0000(0000) knlGS:0000000000000000
[66603.702104] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[66603.702187] CR2: 0000000000000000 CR3: 000000010fb68000 CR4: 00000000003406e0
[66603.702294] Stack:
[66603.702331] ffffffff811c3fe2 ffff88010fadc000 ffff88010fadc0c8 00007f827a65a000
[66603.702467] 0000000000000000 ffff88010fad3de8 ffff88010fadc000 ffff88010fad3dd0
[66603.702596] ffffffff811bd191 0000000000000000 0000000000000000 ffff88010c11c740
[66603.702730] Call Trace:
[66603.702777] [<ffffffff811c3fe2>] ? __remove_shared_vm_struct+0x32/0x40
[66603.702892] [<ffffffff811bd191>] free_pgtables+0xa1/0x120
[66603.703012] [<ffffffff811c7c51>] exit_mmap+0xc1/0x170
[66603.703090] [<ffffffff8107de07>] mmput+0x57/0x130
[66603.703165] [<ffffffff81083b3d>] do_exit+0x27d/0xae0
[66603.703248] [<ffffffff81084423>] do_group_exit+0x43/0xb0
[66603.703341] [<ffffffff810844a4>] SyS_exit_group+0x14/0x20
[66603.703427] [<ffffffff818243b2>] entry_SYSCALL_64_fastpath+0x16/0x71
[66603.703524] Code: 00 00 ad de 48 89 43 18 e8 2c 07 02 00 49 8b 45 10 49 8d 4d 10 4c 89 eb 4c 89 e2 48 83 e8 10 49 39 ce 74 43 49 89 c5 4c 8b 7b 08 <4d> 8b 27 49 39 d4 74 13 48 85 d2 0f 85 0b 01 00 00 49 8d 7c 24
[66603.704080] RIP [<ffffffff811cb762>] unlink_anon_vmas+0x92/0x1f0
[66603.704180] RSP <ffff88010fad3d58>
[66603.704229] CR2: 0000000000000000
[66603.735771] ---[ end trace 4454f1c4b387baf0 ]---
[66603.735774] Fixing recursive fault but reboot is needed!

Another thing that caught my eye were the following lines but they may be a separate issue:

[66598.121484] thinkpad_acpi: unknown possible thermal alarm or keyboard event received
[66598.121488] thinkpad_acpi: unhandled HKEY event 0x60c0
[66598.121489] thinkpad_acpi: please report the conditions when this event happened to <email address hidden>
[66598.122040] i915_bpo 0000:00:02.0: BAR 6: [??? 0x00000000 flags 0x2] has bogus alignment
[66598.135264] thinkpad_acpi: unknown possible thermal alarm or keyboard event received
[66598.135267] thinkpad_acpi: unhandled HKEY event 0x60f0
[66598.135268] thinkpad_acpi: please report the conditions when this event happened to <email address hidden>

ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-4.4.0-18-generic 4.4.0-18.34 [modified: boot/vmlinuz-4.4.0-18-generic]
ProcVersionSignature: Ubuntu 4.4.0-18.34-generic 4.4.6
Uname: Linux 4.4.0-18-generic x86_64
ApportVersion: 2.20.1-0ubuntu2
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: mkortela 1891 F.... pulseaudio
CurrentDesktop: Unity
Date: Thu Apr 14 19:44:26 2016
EcryptfsInUse: Yes
HibernationDevice: RESUME=UUID=b5d97bd1-6a16-4736-9da1-65ed058e7e2b
InstallationDate: Installed on 2016-04-13 (0 days ago)
InstallationMedia: Ubuntu 16.04 LTS "Xenial Xerus" - Beta amd64 (20160412)
MachineType: LENOVO 20FB003PMS
ProcFB: 0 inteldrmfb
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-4.4.0-18-generic root=/dev/mapper/rootvg-ubuntulv ro splash vt.handoff=7
RelatedPackageVersions:
 linux-restricted-modules-4.4.0-18-generic N/A
 linux-backports-modules-4.4.0-18-generic N/A
 linux-firmware 1.157
SourcePackage: linux
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 03/15/2016
dmi.bios.vendor: LENOVO
dmi.bios.version: N1FET37W (1.11 )
dmi.board.asset.tag: Not Available
dmi.board.name: 20FB003PMS
dmi.board.vendor: LENOVO
dmi.board.version: SDK0J40705 WIN
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: None
dmi.modalias: dmi:bvnLENOVO:bvrN1FET37W(1.11):bd03/15/2016:svnLENOVO:pn20FB003PMS:pvrThinkPadX1Carbon4th:rvnLENOVO:rn20FB003PMS:rvrSDK0J40705WIN:cvnLENOVO:ct10:cvrNone:
dmi.product.name: 20FB003PMS
dmi.product.version: ThinkPad X1 Carbon 4th
dmi.sys.vendor: LENOVO

Revision history for this message
Mikko Kortelainen (kortsi) wrote :
Revision history for this message
Brad Figg (brad-figg) wrote : Status changed to Confirmed

This change was made by a bot.

Changed in linux (Ubuntu):
status: New → Confirmed
summary: - Resuming from suspend on Lenovo X1 Carbon 4th gen does not work reliably
+ Resuming from suspend on Lenovo ThinkPad X1 Carbon 4th gen does not work
+ reliably
tags: added: kernel-bug resume suspend
Revision history for this message
Joseph Salisbury (jsalisbury) wrote :

Did this issue start happening after an update/upgrade? Was there a prior kernel version where you were not having this particular problem?

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.6 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.6-rc4-wily/

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
Revision history for this message
Mikko Kortelainen (kortsi) wrote :

I just bought the machine a couple of days ago and installed Xenial straight away so I havent' had a working kernel on this.

I was thinking about installing a mainline build earlier but did not get to it. Then I discovered that the battery life on this thing is pretty amazing. I can use it through the whole day without charging even with suspend disabled.

But I'll test the mainline build and report back.

Revision history for this message
Mikko Kortelainen (kortsi) wrote :
Download full text (4.3 KiB)

I tested with:

4.6.0-040600rc4-generic #201604172330 SMP Mon Apr 18 03:32:32 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

I can confirm that this one does not hang during resume after trying about ten times.

Two times out of 10 I got this in dmesg:

[ 77.422675] ------------[ cut here ]------------
[ 77.422733] WARNING: CPU: 0 PID: 2141 at /home/kernel/COD/linux/drivers/gpu/drm/i915/intel_uncore.c:649 __unclaimed_reg_debug+0x80/0x90 [i915]
[ 77.422735] Unclaimed register detected after reading register 0x65f10
[ 77.422789] Modules linked in: drbg ansi_cprng ctr ccm msr bnep binfmt_misc dm_crypt arc4 intel_rapl x86_pkg_temp_thermal snd_soc_skl acer_wmi intel_powerclamp sparse_keymap coretemp kvm_intel snd_soc_skl_ipc snd_soc_sst_ipc kvm snd_soc_sst_dsp irqbypass snd_hda_codec_hdmi snd_hda_ext_core snd_soc_sst_match crct10dif_pclmul snd_soc_core crc32_pclmul ghash_clmulni_intel iwlmvm snd_hda_codec_conexant aesni_intel snd_hda_codec_generic mac80211 snd_compress ac97_bus aes_x86_64 lrw gf128mul glue_helper snd_pcm_dmaengine ablk_helper dw_dmac_core cryptd snd_hda_intel iwlwifi snd_hda_codec input_leds joydev snd_hda_core snd_hwdep serio_raw uvcvideo cdc_mbim cdc_wdm videobuf2_vmalloc videobuf2_memops qcserial cdc_ncm snd_pcm videobuf2_v4l2 usbnet usb_wwan videobuf2_core mii videodev usbserial rtsx_pci_ms
[ 77.422824] cfg80211 media memstick btusb btrtl btbcm btintel bluetooth thinkpad_acpi nvram snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq mei_me mei shpchp snd_seq_device intel_pch_thermal snd_timer snd soundcore mac_hid parport_pc ppdev lp parport autofs4 i915 i2c_algo_bit drm_kms_helper rtsx_pci_sdmmc e1000e syscopyarea sysfillrect ptp sysimgblt psmouse fb_sys_fops pps_core ahci rtsx_pci drm libahci wmi video fjes
[ 77.422828] CPU: 0 PID: 2141 Comm: kworker/u16:6 Not tainted 4.6.0-040600rc4-generic #201604172330
[ 77.422830] Hardware name: LENOVO 20FB003PMS/20FB003PMS, BIOS N1FET37W (1.11 ) 03/15/2016
[ 77.422839] Workqueue: events_unbound async_run_entry_fn
[ 77.422844] 0000000000000086 000000003e1632fd ffff880089e47a58 ffffffff813eee03
[ 77.422848] ffff880089e47aa8 0000000000000000 ffff880089e47a98 ffffffff810827cb
[ 77.422851] 0000028989e47a78 0000000000000000 0000000000000001 0000000000065f10
[ 77.422852] Call Trace:
[ 77.422862] [<ffffffff813eee03>] dump_stack+0x63/0x90
[ 77.422867] [<ffffffff810827cb>] __warn+0xcb/0xf0
[ 77.422871] [<ffffffff8108284f>] warn_slowpath_fmt+0x5f/0x80
[ 77.422915] [<ffffffffc01e6db0>] __unclaimed_reg_debug+0x80/0x90 [i915]
[ 77.422953] [<ffffffffc01ec85b>] gen9_read32+0x35b/0x390 [i915]
[ 77.422996] [<ffffffffc01f2dd9>] i915_audio_component_codec_wake_override+0x39/0xb0 [i915]
[ 77.423008] [<ffffffffc061aa8f>] snd_hdac_set_codec_wakeup+0x3f/0xa0 [snd_hda_core]
[ 77.423014] [<ffffffffc037b0b4>] azx_runtime_resume+0x184/0x1a0 [snd_hda_intel]
[ 77.423019] [<ffffffff8144175b>] pci_pm_runtime_resume+0x7b/0xa0
[ 77.423025] [<ffffffff81562303>] __rpm_callback+0x33/0x70
[ 77.423029] [<ffffffff814416e0>] ? pci_restore_standard_config+0x40/0x40
[ 77.423033] [<ffffffff81562364>] rpm_callback+0x24/0x80
[ 77.423037] [<ffffffff814416e0>] ? pci_restore...

Read more...

tags: added: kernel-fixed-upstream
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Revision history for this message
Tomislav (hefest) wrote :

I also encounter random failures resuming from suspend on 14.04 ("4.4.0-64-generic #85~14.04.1-Ubuntu SMP Mon Feb 20 12:10:54 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux"). Latitude E5570, fresh install (as of a week ago). No blinking caps lock, though.

I just cycled through suspend-resume several times and it worked well, so it's intermittent. I tried to switch to the first virtual terminal (<Ctrl>-<Alt>-F1) and restart (<Ctrl>-<Alt>-<Del>) assuming it was just the screen, but nothing happened, suggesting that it's not screen-related. Haven't checked dmesg yet, though, but I will next time it happens.

I also registered one "Unclaimed register detected after reading register" event in /var/log/kern.log, but I don't believe it's related as my machine froze a number of times this week, yet this event is present in kern.log only once, 5 days ago.

Revision history for this message
Jorge Avarias (javarias) wrote :
Download full text (6.6 KiB)

I am having similar kernel message in a HP XW9400 workstation.

I have no machine freezes and until now I can still use the machine.

[159882.429604] ------------[ cut here ]------------
[159882.429623] WARNING: CPU: 6 PID: 22042 at /build/linux-0XAgc4/linux-4.4.0/mm/rmap.c:231 unlink_anon_vmas+0x1da/0x1f0()
[159882.429625] Modules linked in: veth ip6t_MASQUERADE nf_nat_masquerade_ipv6 ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack ip6table_filter ip6_tables xt_CHECKSUM iptable_mangle xt_comment xt_tcpudp bridge stp llc iptable_filter ip_tables x_tables unix_diag tcp_diag inet_diag nvram video msr binfmt_misc input_leds snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device hp_wmi nvidia_uvm(POE) sparse_keymap snd_timer snd amd64_edac_mod kvm_amd edac_mce_amd edac_core kvm wmi shpchp soundcore 8250_fintek i2c_nforce2 k10temp serio_raw irqbypass mac_hid
[159882.429674] ppdev lp parport autofs4 btrfs xor raid6_pq hid_generic usbhid hid nvidia_drm(POE) nvidia_modeset(POE) nvidia(POE) drm_kms_helper syscopyarea sysfillrect mptsas sysimgblt mptscsih fb_sys_fops mptbase drm psmouse pata_acpi scsi_transport_sas firewire_ohci firewire_core forcedeth crc_itu_t sata_nv pata_amd fjes [last unloaded: cpuid]
[159882.429700] CPU: 6 PID: 22042 Comm: git-svn Tainted: P OE 4.4.0-78-generic #99-Ubuntu
[159882.429703] Hardware name: Hewlett-Packard HP xw9400 Workstation/0A1Ch, BIOS 786D6 v04.02 04/21/2009
[159882.429706] 0000000000000286 00000000acc8e888 ffff88076ca03ba8 ffffffff813f8dd3
[159882.429709] 0000000000000000 ffffffff81cd3a70 ffff88076ca03be0 ffffffff81081302
[159882.429711] ffff88069e8c3300 ffff880042755d70 ffff8800cb21ee78 ffff8800cb21ee88
[159882.429714] Call Trace:
[159882.429723] [<ffffffff813f8dd3>] dump_stack+0x63/0x90
[159882.429728] [<ffffffff81081302>] warn_slowpath_common+0x82/0xc0
[159882.429731] [<ffffffff8108144a>] warn_slowpath_null+0x1a/0x20
[159882.429734] [<ffffffff811cdc7a>] unlink_anon_vmas+0x1da/0x1f0
[159882.429739] [<ffffffff811bf411>] free_pgtables+0xa1/0x120
[159882.429742] [<ffffffff811ca011>] exit_mmap+0xc1/0x170
[159882.429745] [<ffffffff8107e0d7>] mmput+0x57/0x130
[159882.429750] [<ffffffff81216284>] flush_old_exec+0x614/0x8d0
[159882.429755] [<ffffffff8126a969>] load_elf_binary+0x359/0x1170
[159882.429762] [<ffffffffc01ec064>] ? load_misc_binary+0x64/0x3f0 [binfmt_misc]
[159882.429765] [<ffffffff81271417>] ? get_acl+0x17/0x60
[159882.429769] [<ffffffff8121558e>] search_binary_handler+0x9e/0x1d0
[159882.429772] [<ffffffff81216d24>] do_execveat_common.isra.33+0x554/0x740
[159882.429775] [<ffffffff8121716a>] SyS_execve+0x3a/0x50
[159882.429780] [<ffffffff81840cd5>] stub_execve+0x5/0x5
[159882.429783] [<ffffffff81840a32>] ? entry_SYSCALL_64_fastpath+0x16/0x71
[159882.429786] ---[ end trace e9e562cba7ac90a1 ]---

[159882.714222] ------------[ cut here ]------------
[159882.714234] kernel BUG at /build/linux-0XAgc4/linux-4.4.0/mm/rmap.c:4...

Read more...

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.