*ERROR* Timed out waiting for DP idle patterns / drm_wait_one_vblank+0x1b5/0x1c0 [drm]

Bug #1567425 reported by blabber
This bug affects 1 person
Affects         Status   Importance  Assigned to  Milestone
linux (Ubuntu)  Expired  Medium      Unassigned

Bug Description

Hi,

I'm using a new Skylake ThinkPad T460s with an external monitor connected via DisplayPort, running Ubuntu 16.04 with the latest kernel as of today: Linux test 4.4.0-17-generic #33-Ubuntu SMP Tue Mar 29 17:17:28 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

From time to time X locks up after the computer has been left idle for a long time, both displays (laptop and external) have gone to sleep, and they are then woken up.
Apr 6 07:33:24 test kernel: [39056.603685] [drm:intel_dp_set_idle_link_train [i915_bpo]] *ERROR* Timed out waiting for DP idle patterns
Apr 6 07:33:24 test kernel: [39056.798309] [drm:intel_mst_enable_dp [i915_bpo]] *ERROR* Timed out waiting for ACT sent
Apr 6 07:33:24 test kernel: [39056.853808] snd_hda_codec_hdmi hdaudioC0D2: HDMI: ELD buf size is 0, force 128
Apr 6 07:33:24 test kernel: [39056.853928] snd_hda_codec_hdmi hdaudioC0D2: HDMI: invalid ELD data byte 0
Apr 6 07:33:24 test kernel: [39056.853994] snd_hda_codec_hdmi hdaudioC0D2: HDMI: ELD buf size is 0, force 128
Apr 6 07:33:24 test kernel: [39056.854113] snd_hda_codec_hdmi hdaudioC0D2: HDMI: invalid ELD data byte 0
Apr 6 07:33:24 test kernel: [39056.854452] snd_hda_codec_hdmi hdaudioC0D2: HDMI: ELD buf size is 0, force 128
Apr 6 07:33:24 test kernel: [39056.854511] snd_hda_codec_hdmi hdaudioC0D2: HDMI: invalid ELD data byte 0
Apr 6 07:33:24 test kernel: [39056.854575] snd_hda_codec_hdmi hdaudioC0D2: HDMI: ELD buf size is 0, force 128
Apr 6 07:33:24 test kernel: [39056.854636] snd_hda_codec_hdmi hdaudioC0D2: HDMI: invalid ELD data byte 0
Apr 6 07:33:24 test kernel: [39056.953538] ------------[ cut here ]------------
Apr 6 07:33:24 test kernel: [39056.953598] WARNING: CPU: 1 PID: 1190 at /build/linux-bi18CQ/linux-4.4.0/drivers/gpu/drm/drm_irq.c:1326 drm_wait_one_vblank+0x1b5/0x1c0 [drm]()
Apr 6 07:33:24 test kernel: [39056.953604] vblank wait timed out on crtc 1
Apr 6 07:33:24 test kernel: [39056.953608] Modules linked in: pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 snd_usb_audio videobuf2_core v4l2_common snd_usbmidi_lib videodev media msr acpi_call(OE) bnep btusb btrtl btbcm btintel binfmt_misc bluetooth nls_iso8859_1 arc4 snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic snd_soc_skl snd_soc_skl_ipc snd_hda_ext_core snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_core snd_compress ac97_bus snd_pcm_dmaengine dw_dmac_core iwlmvm snd_hda_intel mac80211 snd_hda_codec intel_rapl iwlwifi x86_pkg_temp_thermal intel_powerclamp coretemp snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_hda_core snd_hwdep joydev snd_seq_device snd_pcm input_leds cfg80211 serio_raw thinkpad_acpi rtsx_pci_ms nvram memstick snd_timer mei_me mei shpchp snd soundcore tpm_crb mac_hid kvm_intel kvm irqbypass parport_pc ppdev lp parport autofs4 drbg ansi_cprng algif_skcipher af_alg dm_crypt hid_microsoft hid_generic usbhid hid rtsx_pci_sdmmc crct10dif_pclmul i915_bpo crc32_pclmul aesni_intel aes_x86_64 lrw gf128mul glue_helper intel_ips ablk_helper i2c_algo_bit cryptd drm_kms_helper nvme syscopyarea e1000e psmouse sysfillrect sysimgblt fb_sys_fops ptp drm pps_core rtsx_pci wmi video fjes
Apr 6 07:33:24 test kernel: [39056.953824] CPU: 1 PID: 1190 Comm: Xorg Tainted: G W OE 4.4.0-16-generic #32-Ubuntu
Apr 6 07:33:24 test kernel: [39056.953830] Hardware name: LENOVO 20F9CTO1WW/20F9CTO1WW, BIOS N1CET40W (1.08 ) 03/09/2016
Apr 6 07:33:24 test kernel: [39056.953835] 0000000000000286 00000000e7611a0a ffff880508c6fad8 ffffffff813e9143
Apr 6 07:33:24 test kernel: [39056.953846] ffff880508c6fb20 ffffffffc0085b10 ffff880508c6fb10 ffffffff8107ff22
Apr 6 07:33:24 test kernel: [39056.953854] ffff880502917800 0000000000000001 0000000000000000 000000000007939d
Apr 6 07:33:24 test kernel: [39056.953863] Call Trace:
Apr 6 07:33:24 test kernel: [39056.953880] [<ffffffff813e9143>] dump_stack+0x63/0x90
Apr 6 07:33:24 test kernel: [39056.953892] [<ffffffff8107ff22>] warn_slowpath_common+0x82/0xc0
Apr 6 07:33:24 test kernel: [39056.953901] [<ffffffff8107ffbc>] warn_slowpath_fmt+0x5c/0x80
Apr 6 07:33:24 test kernel: [39056.953913] [<ffffffff810c2655>] ? finish_wait+0x55/0x70
Apr 6 07:33:24 test kernel: [39056.953954] [<ffffffffc00562f5>] drm_wait_one_vblank+0x1b5/0x1c0 [drm]
Apr 6 07:33:24 test kernel: [39056.953964] [<ffffffff810c2af0>] ? wake_atomic_t_function+0x60/0x60
Apr 6 07:33:24 test kernel: [39056.954068] [<ffffffffc02edeb6>] intel_atomic_commit+0x6b6/0xab0 [i915_bpo]
Apr 6 07:33:24 test kernel: [39056.954131] [<ffffffffc006f91e>] ? drm_atomic_check_only+0x18e/0x590 [drm]
Apr 6 07:33:24 test kernel: [39056.954186] [<ffffffffc006fd57>] drm_atomic_commit+0x37/0x60 [drm]
Apr 6 07:33:24 test kernel: [39056.954216] [<ffffffffc017d846>] drm_atomic_helper_set_config+0x76/0xb0 [drm_kms_helper]
Apr 6 07:33:24 test kernel: [39056.954266] [<ffffffffc005eeb2>] drm_mode_set_config_internal+0x62/0x100 [drm]
Apr 6 07:33:24 test kernel: [39056.954318] [<ffffffffc00633d2>] drm_mode_setcrtc+0x3d2/0x4f0 [drm]
Apr 6 07:33:24 test kernel: [39056.954355] [<ffffffffc0054752>] drm_ioctl+0x152/0x540 [drm]
Apr 6 07:33:24 test kernel: [39056.954403] [<ffffffffc0063000>] ? drm_mode_setplane+0x1b0/0x1b0 [drm]
Apr 6 07:33:24 test kernel: [39056.954416] [<ffffffff8121f6fc>] do_vfs_ioctl+0x2ac/0x4a0
Apr 6 07:33:24 test kernel: [39056.954426] [<ffffffff8108d071>] ? __set_task_blocked+0x41/0xa0
Apr 6 07:33:24 test kernel: [39056.954436] [<ffffffff8108fa16>] ? __set_current_blocked+0x36/0x60
Apr 6 07:33:24 test kernel: [39056.954443] [<ffffffff8121f969>] SyS_ioctl+0x79/0x90
Apr 6 07:33:24 test kernel: [39056.954452] [<ffffffff8108fcce>] ? SyS_rt_sigprocmask+0x8e/0xc0
Apr 6 07:33:24 test kernel: [39056.954464] [<ffffffff81824ef2>] entry_SYSCALL_64_fastpath+0x16/0x71
Apr 6 07:33:24 test kernel: [39056.954472] ---[ end trace 9dcf595db8d43c98 ]---

Attached is the full kernel log from boot until the issue happens. When X locks up at that point, I have to reboot to get back to normal.

I don't know if this particular bug I'm reporting is the one responsible for the lockups, as I've isolated 4 different warning/error messages which separately or in combination may be the cause. I'm reporting all of them now in separate bug reports.
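To see which of these messages occur together around a lockup, a saved kernel log can be scanned for them. The following is only a rough sketch: the function name `count_drm_errors` is made up for illustration, and the four patterns are simply the messages visible in the log quoted above, which are not necessarily the four referred to in this report.

```shell
# Hypothetical helper: count how often each message quoted in this report
# appears in a saved kernel log. Pass the log file path as the only argument.
count_drm_errors() {
    log_file="$1"
    for pattern in \
        'Timed out waiting for DP idle patterns' \
        'Timed out waiting for ACT sent' \
        'vblank wait timed out on crtc' \
        'HDMI: invalid ELD data byte'
    do
        # -F matches the pattern as a fixed string, -c prints the number of
        # matching lines. grep exits non-zero when the count is 0, so the
        # "|| true" keeps the loop going under "set -e".
        count=$(grep -cF -- "$pattern" "$log_file" || true)
        printf '%s\t%s\n' "$count" "$pattern"
    done
}
```

For example, `count_drm_errors /var/log/kern.log` prints one count per message, assuming the kernel log is stored at that path on the system in question.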

Please let me know if any information is missing.

# lspci
00:00.0 Host bridge: Intel Corporation Sky Lake Host Bridge/DRAM Registers (rev 08)
00:02.0 VGA compatible controller: Intel Corporation Sky Lake Integrated Graphics (rev 07)
00:08.0 System peripheral: Intel Corporation Sky Lake Gaussian Mixture Model
00:14.0 USB controller: Intel Corporation Sunrise Point-LP USB 3.0 xHCI Controller (rev 21)
00:14.2 Signal processing controller: Intel Corporation Sunrise Point-LP Thermal subsystem (rev 21)
00:16.0 Communication controller: Intel Corporation Sunrise Point-LP CSME HECI (rev 21)
00:1c.0 PCI bridge: Intel Corporation Device 9d10 (rev f1)
00:1c.2 PCI bridge: Intel Corporation Device 9d12 (rev f1)
00:1c.4 PCI bridge: Intel Corporation Sunrise Point-LP PCI Express Root Port (rev f1)
00:1f.0 ISA bridge: Intel Corporation Sunrise Point-LP LPC Controller (rev 21)
00:1f.2 Memory controller: Intel Corporation Sunrise Point-LP PMC (rev 21)
00:1f.3 Audio device: Intel Corporation Sunrise Point-LP HD Audio (rev 21)
00:1f.4 SMBus: Intel Corporation Sunrise Point-LP SMBus (rev 21)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection I219-LM (rev 21)
02:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS522A PCI Express Card Reader (rev 01)
04:00.0 Network controller: Intel Corporation Wireless 8260 (rev 3a)
05:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe SSD Controller (rev 01)

# lspci -vvv
00:00.0 Host bridge: Intel Corporation Sky Lake Host Bridge/DRAM Registers (rev 08)
        Subsystem: Lenovo Skylake Host Bridge/DRAM Registers
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ >SERR- <PERR- INTx-
        Latency: 0
        Capabilities: [e0] Vendor Specific Information: Len=10 <?>

00:02.0 VGA compatible controller: Intel Corporation Sky Lake Integrated Graphics (rev 07) (prog-if 00 [VGA controller])
        Subsystem: Lenovo Skylake Integrated Graphics
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Interrupt: pin A routed to IRQ 133
        Region 0: Memory at f0000000 (64-bit, non-prefetchable) [size=16M]
        Region 2: Memory at e0000000 (64-bit, prefetchable) [size=256M]
        Region 4: I/O ports at e000 [size=64]
        Expansion ROM at <unassigned> [disabled]
        Capabilities: [40] Vendor Specific Information: Len=0c <?>
        Capabilities: [70] Express (v2) Root Complex Integrated Endpoint, MSI 00
                DevCap: MaxPayload 128 bytes, PhantFunc 0
                        ExtTag- RBE+
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
                        RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop-
                        MaxPayload 128 bytes, MaxReadReq 128 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
                DevCap2: Completion Timeout: Not Supported, TimeoutDis-, LTR-, OBFF Not Supported
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
        Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable- 64bit-
                Address: fee00018 Data: 0000
        Capabilities: [d0] Power Management version 2
                Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [100 v1] #1b
        Capabilities: [200 v1] Address Translation Service (ATS)
                ATSCap: Invalidate Queue Depth: 00
                ATSCtl: Enable-, Smallest Translation Unit: 00
        Capabilities: [300 v1] #13
        Kernel driver in use: i915_bpo
        Kernel modules: i915_bpo

# for file in /sys/module/i915_bpo/parameters/*; do echo -en $file"\t"; cat $file; done;
/sys/module/i915_bpo/parameters/disable_display N
/sys/module/i915_bpo/parameters/disable_power_well 1
/sys/module/i915_bpo/parameters/edp_vswing 0
/sys/module/i915_bpo/parameters/enable_cmd_parser 1
/sys/module/i915_bpo/parameters/enable_dc -1
/sys/module/i915_bpo/parameters/enable_execlists 1
/sys/module/i915_bpo/parameters/enable_fbc -1
/sys/module/i915_bpo/parameters/enable_guc_submission N
/sys/module/i915_bpo/parameters/enable_hangcheck Y
/sys/module/i915_bpo/parameters/enable_ips 1
/sys/module/i915_bpo/parameters/enable_ppgtt 3
/sys/module/i915_bpo/parameters/enable_psr 0
/sys/module/i915_bpo/parameters/enable_rc6 1
/sys/module/i915_bpo/parameters/fastboot N
/sys/module/i915_bpo/parameters/guc_log_level -1
/sys/module/i915_bpo/parameters/invert_brightness 0
/sys/module/i915_bpo/parameters/load_detect_test N
/sys/module/i915_bpo/parameters/lvds_channel_mode 0
/sys/module/i915_bpo/parameters/lvds_use_ssc -1
/sys/module/i915_bpo/parameters/mmio_debug 0
/sys/module/i915_bpo/parameters/modeset -1
/sys/module/i915_bpo/parameters/nuclear_pageflip N
/sys/module/i915_bpo/parameters/panel_ignore_lid 1
/sys/module/i915_bpo/parameters/prefault_disable N
/sys/module/i915_bpo/parameters/preliminary_hw_support 0
/sys/module/i915_bpo/parameters/reset Y
/sys/module/i915_bpo/parameters/semaphores -1
/sys/module/i915_bpo/parameters/use_mmio_flip 0
/sys/module/i915_bpo/parameters/vbt_sdvo_panel_type -1
/sys/module/i915_bpo/parameters/verbose_state_checks Y

blabber (blabber-m) wrote :
Brad Figg (brad-figg) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:

apport-collect 1567425

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
blabber (blabber-m) wrote :

I cannot run apport-collect

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.6 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.6-rc4-wily/

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
Marco Trevisan (Treviño) (3v1n0) wrote :

I'm seeing much the same on a T460p (also Skylake). After applying these patches I don't see the lockup anymore:

https://bugzilla.kernel.org/attachment.cgi?id=209371
http://comments.gmane.org/gmane.comp.freedesktop.xorg.drivers.intel/71377 (both patches attached)
https://patchwork.freedesktop.org/patch/68712/

However, HDMI still doesn't work properly here, and I'm still getting these errors:
snd_hda_codec_hdmi hdaudioC0D2: HDMI: ELD buf size is 0, force 128
snd_hda_codec_hdmi hdaudioC0D2: HDMI: invalid ELD data byte 0

Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired