[drm:intel_dp_aux_ch [i915_bpo]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!

Bug #1567423 reported by blabber
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Expired
Medium
Unassigned

Bug Description

Hi,

I'm using a new Skylake Thinkpad T460s with and external monitor connected via Displayport and Ubuntu 16.04 and the latest kernel as of today: Linux test 4.4.0-17-generic #33-Ubuntu SMP Tue Mar 29 17:17:28 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

X will from time to time lock up when the computer is left idle for a long time and both displays (laptop and external) go to sleep and then woken up.

Apr 6 16:50:29 test kernel: [33353.303816] [drm:intel_dp_aux_ch [i915_bpo]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Apr 6 16:50:29 test kernel: [33353.319720] [drm:intel_dp_aux_ch [i915_bpo]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Apr 6 16:50:29 test kernel: [33353.335677] [drm:intel_dp_aux_ch [i915_bpo]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Apr 6 16:50:29 test kernel: [33353.351668] [drm:intel_dp_aux_ch [i915_bpo]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Apr 6 16:50:29 test kernel: [33353.367677] [drm:intel_dp_aux_ch [i915_bpo]] *ERROR* dp aux hw did not signal timeout (has irq: 1)!
Apr 6 16:50:29 test kernel: [33353.367692] [drm:intel_dp_aux_ch [i915_bpo]] *ERROR* dp_aux_ch not done status 0xad40001f
Apr 6 16:50:29 test kernel: [33353.391660] ------------[ cut here ]------------
Apr 6 16:50:29 test kernel: [33353.391686] WARNING: CPU: 3 PID: 1153 at /build/linux-bi18CQ/linux-4.4.0/ubuntu/i915/intel_dp.c:833 intel_dp_aux_ch+0xff/0x6b0 [i915_bpo]()
Apr 6 16:50:29 test kernel: [33353.391687] dp_aux_ch not started status 0xad40001f
Apr 6 16:50:29 test kernel: [33353.391688] Modules linked in: btrfs xor raid6_pq ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs libcrc32c msr acpi_call(OE) pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) bnep uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core v4l2_common videodev media snd_usb_audio snd_usbmidi_lib btusb btrtl btbcm btintel bluetooth binfmt_misc nls_iso8859_1 arc4 iwlmvm snd_hda_codec_hdmi mac80211 snd_hda_codec_realtek intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp snd_hda_codec_generic snd_soc_skl snd_soc_skl_ipc iwlwifi snd_hda_ext_core snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_core snd_compress ac97_bus snd_pcm_dmaengine dw_dmac_core snd_hda_intel snd_hda_codec snd_hda_core joydev input_leds snd_hwdep serio_raw snd_pcm cfg80211 thinkpad_acpi snd_seq_midi snd_seq_midi_event nvram snd_rawmidi rtsx_pci_ms snd_seq memstick mei_me snd_seq_device mei shpchp snd_timer snd soundcore tpm_crb mac_hid kvm_intel kvm irqbypass parport_pc ppdev lp parport autofs4 drbg ansi_cprng algif_skcipher af_alg dm_crypt hid_microsoft hid_generic usbhid hid rtsx_pci_sdmmc crct10dif_pclmul crc32_pclmul i915_bpo aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd intel_ips i2c_algo_bit drm_kms_helper syscopyarea sysfillrect psmouse e1000e sysimgblt fb_sys_fops ptp pps_core drm nvme rtsx_pci wmi video fjes
Apr 6 16:50:29 test kernel: [33353.391745] CPU: 3 PID: 1153 Comm: Xorg Tainted: G W OE 4.4.0-16-generic #32-Ubuntu
Apr 6 16:50:29 test kernel: [33353.391746] Hardware name: LENOVO 20F9CTO1WW/20F9CTO1WW, BIOS N1CET40W (1.08 ) 03/09/2016
Apr 6 16:50:29 test kernel: [33353.391747] 0000000000000286 0000000077d2ff7a ffff8800350338d0 ffffffff813e9143
Apr 6 16:50:29 test kernel: [33353.391749] ffff880035033918 ffffffffc0376700 ffff880035033908 ffffffff8107ff22
Apr 6 16:50:29 test kernel: [33353.391751] 00000000ad40001f ffff880501d50000 00000000fffffff0 0000000000000005
Apr 6 16:50:29 test kernel: [33353.391752] Call Trace:
Apr 6 16:50:29 test kernel: [33353.391756] [<ffffffff813e9143>] dump_stack+0x63/0x90
Apr 6 16:50:29 test kernel: [33353.391759] [<ffffffff8107ff22>] warn_slowpath_common+0x82/0xc0
Apr 6 16:50:29 test kernel: [33353.391761] [<ffffffff8107ffbc>] warn_slowpath_fmt+0x5c/0x80
Apr 6 16:50:29 test kernel: [33353.391778] [<ffffffffc032890f>] intel_dp_aux_ch+0xff/0x6b0 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391780] [<ffffffff810c2af0>] ? wake_atomic_t_function+0x60/0x60
Apr 6 16:50:29 test kernel: [33353.391795] [<ffffffffc0328fff>] intel_dp_aux_transfer+0x13f/0x220 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391809] [<ffffffffc02e2728>] ? gen9_read32+0x128/0x390 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391813] [<ffffffffc0171cc4>] drm_dp_dpcd_access+0x64/0x110 [drm_kms_helper]
Apr 6 16:50:29 test kernel: [33353.391817] [<ffffffffc0171e8b>] drm_dp_dpcd_write+0x1b/0x20 [drm_kms_helper]
Apr 6 16:50:29 test kernel: [33353.391831] [<ffffffffc032aedd>] intel_dp_sink_dpms+0x4d/0xe0 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391844] [<ffffffffc0322b10>] intel_ddi_pre_enable+0x110/0x150 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391858] [<ffffffffc03061d1>] haswell_crtc_enable+0x2b1/0x8c0 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391869] [<ffffffffc029f6c8>] ? intel_power_well_enable+0x28/0x50 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391883] [<ffffffffc0302f14>] intel_atomic_commit+0x714/0xab0 [i915_bpo]
Apr 6 16:50:29 test kernel: [33353.391896] [<ffffffffc007891e>] ? drm_atomic_check_only+0x18e/0x590 [drm]
Apr 6 16:50:29 test kernel: [33353.391906] [<ffffffffc0078d57>] drm_atomic_commit+0x37/0x60 [drm]
Apr 6 16:50:29 test kernel: [33353.391910] [<ffffffffc017c846>] drm_atomic_helper_set_config+0x76/0xb0 [drm_kms_helper]
Apr 6 16:50:29 test kernel: [33353.391919] [<ffffffffc0067eb2>] drm_mode_set_config_internal+0x62/0x100 [drm]
Apr 6 16:50:29 test kernel: [33353.391927] [<ffffffffc006c3d2>] drm_mode_setcrtc+0x3d2/0x4f0 [drm]
Apr 6 16:50:29 test kernel: [33353.391934] [<ffffffffc005d752>] drm_ioctl+0x152/0x540 [drm]
Apr 6 16:50:29 test kernel: [33353.391942] [<ffffffffc006c000>] ? drm_mode_setplane+0x1b0/0x1b0 [drm]
Apr 6 16:50:29 test kernel: [33353.391945] [<ffffffff8121f6fc>] do_vfs_ioctl+0x2ac/0x4a0
Apr 6 16:50:29 test kernel: [33353.391947] [<ffffffff8108d071>] ? __set_task_blocked+0x41/0xa0
Apr 6 16:50:29 test kernel: [33353.391948] [<ffffffff8108fa16>] ? __set_current_blocked+0x36/0x60
Apr 6 16:50:29 test kernel: [33353.391950] [<ffffffff8121f969>] SyS_ioctl+0x79/0x90
Apr 6 16:50:29 test kernel: [33353.391952] [<ffffffff81824ef2>] entry_SYSCALL_64_fastpath+0x16/0x71
Apr 6 16:50:29 test kernel: [33353.391954] ---[ end trace fd3ca8d6331f649f ]---
Apr 6 16:50:34 test kernel: [33357.975879] [drm:intel_dp_link_training_clock_recovery [i915_bpo]] *ERROR* failed to enable link training
Apr 6 16:50:35 test kernel: [33358.743909] [drm:intel_dp_start_link_train [i915_bpo]] *ERROR* failed to start channel equalization
Apr 6 16:50:35 test kernel: [33359.597556] ------------[ cut here ]------------

Attached is the full kernel log from boot until the issue happens, at which point, if I have a lockup of X I need to reboot to get back to normal.

I don't know if this particular bug I'm reporting is the one responsible for the lockups, as I've isolated 4 different warning/error messages which separately or in combination may be the cause. I'm reporting all of them now in separate bug reports.

Please let me know if any information is missing.

# lspci
00:00.0 Host bridge: Intel Corporation Sky Lake Host Bridge/DRAM Registers (rev 08)
00:02.0 VGA compatible controller: Intel Corporation Sky Lake Integrated Graphics (rev 07)
00:08.0 System peripheral: Intel Corporation Sky Lake Gaussian Mixture Model
00:14.0 USB controller: Intel Corporation Sunrise Point-LP USB 3.0 xHCI Controller (rev 21)
00:14.2 Signal processing controller: Intel Corporation Sunrise Point-LP Thermal subsystem (rev 21)
00:16.0 Communication controller: Intel Corporation Sunrise Point-LP CSME HECI (rev 21)
00:1c.0 PCI bridge: Intel Corporation Device 9d10 (rev f1)
00:1c.2 PCI bridge: Intel Corporation Device 9d12 (rev f1)
00:1c.4 PCI bridge: Intel Corporation Sunrise Point-LP PCI Express Root Port (rev f1)
00:1f.0 ISA bridge: Intel Corporation Sunrise Point-LP LPC Controller (rev 21)
00:1f.2 Memory controller: Intel Corporation Sunrise Point-LP PMC (rev 21)
00:1f.3 Audio device: Intel Corporation Sunrise Point-LP HD Audio (rev 21)
00:1f.4 SMBus: Intel Corporation Sunrise Point-LP SMBus (rev 21)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection I219-LM (rev 21)
02:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS522A PCI Express Card Reader (rev 01)
04:00.0 Network controller: Intel Corporation Wireless 8260 (rev 3a)
05:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe SSD Controller (rev 01)

# lspci -vvv
00:00.0 Host bridge: Intel Corporation Sky Lake Host Bridge/DRAM Registers (rev 08)
        Subsystem: Lenovo Skylake Host Bridge/DRAM Registers
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ >SERR- <PERR- INTx-
        Latency: 0
        Capabilities: [e0] Vendor Specific Information: Len=10 <?>

00:02.0 VGA compatible controller: Intel Corporation Sky Lake Integrated Graphics (rev 07) (prog-if 00 [VGA controller])
        Subsystem: Lenovo Skylake Integrated Graphics
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Interrupt: pin A routed to IRQ 133
        Region 0: Memory at f0000000 (64-bit, non-prefetchable) [size=16M]
        Region 2: Memory at e0000000 (64-bit, prefetchable) [size=256M]
        Region 4: I/O ports at e000 [size=64]
        Expansion ROM at <unassigned> [disabled]
        Capabilities: [40] Vendor Specific Information: Len=0c <?>
        Capabilities: [70] Express (v2) Root Complex Integrated Endpoint, MSI 00
                DevCap: MaxPayload 128 bytes, PhantFunc 0
                        ExtTag- RBE+
                DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported-
                        RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop-
                        MaxPayload 128 bytes, MaxReadReq 128 bytes
                DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend-
                DevCap2: Completion Timeout: Not Supported, TimeoutDis-, LTR-, OBFF Not Supported
                DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis-, LTR-, OBFF Disabled
        Capabilities: [ac] MSI: Enable+ Count=1/1 Maskable- 64bit-
                Address: fee00018 Data: 0000
        Capabilities: [d0] Power Management version 2
                Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [100 v1] #1b
        Capabilities: [200 v1] Address Translation Service (ATS)
                ATSCap: Invalidate Queue Depth: 00
                ATSCtl: Enable-, Smallest Translation Unit: 00
        Capabilities: [300 v1] #13
        Kernel driver in use: i915_bpo
        Kernel modules: i915_bpo

# for file in /sys/module/i915_bpo/parameters/*; do echo -en $file"\t"; cat $file; done;
/sys/module/i915_bpo/parameters/disable_display N
/sys/module/i915_bpo/parameters/disable_power_well 1
/sys/module/i915_bpo/parameters/edp_vswing 0
/sys/module/i915_bpo/parameters/enable_cmd_parser 1
/sys/module/i915_bpo/parameters/enable_dc -1
/sys/module/i915_bpo/parameters/enable_execlists 1
/sys/module/i915_bpo/parameters/enable_fbc -1
/sys/module/i915_bpo/parameters/enable_guc_submission N
/sys/module/i915_bpo/parameters/enable_hangcheck Y
/sys/module/i915_bpo/parameters/enable_ips 1
/sys/module/i915_bpo/parameters/enable_ppgtt 3
/sys/module/i915_bpo/parameters/enable_psr 0
/sys/module/i915_bpo/parameters/enable_rc6 1
/sys/module/i915_bpo/parameters/fastboot N
/sys/module/i915_bpo/parameters/guc_log_level -1
/sys/module/i915_bpo/parameters/invert_brightness 0
/sys/module/i915_bpo/parameters/load_detect_test N
/sys/module/i915_bpo/parameters/lvds_channel_mode 0
/sys/module/i915_bpo/parameters/lvds_use_ssc -1
/sys/module/i915_bpo/parameters/mmio_debug 0
/sys/module/i915_bpo/parameters/modeset -1
/sys/module/i915_bpo/parameters/nuclear_pageflip N
/sys/module/i915_bpo/parameters/panel_ignore_lid 1
/sys/module/i915_bpo/parameters/prefault_disable N
/sys/module/i915_bpo/parameters/preliminary_hw_support 0
/sys/module/i915_bpo/parameters/reset Y
/sys/module/i915_bpo/parameters/semaphores -1
/sys/module/i915_bpo/parameters/use_mmio_flip 0
/sys/module/i915_bpo/parameters/vbt_sdvo_panel_type -1
/sys/module/i915_bpo/parameters/verbose_state_checks Y

Revision history for this message
blabber (blabber-m) wrote :
Revision history for this message
Brad Figg (brad-figg) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:

apport-collect 1567423

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Revision history for this message
blabber (blabber-m) wrote :

I cannot run apport-collect

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Revision history for this message
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v4.6 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.6-rc4-wily/

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
Revision history for this message
penalvch (penalvch) wrote :

blabber, in addition to https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1567423/comments/4 , could you please execute the following command only once, as it will automatically gather debugging information, in a terminal:
apport-collect 1567423

Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Bug attachments

Remote bug watches

Bug watches keep track of this bug in other bug trackers.