Random system lockup with " BUG: unable to handle kernel NULL pointer dereference at 0000000000000018", "IP: gen8_ppgtt_alloc_page_directories.isra.38+0x115/0x250 [i915]"

Bug #1693357 reported by Daniel Holbert
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Incomplete
Undecided
Unassigned

Bug Description

When starting a Firefox compilation job earlier today, my system locked up; ctrl-alt-f1 had no effect, etc. (I was able to restart with the REISUB technique.)

I'm on 64-bit Ubuntu Zesty, and I was running a kernel version that's currently in zesty-proposed to have the fix for https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1674838 .

"uname -a" output:
Linux olive 4.10.0-22-generic #24-Ubuntu SMP Mon May 22 17:43:20 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

Here's the relevant snippet from kern.log when the hang happened:
============
May 24 15:43:01 olive kernel: [17665.725782] BUG: unable to handle kernel NULL pointer dereference at 0000000000000018
May 24 15:43:01 olive kernel: [17665.725848] IP: gen8_ppgtt_alloc_page_directories.isra.38+0x115/0x250 [i915]
May 24 15:43:01 olive kernel: [17665.725879] PGD 0
May 24 15:43:01 olive kernel: [17665.725880]
May 24 15:43:01 olive kernel: [17665.725898] Oops: 0002 [#1] SMP
May 24 15:43:01 olive kernel: [17665.725912] Modules linked in: msr ccm pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) rfcomm xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 bridge stp llc ebtable_filter ebtables cmac bnep binfmt_misc algif_skcipher af_alg nls_iso8859_1 dm_crypt intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate intel_rapl_perf btusb btrtl btbcm btintel bluetooth arc4 uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev input_leds media joydev intel_pch_thermal serio_raw snd_hda_codec_hdmi snd_hda_codec_realtek iwlmvm snd_hda_codec_generic mac80211 lpc_ich snd_hda_intel thinkpad_acpi
May 24 15:43:01 olive kernel: [17665.726220] snd_hda_codec nvram snd_hda_core snd_hwdep snd_seq_midi snd_seq_midi_event iwlwifi snd_rawmidi snd_pcm rtsx_pci_ms cfg80211 snd_seq memstick snd_seq_device snd_timer snd shpchp mei_me mei soundcore mac_hid ip6t_REJECT nf_reject_ipv6 nf_log_ipv6 xt_hl ip6t_rt nf_conntrack_ipv6 nf_defrag_ipv6 ipt_REJECT nf_reject_ipv4 xt_comment nf_log_ipv4 nf_log_common xt_LOG xt_limit xt_tcpudp xt_addrtype nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack ip6table_filter ip6_tables nf_conntrack_netbios_ns nf_conntrack_broadcast nf_nat_ftp nf_nat libcrc32c nf_conntrack_ftp nf_conntrack iptable_filter parport_pc ppdev lp parport ip_tables x_tables autofs4 hid_generic usbhid hid rtsx_pci_sdmmc i915 nouveau mxm_wmi psmouse i2c_algo_bit ttm rtsx_pci e1000e drm_kms_helper ahci libahci syscopyarea sysfillrect ptp
May 24 15:43:01 olive kernel: [17665.726530] sysimgblt pps_core fb_sys_fops drm wmi fjes video
May 24 15:43:01 olive kernel: [17665.726562] CPU: 1 PID: 1794 Comm: Xorg Tainted: G OE 4.10.0-22-generic #24-Ubuntu
May 24 15:43:01 olive kernel: [17665.726599] Hardware name: LENOVO 20E2CTO1WW/20E2CTO1WW, BIOS N11ET41W (1.17 ) 03/01/2017
May 24 15:43:01 olive kernel: [17665.726630] task: ffff8fdfc78d9680 task.stack: ffffb9324304c000
May 24 15:43:01 olive kernel: [17665.726677] RIP: 0010:gen8_ppgtt_alloc_page_directories.isra.38+0x115/0x250 [i915]
May 24 15:43:01 olive kernel: [17665.726709] RSP: 0018:ffffb9324304f898 EFLAGS: 00010246
May 24 15:43:01 olive kernel: [17665.726735] RAX: ffff8fdfc1cf4cc0 RBX: 0000000000000003 RCX: 0000000000000003
May 24 15:43:01 olive kernel: [17665.726767] RDX: 0000000000000000 RSI: ffff8fde2c46c000 RDI: ffff8fdfc1d50000
May 24 15:43:01 olive kernel: [17665.726799] RBP: ffffb9324304f8f0 R08: 0000000000000000 R09: 0000000000000000
May 24 15:43:01 olive kernel: [17665.726830] R10: 0000000000000000 R11: 0000000000000001 R12: ffff8fdc8a002000
May 24 15:43:01 olive kernel: [17665.726862] R13: ffff8fdfca605630 R14: 00000000ff14e000 R15: 0000000000010000
May 24 15:43:01 olive kernel: [17665.726893] FS: 00007f8241ad1a40(0000) GS:ffff8fdfddc40000(0000) knlGS:0000000000000000
May 24 15:43:01 olive kernel: [17665.726928] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 24 15:43:01 olive kernel: [17665.726953] CR2: 0000000000000018 CR3: 000000044a7eb000 CR4: 00000000003406e0
May 24 15:43:01 olive kernel: [17665.726982] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
May 24 15:43:01 olive kernel: [17665.727015] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
May 24 15:43:01 olive kernel: [17665.727044] Call Trace:
May 24 15:43:01 olive kernel: [17665.727079] gen8_alloc_va_range_3lvl+0xfb/0x9e0 [i915]
May 24 15:43:01 olive kernel: [17665.727106] ? swiotlb_map_sg_attrs+0x49/0x110
May 24 15:43:01 olive kernel: [17665.727145] gen8_alloc_va_range+0x23d/0x470 [i915]
May 24 15:43:01 olive kernel: [17665.727187] i915_vma_bind+0x7e/0x170 [i915]
May 24 15:43:01 olive kernel: [17665.727225] __i915_vma_do_pin+0x2a5/0x450 [i915]
May 24 15:43:01 olive kernel: [17665.727265] i915_gem_execbuffer_reserve_vma.isra.31+0x144/0x1b0 [i915]
May 24 15:43:01 olive kernel: [17665.727310] i915_gem_execbuffer_reserve.isra.32+0x39e/0x3d0 [i915]
May 24 15:43:01 olive kernel: [17665.727355] i915_gem_do_execbuffer.isra.38+0x4ca/0x15c0 [i915]
May 24 15:43:01 olive kernel: [17665.727383] ? radix_tree_lookup_slot+0x22/0x50
May 24 15:43:01 olive kernel: [17665.727405] ? find_get_entry+0x20/0x100
May 24 15:43:01 olive kernel: [17665.727441] i915_gem_execbuffer2+0xa1/0x1e0 [i915]
May 24 15:43:01 olive kernel: [17665.727474] drm_ioctl+0x21b/0x4c0 [drm]
May 24 15:43:01 olive kernel: [17665.727510] ? i915_gem_execbuffer+0x310/0x310 [i915]
May 24 15:43:01 olive kernel: [17665.727536] ? __switch_to_xtra+0x13f/0x150
May 24 15:43:01 olive kernel: [17665.727558] do_vfs_ioctl+0xa3/0x610
May 24 15:43:01 olive kernel: [17665.727577] ? __do_page_fault+0x266/0x4e0
May 24 15:43:01 olive kernel: [17665.727598] SyS_ioctl+0x79/0x90
May 24 15:43:01 olive kernel: [17665.727614] entry_SYSCALL_64_fastpath+0x1e/0xad
May 24 15:43:01 olive kernel: [17665.727634] RIP: 0033:0x7f823f4ed987
May 24 15:43:01 olive kernel: [17665.727652] RSP: 002b:00007ffed279f4c8 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
May 24 15:43:01 olive kernel: [17665.727684] RAX: ffffffffffffffda RBX: 0000564a6a723410 RCX: 00007f823f4ed987
May 24 15:43:01 olive kernel: [17665.727716] RDX: 00007ffed279f510 RSI: 00000000c0406469 RDI: 000000000000000e
May 24 15:43:01 olive kernel: [17665.727747] RBP: 00000000000000be R08: 0000000000000000 R09: 0000000000000000
May 24 15:43:01 olive kernel: [17665.727781] R10: 00000000000049a0 R11: 0000000000003246 R12: 0000000000000141
May 24 15:43:01 olive kernel: [17665.727814] R13: 00007ffed279ff78 R14: 0000000000000047 R15: 0000000000000354
May 24 15:43:01 olive kernel: [17665.727844] Code: e6 48 8b 90 20 03 00 00 48 8b b8 d8 02 00 00 48 8b 52 08 48 83 ca 03 e8 aa cd ff ff 48 8b 45 b0 48 8b 4d c8 48 8b 10 48 8b 45 d0 <4c> 89 24 ca 48 0f ab 08 0f 1f 44 00 00 e9 53 ff ff ff 65 8b 05
May 24 15:43:01 olive kernel: [17665.727963] RIP: gen8_ppgtt_alloc_page_directories.isra.38+0x115/0x250 [i915] RSP: ffffb9324304f898
May 24 15:43:01 olive kernel: [17665.728002] CR2: 0000000000000018
May 24 15:43:01 olive kernel: [17665.752162] ---[ end trace d6e694cc9f20f7dc ]---
============
---
ApportVersion: 2.20.4-0ubuntu4.1
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC1: dholbert 2798 F.... pulseaudio
 /dev/snd/controlC0: dholbert 2798 F.... pulseaudio
CurrentDesktop: GNOME
DistroRelease: Ubuntu 17.04
EcryptfsInUse: Yes
HibernationDevice: RESUME=UUID=f43c97c3-76e5-4eb9-ad7e-b3ce807c19b7
InstallationDate: Installed on 2015-11-07 (564 days ago)
InstallationMedia: Ubuntu 15.10 "Wily Werewolf" - Release amd64 (20151021)
MachineType: LENOVO 20E2CTO1WW
Package: linux (not installed)
ProcFB: 0 inteldrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.10.0-22-generic.efi.signed root=UUID=2a4922db-a7e1-46df-81f6-2fe9d3d8b90b ro quiet splash vt.handoff=7
ProcVersionSignature: Ubuntu 4.10.0-22.24-generic 4.10.15
RelatedPackageVersions:
 linux-restricted-modules-4.10.0-22-generic N/A
 linux-backports-modules-4.10.0-22-generic N/A
 linux-firmware 1.164.1
Tags: zesty
Uname: Linux 4.10.0-22-generic x86_64
UpgradeStatus: Upgraded to zesty on 2017-05-10 (13 days ago)
UserGroups: adm cdrom dip libvirt libvirtd lpadmin plugdev sambashare sudo
_MarkForUpload: True
dmi.bios.date: 03/01/2017
dmi.bios.vendor: LENOVO
dmi.bios.version: N11ET41W (1.17 )
dmi.board.asset.tag: Not Available
dmi.board.name: 20E2CTO1WW
dmi.board.vendor: LENOVO
dmi.board.version: SDK0E50510 WIN
dmi.chassis.asset.tag: No Asset Information
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.version: None
dmi.modalias: dmi:bvnLENOVO:bvrN11ET41W(1.17):bd03/01/2017:svnLENOVO:pn20E2CTO1WW:pvrThinkPadW550s:rvnLENOVO:rn20E2CTO1WW:rvrSDK0E50510WIN:cvnLENOVO:ct10:cvrNone:
dmi.product.name: 20E2CTO1WW
dmi.product.version: ThinkPad W550s
dmi.sys.vendor: LENOVO

Revision history for this message
Daniel Holbert (dholbert) wrote :
summary: - Random system lockup
+ Random system lockup with " BUG: unable to handle kernel NULL pointer
+ dereference at 0000000000000018", "IP:
+ gen8_ppgtt_alloc_page_directories.isra.38+0x115/0x250 [i915]"
Revision history for this message
Brad Figg (brad-figg) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:

apport-collect 1693357

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Revision history for this message
Daniel Holbert (dholbert) wrote :

(I initially reported this issue via a comment on another bug, being unsure if it was that bug or not:
 https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1674838/comments/208

I'm filing this as its own bug as directed there. Let me know if there's any other information that I can/should capture from my logs that would be useful.)

Revision history for this message
Daniel Holbert (dholbert) wrote : AlsaInfo.txt

apport information

tags: added: apport-collected zesty
description: updated
Revision history for this message
Daniel Holbert (dholbert) wrote : CRDA.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : CurrentDmesg.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : IwConfig.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : JournalErrors.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : Lspci.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : Lsusb.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : ProcCpuinfo.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : ProcCpuinfoMinimal.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : ProcEnviron.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : ProcInterrupts.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : ProcModules.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : PulseList.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : RfKill.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : UdevDb.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote : WifiSyslog.txt

apport information

Revision history for this message
Daniel Holbert (dholbert) wrote :

Over on the bug where I initially brought this up, Rocko tells me:
===
your lockup looks like https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1680904 (and the upstream bug is https://bugs.freedesktop.org/show_bug.cgi?id=100516). It's a bug in the Intel graphics drivers that unfortunately is present in both kernels 4.10 and 4.11, but should be fixed in 4.12.
===

Indeed, my kern.log looks very similar to the output in the first comment on that bug. So I've marked this as a duplicate.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.