6.2 kernel nouveau driver crashes on Dell Precision 3570 laptops when it worked in 5.9 kernels

Bug #2031064 reported by Russell Coker
30
This bug affects 6 people
Affects Status Importance Assigned to Milestone
linux-hwe-6.2 (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

So far 2 of the Precision 3570 laptops running Ubuntu 22.04 on my network have had problems with the 6.2 kernel update. This is with the latest Dell BIOS and everything is up to date with all kernel patches.

Here are some relevant parts of the dmesg log:

[ 2.206821] VGA switcheroo: detected Optimus DSM method \_SB_.PC00.PEG2.PEGP handle
[ 2.206822] nouveau: detected PR support, will not use DSM
[ 2.207310] nouveau 0000:01:00.0: NVIDIA TU117 (167000a1)
[ 2.262620] usb 3-10: new full-speed USB device number 3 using xhci_hcd
[ 2.288809] nouveau 0000:01:00.0: bios: version 90.17.95.00.a6
[ 2.290228] nouveau 0000:01:00.0: pmu: firmware unavailable
[ 2.290747] nouveau 0000:01:00.0: fb: 4096 MiB GDDR6
[ 2.302711] nouveau 0000:01:00.0: sec2: halted
[ 2.302716] fbcon: Taking over console
[ 2.302722] nouveau 0000:01:00.0: sec2: TRACEPC SCTL 00007021 TIDX 1b1f0000
[ 2.302726] nouveau 0000:01:00.0: sec2: TRACEPC: 000023ac
[ 2.302729] nouveau 0000:01:00.0: sec2: TRACEPC: 00002055
[ 2.302731] nouveau 0000:01:00.0: sec2: TRACEPC: 00002305
[ 2.302734] nouveau 0000:01:00.0: sec2: TRACEPC: 0000201c
...
[ 2.336878] [drm] Initialized nouveau 1.3.1 20120801 for 0000:01:00.0 on minor 0
[ 2.336919] nouveau 0000:01:00.0: [drm] Cannot find any crtc or sizes
[ 2.351161] i915 0000:00:02.0: [drm] VT-d active for gfx access
[ 2.351164] i915 0000:00:02.0: vgaarb: deactivate vga console
[ 2.351205] i915 0000:00:02.0: [drm] Using Transparent Hugepages
[ 2.351699] i915 0000:00:02.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=io+mem:owns=io+mem
[ 2.353297] i915 0000:00:02.0: [drm] Finished loading DMC firmware i915/adlp_dmc_ver2_16.bin (v2.16)
[ 2.413089] usb 3-10: New USB device found, idVendor=8087, idProduct=0033, bcdDevice= 0.00
[ 2.413094] usb 3-10: New USB device strings: Mfr=0, Product=0, SerialNumber=0
[ 2.464266] i915 0000:00:02.0: [drm] GuC firmware i915/adlp_guc_70.bin version 70.5.1
[ 2.464272] i915 0000:00:02.0: [drm] HuC firmware i915/tgl_huc.bin version 7.9.3
[ 2.464894] input: PS/2 Generic Mouse as /devices/platform/i8042/serio1/input/input5
...
[ 2.481137] i915 0000:00:02.0: [drm] GuC RC: enabled
[ 2.481930] i915 0000:00:02.0: [drm] Protected Xe Path (PXP) protected content support initialize
d
[ 2.486590] hid-generic 001F:8087:0AC2.0004: hidraw3: SENSOR HUB HID v2.00 Device [hid-ishtp 8087
:0AC2] on
[ 2.489060] hid-generic 001F:8087:0AC2.0005: hidraw4: SENSOR HUB HID v2.00 Device [hid-ishtp 8087
:0AC2] on
[ 3.749235] [drm] Initialized i915 1.6.0 20201103 for 0000:00:02.0 on minor 1
[ 3.749827] ACPI: video: Video Device [PEGP] (multi-head: no rom: yes post: no)
[ 3.749903] input: Video Bus as /devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A08:00/device:05/LNXVIDEO:00
/input/input13
[ 3.752939] ACPI: video: Video Device [GFX0] (multi-head: yes rom: no post: no)
[ 3.754035] input: Video Bus as /devices/LNXSYSTM:00/LNXSYBUS:00/PNP0A08:00/LNXVIDEO:01/input/input14
[ 3.754222] vga_switcheroo: enabled
[ 3.756162] fbcon: i915drmfb (fb0) is primary device
[ 3.761314] Console: switching to colour frame buffer device 240x67
[ 3.781787] i915 0000:00:02.0: [drm] fb0: i915drmfb frame buffer device
[ 12.324375] nouveau 0000:01:00.0: sec2: halted
[ 12.324391] nouveau 0000:01:00.0: sec2: TRACEPC SCTL 00007021 TIDX 1b1f0000
[ 12.324395] nouveau 0000:01:00.0: sec2: TRACEPC: 000023ac
[ 12.324397] nouveau 0000:01:00.0: sec2: TRACEPC: 00002055
[ 12.324399] nouveau 0000:01:00.0: sec2: TRACEPC: 00002305
...
[ 108.900212] ACPI Error: Aborting method \_SB.IETM._OSC due to previous error (AE_NOT_FOUND) (20221020/psparse-529)
[ 116.114703] nouveau 0000:01:00.0: sec2:cmdq: timeout waiting for queue ready
[ 116.114731] nouveau 0000:01:00.0: gr: init failed, -110
[ 118.137764] ------------[ cut here ]------------
[ 118.137767] nouveau 0000:01:00.0: timeout
[ 118.137812] WARNING: CPU: 2 PID: 1563 at drivers/gpu/drm/nouveau/nvkm/engine/gr/gf100.c:840 gf100_gr_fecs_bind_pointer+0x105/0x120 [nouveau]
[ 118.137891] Modules linked in: cmac algif_hash algif_skcipher af_alg snd_sof_pci_intel_tgl bnep snd_hda_codec_hdmi snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_ctl_led snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus snd_soc_core snd_hda_codec_realtek joydev snd_hda_codec_generic snd_compress ac97_bus snd_pcm_dmaengine snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec x86_pkg_temp_thermal snd_hda_core intel_powerclamp snd_hwdep coretemp iwlmvm snd_pcm kvm_intel snd_seq_midi dell_rbtn mac80211 snd_seq_midi_event btusb dell_laptop kvm btrtl snd_rawmidi uvcvideo btbcm irqbypass binfmt_misc libarc4 videobuf2_vmalloc btintel snd_seq rapl snd_seq_device processor_thermal_device_pci dell_wmi mei_pxp mei_hdcp videobuf2_memops btmtk videobuf2_v4l2 input_leds processor_thermal_device intel_rapl_msr dell_smbios nls_iso8859_1
[ 118.137916] iwlwifi intel_cstate videodev snd_timer dell_wmi_ddv processor_thermal_rfim bluetooth dell_wmi_sysman cmdlinepart ucsi_acpi dcdbas serio_raw mei_me ledtrig_audio processor_thermal_mbox firmware_attributes_class dell_wmi_descriptor videobuf2_common spi_nor typec_ucsi ecdh_generic wmi_bmof snd processor_thermal_rapl mc cfg80211 mei mtd ecc hid_multitouch soundcore intel_rapl_common igen6_edac typec int3403_thermal intel_hid int340x_thermal_zone mac_hid int3400_thermal acpi_tad sparse_keymap acpi_thermal_rel acpi_pad sch_fq_codel msr parport_pc ppdev lp ramoops parport pstore_blk reed_solomon pstore_zone efi_pstore ip_tables x_tables autofs4 btrfs blake2b_generic xor raid6_pq libcrc32c dm_crypt hid_sensor_custom hid_sensor_hub intel_ishtp_hid i915 nouveau mxm_wmi drm_buddy nvme i2c_algo_bit drm_ttm_helper nvme_core ttm nvme_common ahci drm_display_helper hid_generic libahci cec rc_core drm_kms_helper crct10dif_pclmul syscopyarea crc32_pclmul sysfillrect polyval_clmulni
[ 118.137946] intel_lpss_pci sysimgblt polyval_generic ghash_clmulni_intel sha512_ssse3 aesni_intel crypto_simd rtsx_pci_sdmmc spi_intel_pci i2c_i801 intel_ish_ipc intel_lpss drm cryptd psmouse e1000e xhci_pci thunderbolt spi_intel rtsx_pci i2c_smbus intel_ishtp idma64 vmd i2c_hid_acpi xhci_pci_renesas video i2c_hid hid wmi pinctrl_tigerlake
[ 118.137959] CPU: 2 PID: 1563 Comm: gnome-shell Not tainted 6.2.0-26-generic #26~22.04.1-Ubuntu
[ 118.137961] Hardware name: Dell Inc. Precision 3570/07RFGX, BIOS 1.13.2 05/10/2023
[ 118.137961] RIP: 0010:gf100_gr_fecs_bind_pointer+0x105/0x120 [nouveau]
[ 118.138013] Code: 45 c0 48 8b 40 10 48 8b 78 10 4c 8b 67 50 4d 85 e4 74 20 e8 ed 24 74 de 4c 89 e2 48 c7 c7 dc b2 c6 c0 48 89 c6 e8 ab 4c d7 dd <0f> 0b b8 92 ff ff ff eb a3 4c 8b 27 eb db e8 08 de d1 de 0f 1f 84
[ 118.138014] RSP: 0018:ffffa2ad8421b8f0 EFLAGS: 00010246
[ 118.138015] RAX: 0000000000000000 RBX: ffff971963f4b400 RCX: 0000000000000000
[ 118.138016] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 118.138016] RBP: ffffa2ad8421b930 R08: 0000000000000000 R09: 0000000000000000
[ 118.138017] R10: 0000000000000000 R11: 0000000000000000 R12: ffff9719422cac30
[ 118.138017] R13: 0000000000000000 R14: ffff9719608c3300 R15: ffff971944331f00
[ 118.138018] FS: 00007f293b7c85c0(0000) GS:ffff97288f480000(0000) knlGS:0000000000000000
[ 118.138018] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 118.138019] CR2: 000055569b830000 CR3: 0000000115950000 CR4: 0000000000750ee0
[ 118.138020] PKRU: 55555554
[ 118.138020] Call Trace:
[ 118.138021] <TASK>
[ 118.138023] gf100_grctx_generate+0x266/0x670 [nouveau]
[ 118.138074] gf100_gr_chan_new+0x487/0x4d0 [nouveau]
[ 118.138119] nvkm_gr_cclass_new+0x30/0x70 [nouveau]
[ 118.138166] nvkm_cgrp_ectx_get+0x179/0x260 [nouveau]
[ 118.138218] nvkm_cgrp_vctx_get+0xe8/0x2f0 [nouveau]
[ 118.138266] nvkm_chan_cctx_get+0x11a/0x260 [nouveau]
[ 118.138314] ? nvkm_object_ctor+0x26/0xb0 [nouveau]
[ 118.138342] nvkm_uchan_object_new+0xe3/0x210 [nouveau]
[ 118.138390] ? nvkm_gr_oclass_get+0x30/0xc0 [nouveau]
[ 118.138435] ? nvkm_subdev_ref+0x91/0xf0 [nouveau]
[ 118.138464] nvkm_ioctl_new+0x176/0x2c0 [nouveau]
[ 118.138510] ? __pfx_nvkm_uchan_object_new+0x10/0x10 [nouveau]
[ 118.138562] ? __pfx_gf100_gr_object_new+0x10/0x10 [nouveau]
[ 118.138614] nvkm_ioctl+0x12e/0x290 [nouveau]
[ 118.138646] nvkm_client_ioctl+0xe/0x20 [nouveau]
[ 118.138701] nvif_client_ioctl+0x1c/0x30 [nouveau]
[ 118.138728] usif_ioctl+0x27a/0x400 [nouveau]
[ 118.138779] nouveau_drm_ioctl+0xbd/0xd0 [nouveau]
[ 118.138828] __x64_sys_ioctl+0x9a/0xe0
[ 118.138831] do_syscall_64+0x59/0x90
[ 118.138834] ? do_syscall_64+0x69/0x90
[ 118.138836] ? ksys_mmap_pgoff+0x123/0x270
[ 118.138839] ? exit_to_user_mode_prepare+0x3b/0xd0
[ 118.138842] ? syscall_exit_to_user_mode+0x2a/0x50
[ 118.138844] ? do_syscall_64+0x69/0x90
[ 118.138845] entry_SYSCALL_64_after_hwframe+0x72/0xdc
[ 118.138848] RIP: 0033:0x7f294071aaff
[ 118.138850] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <41> 89 c0 3d 00 f0 ff ff 77 1f 48 8b 44 24 18 64 48 2b 04 25 28 00
[ 118.138851] RSP: 002b:00007ffed692de50 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 118.138852] RAX: ffffffffffffffda RBX: 000055569b7c99b0 RCX: 00007f294071aaff
[ 118.138853] RDX: 000055569b7c99b0 RSI: 00000000c0386447 RDI: 0000000000000012
[ 118.138854] RBP: 00000000c0386447 R08: 0000000000000038 R09: 000055569b7d47f0
[ 118.138854] R10: 00007f2940819d10 R11: 0000000000000246 R12: 000055569b81cb10
[ 118.138855] R13: 0000000000000012 R14: 0000000000000000 R15: 000055569b7c99b0
[ 118.138856] </TASK>
[ 118.138857] ---[ end trace 0000000000000000 ]---

After that the graphics hangs for a while, doesn't even allow CTRL-ALT-F3 and similar to change to a different virtual console.

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: linux-modules-extra-6.2.0-26-generic 6.2.0-26.26~22.04.1
ProcVersionSignature: Ubuntu 6.2.0-26.26~22.04.1-generic 6.2.13
Uname: Linux 6.2.0-26-generic x86_64
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
CasperMD5CheckResult: pass
Date: Fri Aug 11 13:06:20 2023
Dependencies:
 linux-modules-6.2.0-26-generic 6.2.0-26.26~22.04.1
 wireless-regdb 2022.06.06-0ubuntu1~22.04.1
InstallationDate: Installed on 2023-08-11 (0 days ago)
InstallationMedia: Ubuntu 22.04.2 LTS "Jammy Jellyfish" - Release amd64 (20230223)
ProcEnviron:
 LANGUAGE=en_AU:en
 TERM=xterm-256color
 PATH=(custom, no user)
 LANG=en_AU.UTF-8
 SHELL=/bin/bash
SourcePackage: linux-hwe-6.2
UpgradeStatus: No upgrade log present (probably fresh install)

Revision history for this message
Russell Coker (etbe-sm) wrote :
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-hwe-6.2 (Ubuntu):
status: New → Confirmed
Revision history for this message
Clément Duval (cduvalwitekio) wrote (last edit ):

nouveau driver is indeed affected.
The workaround to be able to use kernel 6.2 is by selecting NVIDIA driver nvidia-driver-535 (proprietary, tested) from menu Software & updates / Additionnal Drivers

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.