Xorg freeze

Bug #2071565 reported by G Shoukry
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
kwin (Ubuntu)
New
Undecided
Unassigned
nvidia-graphics-drivers-535 (Ubuntu)
New
Undecided
Unassigned

Bug Description

running team fortress 2 on my computer can produce a crash if ran in 144hz.

I have received an error message which might make debugging trivial to an experienced user:

[ 230.929193] UBSAN: array-index-out-of-bounds in build/nvidia/535.183.01/build/nvidia-uvm/uvm_pmm_gpu.c:26414:71
[ 230.930169] index 0 is out of range for type 'uvm_gpu_chunk_t *[*]'
[ 230.930363] UBSAN: array-index-out-of-bounds in build/nvidia/535.183.01/build/nvidia-uvm/uvm_pmm_gpu.c:829:45
[ 230.930363] index 0 is out of range for type 'uvm_gpu_chunk_t *[*]'
[ 230.931019] UBSAN: array-index-out-of-bounds in build/nvidia/535.183.01/build/nvidia-uvm/uvm_pmm_gpu.c:857:39
[ 230.931210] index 0 is out of range for type 'uvm_gpu_chunk_t *[*]'

ProblemType: Bug
DistroRelease: Ubuntu 24.04
Package: xorg 1:7.7+23ubuntu3
ProcVersionSignature: Ubuntu 6.8.0-36.36-generic 6.8.4
Uname: Linux 6.8.0-36-generic x86_64
NonfreeKernelModules: nvidia_modeset nvidia
.proc.driver.nvidia.capabilities.gpu0: Error: path was not a regular file.
.proc.driver.nvidia.capabilities.mig: Error: path was not a regular file.
.proc.driver.nvidia.gpus.0000.01.00.0: Error: path was not a regular file.
.proc.driver.nvidia.registry: Binary: ""
.proc.driver.nvidia.suspend: suspend hibernate resume
.proc.driver.nvidia.suspend_depth: default modeset uvm
.proc.driver.nvidia.version:
 NVRM version: NVIDIA UNIX x86_64 Kernel Module 535.183.01 Sun May 12 19:39:15 UTC 2024
 GCC version:
ApportVersion: 2.28.1-0ubuntu3
Architecture: amd64
BootLog: Error: [Errno 13] Permission denied: '/var/log/boot.log'
CasperMD5CheckResult: unknown
CompositorRunning: None
CurrentDesktop: KDE
Date: Sun Jun 30 14:21:39 2024
DistUpgraded: 2024-05-21 20:08:10,009 DEBUG new topwidget None
DistroCodename: noble
DistroVariant: ubuntu
ExtraDebuggingInterest: Yes, if not too technical
GpuHangFrequency: Several times a day
GpuHangReproducibility: Yes, I can easily reproduce it
GpuHangStarted: Since before I upgraded
GraphicsCard:
 NVIDIA Corporation AD104 [GeForce RTX 4070] [10de:2786] (rev a1) (prog-if 00 [VGA controller])
   Subsystem: Gigabyte Technology Co., Ltd AD104 [GeForce RTX 4070] [1458:40ee]
InstallationDate: Installed on 2023-12-20 (194 days ago)
InstallationMedia: Kubuntu 23.10 "Mantic Minotaur" - Release amd64 (20231010)
MachineType: Micro-Star International Co., Ltd. MS-7D25
ProcEnviron:
 LANG=en_US.UTF-8
 PATH=(custom, user)
 SHELL=/bin/bash
 XDG_RUNTIME_DIR=<set>
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.8.0-36-generic root=UUID=d0a4214b-acee-4317-a8dd-45390c3f5fa4 ro quiet splash vt.handoff=7
SourcePackage: xorg
Symptom: display
Title: Xorg freeze
UpgradeStatus: Upgraded to noble on 2024-05-22 (40 days ago)
dmi.bios.date: 11/13/2023
dmi.bios.release: 5.27
dmi.bios.vendor: American Megatrends International, LLC.
dmi.bios.version: A.F0
dmi.board.asset.tag: Default string
dmi.board.name: PRO Z690-A WIFI (MS-7D25)
dmi.board.vendor: Micro-Star International Co., Ltd.
dmi.board.version: 2.0
dmi.chassis.asset.tag: Default string
dmi.chassis.type: 3
dmi.chassis.vendor: Micro-Star International Co., Ltd.
dmi.chassis.version: 2.0
dmi.modalias: dmi:bvnAmericanMegatrendsInternational,LLC.:bvrA.F0:bd11/13/2023:br5.27:svnMicro-StarInternationalCo.,Ltd.:pnMS-7D25:pvr2.0:rvnMicro-StarInternationalCo.,Ltd.:rnPROZ690-AWIFI(MS-7D25):rvr2.0:cvnMicro-StarInternationalCo.,Ltd.:ct3:cvr2.0:skuDefaultstring:
dmi.product.family: Default string
dmi.product.name: MS-7D25
dmi.product.sku: Default string
dmi.product.version: 2.0
dmi.sys.vendor: Micro-Star International Co., Ltd.
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.120-2build1
version.libgl1-mesa-dri: libgl1-mesa-dri 24.0.5-1ubuntu1
version.libgl1-mesa-glx: libgl1-mesa-glx N/A
version.nvidia-graphics-drivers: nvidia-graphics-drivers-* N/A
version.xserver-xorg-core: xserver-xorg-core 2:21.1.12-1ubuntu1
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev N/A
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:22.0.0-1build1
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.99.917+git20210115-1build1
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.17-2build1

Revision history for this message
G Shoukry (capiosus) wrote :
Revision history for this message
Daniel van Vugt (vanvugt) wrote :

Those warnings do indicate a driver bug, but don't prove they're the cause of any freeze or crash. Please also follow these steps in case there's something else going wrong:

1. Run these commands:
    journalctl -b0 > journal.txt
    journalctl -b-1 > prevjournal.txt
and attach the resulting text files here.

2. Look in /var/crash for crash files and if found run:
    ubuntu-bug YOURFILE.crash
Then tell us the ID of the newly-created bug.

3. If step 2 failed then look at https://errors.ubuntu.com/user/ID where ID is the content of file /var/lib/whoopsie/whoopsie-id on the machine. Do you find any links to recent problems on that page? If so then please send the links to us.

Please take care to avoid attaching .crash files to bugs as we are unable to process them as file attachments. It would also be a security risk for yourself.

affects: xorg (Ubuntu) → nvidia-graphics-drivers-535 (Ubuntu)
Changed in nvidia-graphics-drivers-535 (Ubuntu):
status: New → Incomplete
Revision history for this message
G Shoukry (capiosus) wrote :

I believe this error could be related
https://errors.ubuntu.com/oops/e248144a-310e-11ef-9e7f-fa163ec44ecd

Other crashes seem to be unrelated, however the date from this crash seems a bit earlier than my latest crash.

Most often the game causes a freeze and no crash, and hangs the entire desktop environment (KDE) until it closes with no indication of why it closed.

The freezes I get usually:
Entire desktop goes unresponsive (not just team fortress 2)
I swap to tty to kill the process, or wait, and it does close after a while.
To kill the process in tty I use 'kill -kill $(pidof tf_linux64)'
Randomly it is possible to get the errors after killing the process.
I swap back to my X session.
Firefox crashes, and Thunderbird doesn't.

Changed in nvidia-graphics-drivers-535 (Ubuntu):
status: Incomplete → New
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.