Ubuntu freezes randomly

Bug #2045722 reported by Fikrul Arif
This bug affects 1 person
Affects                               Status  Importance  Assigned to  Milestone
linux-hwe-6.2 (Ubuntu)                New     Undecided   Unassigned
nvidia-graphics-drivers-535 (Ubuntu)  New     Undecided   Unassigned

Bug Description

The system freezes at random. Sometimes it happens when I leave the machine idle (probably at screen lock), sometimes when I am about to shut down, and sometimes during normal use. The log suggests it is related to Xorg or NVIDIA.

ProblemType: Bug
DistroRelease: Ubuntu 22.04
Package: xorg 1:7.7+23ubuntu2
ProcVersionSignature: Ubuntu 6.2.0-37.38~22.04.1-generic 6.2.16
Uname: Linux 6.2.0-37-generic x86_64
NonfreeKernelModules: nvidia_modeset nvidia
.proc.driver.nvidia.capabilities.gpu0: Error: path was not a regular file.
.proc.driver.nvidia.capabilities.mig: Error: path was not a regular file.
.proc.driver.nvidia.gpus.0000.01.00.0: Error: path was not a regular file.
.proc.driver.nvidia.registry: Binary: ""
.proc.driver.nvidia.suspend: suspend hibernate resume
.proc.driver.nvidia.suspend_depth: default modeset uvm
.proc.driver.nvidia.version:
 NVRM version: NVIDIA UNIX x86_64 Kernel Module 535.129.03 Thu Oct 19 18:56:32 UTC 2023
 GCC version: gcc version 11.4.0 (Ubuntu 11.4.0-1ubuntu1~22.04)
ApportVersion: 2.20.11-0ubuntu82.5
Architecture: amd64
BootLog: Error: [Errno 13] Permission denied: '/var/log/boot.log'
CasperMD5CheckResult: unknown
CompositorRunning: None
CurrentDesktop: ubuntu:GNOME
Date: Wed Dec 6 10:53:21 2023
DistUpgraded: Fresh install
DistroCodename: jammy
DistroVariant: ubuntu
DkmsStatus:
 nvidia/535.129.03, 5.15.0-89-generic, x86_64: installed
 nvidia/535.129.03, 6.2.0-36-generic, x86_64: installed
 nvidia/535.129.03, 6.2.0-37-generic, x86_64: installed
ExtraDebuggingInterest: Yes
GpuHangFrequency: Several times a day
GpuHangReproducibility: Seems to happen randomly
GpuHangStarted: Within the last few days
GraphicsCard:
 Intel Corporation CometLake-H GT2 [UHD Graphics] [8086:9bc4] (rev 05) (prog-if 00 [VGA controller])
   Subsystem: Hewlett-Packard Company CometLake-H GT2 [UHD Graphics] [103c:878e]
 NVIDIA Corporation TU106M [GeForce RTX 2060 Max-Q] [10de:1f12] (rev a1) (prog-if 00 [VGA controller])
   Subsystem: Hewlett-Packard Company TU106M [GeForce RTX 2060 Max-Q] [103c:878e]
InstallationDate: Installed on 2021-06-05 (913 days ago)
InstallationMedia: Ubuntu 20.04.2.0 LTS "Focal Fossa" - Release amd64 (20210209.1)
MachineType: HP HP ENVY Laptop 15-ep0xxx
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.2.0-37-generic root=UUID=34001051-5122-4187-b21f-2032a99219e8 ro quiet splash vt.handoff=7
SourcePackage: xorg
Symptom: display
Title: Xorg freeze
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 07/30/2021
dmi.bios.release: 15.7
dmi.bios.vendor: AMI
dmi.bios.version: F.07
dmi.board.asset.tag: Base Board Asset Tag
dmi.board.name: 878E
dmi.board.vendor: HP
dmi.board.version: 18.33
dmi.chassis.type: 10
dmi.chassis.vendor: HP
dmi.chassis.version: Chassis Version
dmi.ec.firmware.release: 18.33
dmi.modalias: dmi:bvnAMI:bvrF.07:bd07/30/2021:br15.7:efr18.33:svnHP:pnHPENVYLaptop15-ep0xxx:pvr:rvnHP:rn878E:rvr18.33:cvnHP:ct10:cvrChassisVersion:sku16P91PA#AR6:
dmi.product.family: 103C_5335KV HP ENVY
dmi.product.name: HP ENVY Laptop 15-ep0xxx
dmi.product.sku: 16P91PA#AR6
dmi.sys.vendor: HP
version.compiz: compiz N/A
version.libdrm2: libdrm2 2.4.113-2~ubuntu0.22.04.1
version.libgl1-mesa-dri: libgl1-mesa-dri 23.0.4-0ubuntu1~22.04.1
version.libgl1-mesa-glx: libgl1-mesa-glx N/A
version.nvidia-graphics-drivers: nvidia-graphics-drivers-* N/A
version.xserver-xorg-core: xserver-xorg-core 2:21.1.4-2ubuntu1.7~22.04.2
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev N/A
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:19.1.0-2ubuntu1
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.99.917+git20210115-1
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.17-2build1

Fikrul Arif (fikr4n) wrote :
affects: ubuntu → xorg (Ubuntu)
Daniel van Vugt (vanvugt) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It sounds like some part of the system has crashed. To help us find the cause of the crash please follow these steps:

1. Run these commands:
    journalctl -b0 > journal.txt
    journalctl -b-1 > prevjournal.txt
and attach the resulting text files here.

2. Look in /var/crash for crash files and if found run:
    ubuntu-bug YOURFILE.crash
Then tell us the ID of the newly-created bug.

3. If step 2 failed then look at https://errors.ubuntu.com/user/ID where ID is the content of file /var/lib/whoopsie/whoopsie-id on the machine. Do you find any links to recent problems on that page? If so then please send the links to us.

Please take care to avoid attaching .crash files to bugs as we are unable to process them as file attachments. It would also be a security risk for yourself.
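
Steps 2 and 3 above can be sketched as a small shell helper. This is only an illustration, assuming a POSIX shell: the function names list_crash_files and errors_url are hypothetical, and the paths are taken as arguments with the locations from the steps above as defaults. Reading the whoopsie ID file may require root privileges on some systems.

```shell
# Hypothetical helpers for steps 2 and 3; the names are illustrative only.

# Print every .crash file in the given directory (default: /var/crash).
list_crash_files() {
    dir="${1:-/var/crash}"
    for f in "$dir"/*.crash; do
        # Skip the unexpanded pattern when nothing matches.
        [ -e "$f" ] && printf '%s\n' "$f"
    done
    return 0
}

# Print the errors.ubuntu.com URL for the ID stored in the given file
# (default: /var/lib/whoopsie/whoopsie-id).
errors_url() {
    id_file="${1:-/var/lib/whoopsie/whoopsie-id}"
    [ -r "$id_file" ] || { echo "cannot read $id_file" >&2; return 1; }
    printf 'https://errors.ubuntu.com/user/%s\n' "$(tr -d '[:space:]' < "$id_file")"
}
```

Each file printed by list_crash_files would then be passed to ubuntu-bug as in step 2, and the URL printed by errors_url opened in a browser as in step 3.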

affects: xorg (Ubuntu) → ubuntu
Changed in ubuntu:
status: New → Incomplete
tags: added: nvidia
Fikrul Arif (fikr4n) wrote :

Thank you for your fast response, Daniel. The journals are attached, and this is the link (https://errors.ubuntu.com/oops/14e278f0-932b-11ee-9be3-fa163ec44ecd). There is only one entry for this month, but it was for nautilus.

Daniel van Vugt (vanvugt) wrote :

Thanks. I don't see any relevant crashes but either of these repeating issues might be related to freezes:

Des 06 11:43:39 zengi kernel: xhci_hcd 0000:01:00.2: xHC error in resume, USBSTS 0x401, Reinit
Des 06 11:43:39 zengi kernel: usb usb3: root hub lost power or was reset
Des 06 11:43:39 zengi kernel: usb usb4: root hub lost power or was reset
Des 06 11:43:58 zengi kernel: xhci_hcd 0000:01:00.2: xHC error in resume, USBSTS 0x401, Reinit
Des 06 11:43:58 zengi kernel: usb usb3: root hub lost power or was reset
Des 06 11:43:58 zengi kernel: usb usb4: root hub lost power or was reset
Des 06 11:44:16 zengi kernel: xhci_hcd 0000:01:00.2: xHC error in resume, USBSTS 0x401, Reinit
Des 06 11:44:16 zengi kernel: usb usb3: root hub lost power or was reset
Des 06 11:44:16 zengi kernel: usb usb4: root hub lost power or was reset

and

Des 06 10:47:26 zengi kernel: nvidia-modeset: ERROR: GPU:0: Error while waiting for GPU progress: 0x0000c57d:0 2:0:4048:4040
Des 06 10:47:31 zengi kernel: nvidia-modeset: ERROR: GPU:0: Error while waiting for GPU progress: 0x0000c57d:0 2:0:4048:4040
Des 06 10:47:36 zengi kernel: nvidia-modeset: ERROR: GPU:0: Error while waiting for GPU progress: 0x0000c57d:0 2:0:4048:4040
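
To see how often each of these two messages repeats in a saved journal, something like the following could be used. It is a minimal sketch, assuming the journal was saved to a text file as in the earlier steps; the function name count_suspects is hypothetical, and the two search strings are copied from the log lines above.

```shell
# Hypothetical helper: count the two repeating kernel messages in a journal file.
# Usage: count_suspects journal.txt
count_suspects() {
    journal="$1"
    # grep -c prints the number of matching lines (0 when there are none).
    printf 'xhci resume errors: %s\n' "$(grep -c 'xHC error in resume' "$journal")"
    printf 'nvidia-modeset errors: %s\n' "$(grep -c 'nvidia-modeset: ERROR' "$journal")"
}
```

Running count_suspects on journal.txt and on prevjournal.txt would show whether either message appears in both boots.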

affects: ubuntu → nvidia-graphics-drivers-535 (Ubuntu)
Changed in nvidia-graphics-drivers-535 (Ubuntu):
status: Incomplete → New
Fikrul Arif (fikr4n) wrote :

There is also something like this repeating in /var/log/syslog:

kernel: [ 6983.891059] ------------[ cut here ]------------
kernel: [ 6983.891060] WARNING: CPU: 2 PID: 8427 at /var/lib/dkms/nvidia/535.129.03/build/nvidia/nv.c:4769 nvidia_dev_put+0xb9/0xc0 [nvidia]
kernel: [ 6983.891259] Modules linked in: rfcomm snd_ctl_led snd_soc_skl_hda_dsp snd_soc_intel_hda_dsp_common snd_soc_hdac_hdmi snd_sof_probes snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio ccm cmac algif_hash algif_skcipher af_alg nvidia_uvm(PO) xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables libcrc32c nfnetlink bridge stp llc bnep snd_soc_dmic snd_sof_pci_intel_cnl snd_sof_intel_hda_common nvidia_drm(PO) soundwire_intel nvidia_modeset(PO) soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core intel_rapl_msr snd_soc_acpi_intel_match intel_rapl_common snd_soc_acpi intel_tcc_cooling soundwire_bus nvidia(PO) x86_pkg_temp_thermal intel_powerclamp snd_soc_core snd_compress ac97_bus snd_pcm_dmaengine snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi i915 snd_hda_codec
kernel: [ 6983.891291] snd_hda_core snd_hwdep coretemp crct10dif_pclmul snd_pcm kvm_intel iwlmvm polyval_clmulni polyval_generic snd_seq_midi ghash_clmulni_intel btusb kvm mei_hdcp mei_pxp snd_seq_midi_event binfmt_misc drm_buddy sha512_ssse3 btrtl irqbypass mac80211 snd_rawmidi ttm btbcm aesni_intel btintel drm_display_helper snd_seq crypto_simd cec btmtk cryptd libarc4 nls_iso8859_1 hp_wmi rc_core snd_seq_device bluetooth cmdlinepart rapl iwlwifi sparse_keymap snd_timer spi_nor drm_kms_helper i2c_algo_bit intel_cstate platform_profile serio_raw wmi_bmof intel_wmi_thunderbolt mxm_wmi mtd ee1004 syscopyarea mei_me snd ecdh_generic joydev input_leds cfg80211 sysfillrect ecc mei hid_multitouch soundcore sysimgblt intel_pch_thermal mac_hid acpi_pad sch_fq_codel msr parport_pc ppdev drm lp parport efi_pstore ip_tables x_tables autofs4 usbhid hid_generic nvme ucsi_acpi ucsi_ccg sdhci_pci intel_lpss_pci i2c_hid_acpi spi_intel_pci cqhci i2c_i801 nvme_core ahci intel_lpss typec_ucsi crc32_pclmul
kernel: [ 6983.891335] thunderbolt xhci_pci i2c_nvidia_gpu i2c_hid spi_intel sdhci i2c_smbus nvme_common libahci idma64 typec xhci_pci_renesas i2c_ccgx_ucsi hid video wmi pinctrl_cannonlake
kernel: [ 6983.891344] CPU: 2 PID: 8427 Comm: Sw-Signaling Tainted: P W O 6.2.0-37-generic #38~22.04.1-Ubuntu
kernel: [ 6983.891345] Hardware name: HP HP ENVY Laptop 15-ep0xxx/878E, BIOS F.07 07/30/2021
kernel: [ 6983.891346] RIP: 0010:nvidia_dev_put+0xb9/0xc0 [nvidia]
kernel: [ 6983.891530] Code: 31 f6 31 ff c3 cc cc cc cc 48 c7 c7 c0 8a bd c4 e8 dc 27 26 e2 5b 41 5c 41 5d 41 5e 5d 31 c0 31 d2 31 f6 31 ff c3 cc cc cc cc <0f> 0b eb be 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90
kernel: [ 6983.891531] RSP: 0018:ffffa17d46867bf8 EFLAGS: 00010202
kernel: [ 6983.891533] RAX: 0000000000000026 RBX: 0000000000000100 RCX: 0000000000000000
kernel: [ 6983.891534] RDX: 0000000000000000 RSI: 000...

Daniel van Vugt (vanvugt) wrote :

Yes I noticed that but it's already covered by the nvidia-graphics-drivers-535 and linux-hwe-6.2 tasks here.

Assuming this isn't a hardware problem I think you're probably waiting for a driver fix from Nvidia, or a kernel fix.

Fikrul Arif (fikr4n) wrote :

FYI:

- This is probably related to KVM, QEMU, or the Android emulator, because it usually happens when the computer is about to suspend, lock the screen, or shut down in a session where the Android emulator had earlier disconnected or closed unexpectedly.
- I switched to nvidia-driver-525; so far the freeze has not happened again.
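
After switching drivers, the version of the loaded kernel module can be checked from /proc/driver/nvidia/version, the same file quoted in the bug description. The helper below is a rough sketch: the function name nvidia_module_version is hypothetical, and it assumes the NVRM line has the same layout as in this report ("... Kernel Module <version> ...").

```shell
# Hypothetical helper: print the NVIDIA kernel module version from a
# /proc-style version file (default: /proc/driver/nvidia/version).
nvidia_module_version() {
    version_file="${1:-/proc/driver/nvidia/version}"
    # Assumed line layout, as in this report:
    #   NVRM version: NVIDIA UNIX x86_64 Kernel Module 535.129.03 Thu Oct 19 ...
    # Only lines containing "Kernel Module" followed by a version are printed.
    sed -n 's/.*Kernel Module *\([0-9][0-9.]*\).*/\1/p' "$version_file"
}
```

If the output still shows a 535 version after installing nvidia-driver-525, the old module is probably still loaded and a reboot may be needed.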
