[nvidia] GPU has fallen off the bus
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Incomplete
|
Undecided
|
Unassigned | ||
nvidia-graphics-drivers-525 (Ubuntu) |
New
|
Undecided
|
Unassigned |
Bug Description
When playing Assassins Creed Unity through Steam, the game will run fine for a short period and then pretty quickly in my experience the screen will go blank, lights on the GPU will turn off and GPU fans will spin at max RPM.
I checked the dmesg logs from that session and saw at the bottom:
```
Jun 12 19:25:09 pikachu kernel: NVRM: GPU at PCI:0000:0b:00: GPU-f888943b-
Jun 12 19:25:09 pikachu kernel: NVRM: Xid (PCI:0000:0b:00): 79, pid='<unknown>', name=<unknown>, GPU has fallen off the bus.
Jun 12 19:25:09 pikachu kernel: NVRM: GPU 0000:0b:00.0: GPU has fallen off the bus.
Jun 12 19:25:09 pikachu kernel: nvidia-gpu 0000:0b:00.3: Unable to change power state from D3hot to D0, device inaccessible
Jun 12 19:25:09 pikachu kernel: xhci_hcd 0000:0b:00.2: Unable to change power state from D3hot to D0, device inaccessible
Jun 12 19:25:09 pikachu kernel: xhci_hcd 0000:0b:00.2: Unable to change power state from D3cold to D0, device inaccessible
Jun 12 19:25:09 pikachu kernel: xhci_hcd 0000:0b:00.2: Controller not ready at resume -19
Jun 12 19:25:09 pikachu kernel: xhci_hcd 0000:0b:00.2: PCI post-resume error -19!
Jun 12 19:25:09 pikachu kernel: xhci_hcd 0000:0b:00.2: HC died; cleaning up
Jun 12 19:25:09 pikachu kernel: audit: type=1400 audit(168659430
Jun 12 19:25:10 pikachu kernel: nvidia-gpu 0000:0b:00.3: i2c timeout error ffffffff
Jun 12 19:25:10 pikachu kernel: ucsi_ccg 0-0008: i2c_transfer failed -110
```
Further up in the logs I also see the following (in case it's related):
```
[drm:nv_
```
I am using an RTX 2080Ti on driver version 525.105.17.
I have attached the full dmesg log
ProblemType: Bug
DistroRelease: Ubuntu 23.04
Package: nvidia-driver-525 525.105.17-0ubuntu1
ProcVersionSign
Uname: Linux 6.2.0-20-generic x86_64
NonfreeKernelMo
ApportVersion: 2.26.1-0ubuntu2
Architecture: amd64
CasperMD5CheckR
CurrentDesktop: ubuntu:GNOME
Date: Mon Jun 12 19:35:37 2023
InstallationDate: Installed on 2022-12-06 (187 days ago)
InstallationMedia: Ubuntu 22.10 "Kinetic Kudu" - Release amd64 (20221020)
SourcePackage: nvidia-
UpgradeStatus: Upgraded to lunar on 2023-04-21 (51 days ago)
From what I can tell, this is most likely to be a hardware issue like:
* The graphics card not making clean contact with the slot.
* Hardware failure of the motherboard.
* Hardware failure of the graphics card.
But a bit of googling suggests that other people encountering the same error message over the years have sometimes been able to avoid it by tweaking kernel/driver parameters.
The message "Failed to grab modeset ownership" is unrelated and can be ignored here.