nvidia module cant load with 5.13.0-41-generic

Bug #1975494 reported by schoubi
22
This bug affects 4 people
Affects Status Importance Assigned to Milestone
linux-hwe-5.13 (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

Hi,

My system is working fine with 5.13.0-40-generic kernel and nvidia-driver-510...

Switching to 5.13.0-41 is messing something up...

Black screen after grub.

No graphic at all (black screen).

Further investigation show problems with the nvidia module :

----------------------------------
nvidia 0000:01:00.0: vgaarb: changed VGA decodes: olddecodes=none,decodes=none:owns=none
NVRM: The NVIDIA GPU 0000:01:00.0
NVRM: (PCI ID: 10de:1cb3) installed in this system has
NVRM: fallen off the bus and is not responding to commands.
nvidia: probe of 0000:01:00.0 failed with error -1
NVRM: The NVIDIA probe routine failed for 1 device(s).
NVRM: None of the NVIDIA devices were initialized.
nvidia-nvlink: Unregistered the Nvlink Core, major device number 507
----------------------------------

modules nvidia-* are present and installed :

----------------------------------
dkms status
nvidia, 510.73.05, 5.13.0-40-generic, x86_64: installed
nvidia, 510.73.05, 5.13.0-41-generic, x86_64: installed
----------------------------------

but they can't be modprobed :

-----------------------------------
modprobe: ERROR: could not insert 'nvidia': No such device
-----------------------------------

Reboot on the 5.13.0-40-generic solve the problem.

We are numerous users here to have the same behaviour...

Description: Ubuntu 20.04.4 LTS
Release: 20.04

schoubi (schoubi)
tags: added: focal
Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Libera.chat.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1975494/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
schoubi (schoubi)
affects: ubuntu → linux-hwe-5.13 (Ubuntu)
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-hwe-5.13 (Ubuntu):
status: New → Confirmed
Revision history for this message
Ian Gordon (ian-gordon) wrote :

I have the same problem. The nvidia drivers (version 470 in my case) don't load after upgrading to 5.13.0-41-generic. Reverting to 5.13.0-39-generic fixes the problem.

I did notice that -41 has a lot of changes from the "stable" impish kernel and that the modules loaded on my machine with -41 include lots of wmi and i915 stuff:

i915 2400256 1
i2c_algo_bit 16384 1 i915
drm_kms_helper 253952 1 i915
cec 53248 2 drm_kms_helper,i915
drm 557056 3 drm_kms_helper,i915
video 53248 2 dell_wmi,i915
wmi 32768 5 dell_wmi_sysman,dell_wmi,wmi_bmof,dell_smbios,dell_wmi_descriptor

which the -39 modules does not load. Black listing these modules does not fix the problem.

The 2 machines we are having issues with are

Dell Optiplex 7090 with a Nvidia GeForce RTX 3070

so I tried the 5.14.0-1038-oem kernel which fortunately fixes the problem for me.

Regards,

Ian G.

Revision history for this message
schoubi (schoubi) wrote :

Same behavior with last 5.13.0-44 :/

-----------------------------------------
Package: linux-image-5.13.0-44-generic
Version: 5.13.0-44.49~20.04.1
Built-Using: linux-hwe-5.13 (= 5.13.0-44.49~20.04.1)
Priority: optional
Section: kernel
Source: linux-signed-hwe-5.13
Origin: Ubuntu

Revision history for this message
mjbogusz (mjbogusz) wrote :

Same behavior here with 5.13.0-48, 3 desktops with 3060s.

Workaround: Install kernel 5.14 with `sudo apt install linux-oem-20.04d`

Possibly a duplicate and/or same root cause as https://bugs.launchpad.net/ubuntu/+source/linux-meta-hwe-5.13/+bug/1974434

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.