Nvidia 530 kernel driver is not installed/built

Bug #2023946 reported by Teh Kok How
This bug affects 3 people
| Affects | Status | Importance | Assigned to | Milestone |
|---|---|---|---|---|
| nvidia-graphics-drivers-530 (Ubuntu) | Incomplete | High | Unassigned | |
| nvidia-graphics-drivers-535 (Ubuntu) | New | Undecided | Unassigned | |

Bug Description

```
$ uname -a
Linux khteh-p17-2i 6.2.0-20-generic #20-Ubuntu SMP PREEMPT_DYNAMIC Thu Apr 6 07:48:48 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
```
```
$ nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
```
```
$ prime-select query
on-demand
```
```
$ ubuntu-drivers devices
== /sys/devices/pci0000:00/0000:00:01.0/0000:01:00.0 ==
modalias : pci:v000010DEd000025B8sv000017AAsd000022DEbc03sc00i00
vendor : NVIDIA Corporation
model : GA107GLM [RTX A2000 Mobile]
driver : nvidia-driver-510 - distro non-free
driver : nvidia-driver-470-server - distro non-free
driver : nvidia-driver-525 - distro non-free
driver : nvidia-driver-515-open - distro non-free
driver : nvidia-driver-515-server - distro non-free
driver : nvidia-driver-525-open - distro non-free
driver : nvidia-driver-530 - distro non-free recommended
driver : nvidia-driver-525-server - distro non-free
driver : nvidia-driver-530-open - distro non-free
driver : nvidia-driver-515 - distro non-free
driver : nvidia-driver-470 - distro non-free
driver : xserver-xorg-video-nouveau - distro free builtin
```
```
$ dpkg -l | grep -i nvidia
ii libaccinj64-11.8:amd64 11.8.87~11.8.0-3 amd64 NVIDIA ACCINJ Library (64-bit)
ii libcu++-dev 1.8.1-2 all NVIDIA C++ Standard Library
ii libcublas11:amd64 11.11.3.6~11.8.0-3 amd64 NVIDIA cuBLAS Library
ii libcublaslt11:amd64 11.11.3.6~11.8.0-3 amd64 NVIDIA cuBLASLt Library
ii libcudart11.0:amd64 11.8.89~11.8.0-3 amd64 NVIDIA CUDA Runtime Library
ii libcufft10:amd64 11.1.1+~10.9.0.58~11.8.0-3 amd64 NVIDIA cuFFT Library
ii libcufftw10:amd64 11.1.1+~10.9.0.58~11.8.0-3 amd64 NVIDIA cuFFTW Library
ii libcuinj64-11.8:amd64 11.8.87~11.8.0-3 amd64 NVIDIA CUINJ Library (64-bit)
ii libcupti-dev:amd64 11.8.87~11.8.0-3 amd64 NVIDIA CUDA Profiler Tools Interface development files
ii libcupti-doc 11.8.87~11.8.0-3 all NVIDIA CUDA Profiler Tools Interface documentation
ii libcupti11.8:amd64 11.8.87~11.8.0-3 amd64 NVIDIA CUDA Profiler Tools Interface runtime library
ii libcurand10:amd64 11.1.1+~10.3.0.86~11.8.0-3 amd64 NVIDIA cuRAND Library
ii libcusolver11:amd64 11.4.1.48~11.8.0-3 amd64 NVIDIA cuSOLVER Library
ii libcusolvermg11:amd64 11.4.1.48~11.8.0-3 amd64 NVIDIA cuSOLVERmg Library
ii libcusparse11:amd64 11.7.5.86~11.8.0-3 amd64 NVIDIA cuSPARSE Library
ii libnppc11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives core runtime library
ii libnppial11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Arithmetic and Logic
ii libnppicc11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Color Conversion
ii libnppidei11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Data Exchange and Initialization
ii libnppif11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Filters
ii libnppig11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Geometry transforms
ii libnppim11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Morphological operations
ii libnppist11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Statistics
ii libnppisu11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Support
ii libnppitc11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Threshold and Compare
ii libnpps11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives for signal processing runtime library
ii libnvidia-cfg1-530:amd64 530.41.03-0ubuntu2 amd64 NVIDIA binary OpenGL/GLX configuration library
ii libnvidia-common-530 530.41.03-0ubuntu2 all Shared files used by the NVIDIA libraries
ii libnvidia-compute-530:amd64 530.41.03-0ubuntu2 amd64 NVIDIA libcompute package
ii libnvidia-decode-530:amd64 530.41.03-0ubuntu2 amd64 NVIDIA Video Decoding runtime libraries
ii libnvidia-encode-530:amd64 530.41.03-0ubuntu2 amd64 NVENC Video Encoding runtime library
ii libnvidia-extra-530:amd64 530.41.03-0ubuntu2 amd64 Extra libraries for the NVIDIA driver
ii libnvidia-fbc1-530:amd64 530.41.03-0ubuntu2 amd64 NVIDIA OpenGL-based Framebuffer Capture runtime library
ii libnvidia-gl-530:amd64 530.41.03-0ubuntu2 amd64 NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii libnvidia-ml-dev:amd64 11.8.86~11.8.0-3 amd64 NVIDIA Management Library (NVML) development files
ii libnvjpeg11:amd64 11.9.0.86~11.8.0-3 amd64 NVIDIA JPEG library (nvJPEG)
ii libnvrtc-builtins11.8:amd64 11.8.89~11.8.0-3 amd64 CUDA Runtime Compilation (NVIDIA NVRTC Builtins Library)
ii libnvrtc11.2:amd64 11.8.89~11.8.0-3 amd64 CUDA Runtime Compilation (NVIDIA NVRTC Library)
ii libnvtoolsext1:amd64 11.8.86~11.8.0-3 amd64 NVIDIA Tools Extension Library
ii libnvvm4:amd64 11.8.89~11.8.0-3 amd64 NVIDIA NVVM Library
ii linux-signatures-nvidia-6.2.0-1003-lowlatency 6.2.0-1003.3+2 amd64 Linux kernel signatures for nvidia modules for version 6.2.0-1003-lowlatency
ii linux-signatures-nvidia-6.2.0-20-generic 6.2.0-20.20+2 amd64 Linux kernel signatures for nvidia modules for version 6.2.0-20-generic
ii nsight-compute 2022.3.0.22~11.8.0-3 amd64 NVIDIA Nsight Compute
ii nsight-compute-target:amd64 2022.3.0.22~11.8.0-3 amd64 NVIDIA Nsight Compute (target specific libraries)
ii nsight-systems 2022.4.2.1~11.8.0-3 amd64 NVIDIA Nsight Systems
ii nsight-systems-target:amd64 2022.4.2.1~11.8.0-3 amd64 NVIDIA Nsight Systems (target specific libraries)
ii nvidia-compute-utils-530 530.41.03-0ubuntu2 amd64 NVIDIA compute utilities
ii nvidia-cuda-dev:amd64 11.8.89~11.8.0-3 amd64 NVIDIA CUDA development files
ii nvidia-cuda-gdb 11.8.86~11.8.0-3 amd64 NVIDIA CUDA Debugger (GDB)
ii nvidia-cuda-toolkit 11.8.89~11.8.0-3 amd64 NVIDIA CUDA development toolkit
ii nvidia-cuda-toolkit-doc 11.8.0-3 all NVIDIA CUDA and OpenCL documentation
ii nvidia-dkms-530 530.41.03-0ubuntu2 amd64 NVIDIA DKMS package
ii nvidia-driver-530 530.41.03-0ubuntu2 amd64 NVIDIA driver metapackage
ii nvidia-kernel-common-530 530.41.03-0ubuntu2 amd64 Shared files used with the kernel module
ii nvidia-kernel-source-530 530.41.03-0ubuntu2 amd64 NVIDIA kernel source package
ii nvidia-opencl-dev:amd64 11.8.89~11.8.0-3 amd64 NVIDIA OpenCL development files
ii nvidia-prime 0.8.17.1 all Tools to enable NVIDIA's Prime
ii nvidia-profiler 11.8.87~11.8.0-3 amd64 NVIDIA Profiler for CUDA and OpenCL
ii nvidia-settings 510.47.03-0ubuntu1 amd64 Tool for configuring the NVIDIA graphics driver
ii nvidia-utils-530 530.41.03-0ubuntu2 amd64 NVIDIA driver support binaries
ii nvidia-visual-profiler 11.8.87~11.8.0-3 amd64 NVIDIA Visual Profiler for CUDA and OpenCL
ii screen-resolution-extra 0.18.3 all Extension for the nvidia-settings control panel
ii xserver-xorg-video-nvidia-530 530.41.03-0ubuntu2 amd64 NVIDIA binary Xorg driver
```
WTF is happening?

Tags: lunar nvidia
Florian W. (florian-will) wrote :

I haven't rebooted yet after installing 530, and now I may wait a bit before attempting it, but I notice you're missing linux-objects-nvidia-530-6.2.0-20-generic and linux-modules-nvidia-530-generic and maybe a few more compared to my setup.

Did you install using dkms for some reason, like having a custom kernel? If so, these packages may not be needed and that's not your issue.
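
A rough way to tell which of the two cases applies is to look at which packages provide the kernel module. The sketch below reads `dpkg -l` output and reports whether a prebuilt `linux-modules-nvidia-*` package or only `nvidia-dkms-*` is installed. The function name `nvidia_module_source` is made up for illustration, and the package-name patterns are assumptions based on the names mentioned in this report.

```shell
# Classify how the NVIDIA kernel module is provided, from `dpkg -l` output on stdin.
# Prints "prebuilt" if a linux-modules-nvidia-* package is installed,
# "dkms" if only an nvidia-dkms-* package is installed, and "none" otherwise.
# (Hypothetical helper; the package-name patterns are assumptions.)
nvidia_module_source() {
    awk '
        $1 == "ii" && $2 ~ /^linux-modules-nvidia-/ { prebuilt = 1 }
        $1 == "ii" && $2 ~ /^nvidia-dkms-/          { dkms = 1 }
        END {
            if (prebuilt)  print "prebuilt"
            else if (dkms) print "dkms"
            else           print "none"
        }
    '
}

# Usage on a live system:
#   dpkg -l | nvidia_module_source
```

If it prints "dkms", the module has to be built locally for each installed kernel, so a failed build would leave `nvidia-smi` unable to reach the driver.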

Teh Kok How (khteh) wrote :

I tried several installation approaches like `sudo ubuntu-drivers autoinstall` but none of them works. How can I have the complete installation without missing anything?

Teh Kok How (khteh) wrote :

"Did you install using dkms for some reason, like having a custom kernel? If so, these packages may not be needed and that's not your issue."

What do you mean? What do I NOT need and can / should remove? I don't use any custom kernel!

Teh Kok How (khteh) wrote :

How do I recover my system? What should I remove, what is missing, and how do I install the missing packages?

Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in nvidia-graphics-drivers-530 (Ubuntu):
status: New → Confirmed
pakaoraki (pakaoraki) wrote :

Ubuntu 23.04, Dell XPS 9570 with NVIDIA 1050 Ti (PRIME).

# nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

I tried enrolling the key for Secure Boot again, since that can cause this issue:

  sudo update-secureboot-policy --enroll-key

It didn't fix the situation.

I'm going back to nvidia 525.

pakaoraki (pakaoraki) wrote :

After completely reinstalling the NVIDIA driver again, I managed to use the nvidia-530 driver. I noticed that some recent updates had installed the oracle and lowlatency kernels; after removing them and reinstalling the NVIDIA driver, PRIME was working again.

Daniel van Vugt (vanvugt) wrote :

If you are still experiencing this bug then please run:

  lspci -k > lspci.txt
  journalctl -b0 > journal.txt

and attach the resulting text files here.

tags: added: lunar
tags: added: nvidia
Teh Kok How (khteh) wrote :

journal.txt

Teh Kok How (khteh) wrote :

Alexander (sturmlocke86) wrote :

The new Nvidia 530 driver just nuked my Ubuntu install too, see my comments from today further below here: https://bugs.launchpad.net/ubuntu/+source/mutter/+bug/2023776

Same here: the low-latency kernel has been installed (I don't know why, tbh), Secure Boot is off, and I can't install the Nvidia driver after the update tool messed something up. I'm trying to recover right now.

Daniel van Vugt (vanvugt) wrote :

Try this:

  sudo apt install --reinstall nvidia-dkms-530 nvidia-driver-530

and if that doesn't fix it then also try uninstalling any kernels you're not using first.
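
To see which kernel flavours are actually installed before removing any, something like the following may help. It is only a sketch: `kernel_flavours` is a made-up helper name, and it assumes versioned image packages are named `linux-image-<version>-<flavour>` (optionally `linux-image-unsigned-...`).

```shell
# Print the distinct flavours (generic, lowlatency, oracle, ...) of the
# installed versioned kernel images, from `dpkg -l` output on stdin.
# Only packages in state "ii" are counted; the flavour is taken to be the
# last dash-separated part of the package name (an assumption).
kernel_flavours() {
    awk '
        $1 == "ii" && $2 ~ /^linux-image-(unsigned-)?[0-9]/ {
            n = split($2, parts, "-")
            print parts[n]
        }
    ' | sort -u
}

# Usage on a live system:
#   dpkg -l | kernel_flavours
#   uname -r    # the kernel currently running, for comparison
```

Anything listed other than the flavour shown by `uname -r` is a candidate for removal, but check what depends on it first.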

summary: - Nvidia 530 doesn’t work on ubuntu 23.04!
+ Nvidia 530 kernel driver is not installed
summary: - Nvidia 530 kernel driver is not installed
+ Nvidia 530 kernel driver is not installed/built
Changed in nvidia-graphics-drivers-530 (Ubuntu):
importance: Undecided → High
Teh Kok How (khteh) wrote :

I don't think I want to install nvidia-dkms-530 again which messes things up! Please refer to https://forums.developer.nvidia.com/t/530-doesnt-work-on-ubuntu-23-04/256721/2

I only have ONE kernel which is the generic.

And BTW, what the fuck does Oracle have to do with the world of Ubuntu that screws up our devices!?!

Daniel van Vugt (vanvugt) wrote :

Different kernels exist for different purposes. Regular desktop users won't generally have more than one type of kernel installed unless they've been experimenting...

I suspect sometimes failure to build the module for one kernel may cause DKMS to give up and not even try to build the module for the other kernels. But I'm guessing now (recalling similar experience of my own) and have no direct evidence that's the issue.
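
One way to check that guess would be to compare the DKMS state of the module across kernels. The sketch below parses `dkms status` output and prints one "kernel state" pair per nvidia entry. The helper name `dkms_nvidia_states` is made up, and the assumed line format (`nvidia/<version>, <kernel>, <arch>: <state>`, or the older comma-separated variant) may differ between DKMS versions.

```shell
# Print "<kernel> <state>" for every nvidia entry in `dkms status` output
# read from stdin. Fields are split on "," or ":" followed by optional spaces;
# the kernel is taken as the third field from the end and the state as the last.
# (Hypothetical helper; the output format is an assumption.)
dkms_nvidia_states() {
    awk -F'[,:] *' '/^nvidia/ && NF >= 4 { print $(NF-2), $NF }'
}

# Usage on a live system:
#   dkms status | dkms_nvidia_states
```

A kernel that never reaches "installed" while another one does would support the theory; a kernel missing from the list entirely would suggest the build was never attempted for it.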

Teh Kok How (khteh) wrote :

So the problem is not Oracle but Canonical, which doesn't properly validate the "experimenting" kernels before publishing them to the general public?

Daniel van Vugt (vanvugt) wrote :

The NVIDIA driver is only supported by some kernels, not all. That in itself is not a bug. To cover the possibility that it will fail to build for some kernels and to keep building for others, a bug report either to the driver itself (this bug) or to the automated builder (https://bugs.launchpad.net/ubuntu/+source/dkms/+bugs) is probably appropriate.

While Canonical might not expect desktop users to have multiple types of kernels installed, that's something I often see people do, so I suggest it should be part of QA.

Teh Kok How (khteh) wrote :

I don't think that makes sense, because I did not choose to install the kernels that break things (oracle and lowlatency). They were pushed to me when I ran `apt update -y` and `apt upgrade -y` on a daily basis. These "experimenting" kernels fall through the cracks of your QA process and we become the victims of it.

How can I prevent this from happening again?

Daniel van Vugt (vanvugt) wrote :

I don't know why that would have happened and haven't seen it occur myself. I suggest you query the unwanted packages that are installed to find out what is depending on (or recommending) them.

Ahmad Hammoudeh (hammoudeh-69) wrote :

@sturmlocke86 using 6.2.0-20-generic or 6.2.0-23-generic solved the issue for me.

Daniel van Vugt (vanvugt) wrote :

"Using" a different kernel isn't the main problem. This bug was initially logged against 6.2.0-20-generic

Teh Kok How (khteh) wrote :

No. It happened initially with the oracle kernel, and I later solved it by switching to the generic kernel. But it is not entirely solved yet: the resolution of the external monitor is not correct, and sometimes the external monitor is not detected at all.

Daniel van Vugt (vanvugt) wrote :

You can use the 'apt rdepends' command to find out what caused the non-default kernels to get installed.

Alexander (sturmlocke86) wrote :

I am also left wondering where these additional kernels came from, tbh. I didn't knowingly install them onto my system. Could they have been installed as a dependency of the app "Gnome Boxes" (virtual machine), which I did in fact install onto my 23.04 machine a few weeks ago?

Daniel van Vugt (vanvugt) wrote :

Why the kernels are installed isn't really important, just a curiosity. The nvidia driver should build and install more reliably regardless...

Changed in dkms (Ubuntu):
status: New → Incomplete
Changed in nvidia-graphics-drivers-530 (Ubuntu):
status: Confirmed → Incomplete
Daniel van Vugt (vanvugt) wrote :

Incomplete: We'll need to see the DKMS build logs to determine which package is to blame.

Teh Kok How (khteh) wrote :

$ apt rdepends
E: No packages found

Daniel van Vugt (vanvugt) wrote :

$ apt rdepends NAME_OF_KERNEL_PACKAGE

Teh Kok How (khteh) wrote :

What's that NAME_OF_KERNEL_PACKAGE that I should use?

Daniel van Vugt (vanvugt) wrote :

NAME_OF_KERNEL_PACKAGE would be the name of one of the 'oracle' or 'lowlatency' kernel packages you have installed. But as mentioned in comment #24 it's not really important, what's important is comment #25.
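
As a concrete illustration, the reverse-dependency listing can be trimmed to bare package names. The sketch assumes the usual `apt-cache rdepends` layout (package name, a "Reverse Depends:" line, then indented entries, some prefixed with "|"); `list_rdepends` is a made-up helper, and the package name in the usage line is only an example.

```shell
# List reverse dependencies from `apt-cache rdepends` output on stdin:
# drop the two header lines, strip leading spaces and "|" markers,
# and remove duplicates. (Hypothetical helper; layout is an assumption.)
list_rdepends() {
    sed -e '1,2d' -e 's/^[ |]*//' | sort -u
}

# Usage on a live system (example package name, substitute your own):
#   apt-cache rdepends --installed linux-image-6.2.0-1003-lowlatency | list_rdepends
```

The `--installed` option limits the listing to packages that are actually installed, which narrows down what could have pulled the kernel in.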

Teh Kok How (khteh) wrote :

I have removed nvidia-dkms-*, so I probably can't give you the logs you need.

Daniel van Vugt (vanvugt) wrote :

Removing the kernel driver would certainly cause this bug too. What graphics driver are you using now?

Teh Kok How (khteh) wrote :

RTX A2000

Daniel van Vugt (vanvugt) wrote :

What output do you get from this command?

  lspci -k

Teh Kok How (khteh) wrote :

$ lspci -k
00:00.0 Host bridge: Intel Corporation 11th Gen Core Processor Host Bridge/DRAM Registers (rev 05)
 Subsystem: Lenovo 11th Gen Core Processor Host Bridge/DRAM Registers
00:01.0 PCI bridge: Intel Corporation 11th Gen Core Processor PCIe Controller #1 (rev 05)
 Subsystem: Lenovo 11th Gen Core Processor PCIe Controller
 Kernel driver in use: pcieport
00:02.0 VGA compatible controller: Intel Corporation TigerLake-H GT1 [UHD Graphics] (rev 01)
 Subsystem: Lenovo TigerLake-H GT1 [UHD Graphics]
 Kernel driver in use: i915
 Kernel modules: i915
00:04.0 Signal processing controller: Intel Corporation TigerLake-LP Dynamic Tuning Processor Participant (rev 05)
 Subsystem: Lenovo TigerLake-LP Dynamic Tuning Processor Participant
 Kernel driver in use: proc_thermal
 Kernel modules: processor_thermal_device_pci_legacy
00:06.0 PCI bridge: Intel Corporation 11th Gen Core Processor PCIe Controller #0 (rev 05)
 Subsystem: Lenovo 11th Gen Core Processor PCIe Controller
 Kernel driver in use: pcieport
00:07.0 PCI bridge: Intel Corporation Tiger Lake-H Thunderbolt 4 PCI Express Root Port #0 (rev 05)
 Subsystem: Lenovo Tiger Lake-H Thunderbolt 4 PCI Express Root Port
 Kernel driver in use: pcieport
00:07.1 PCI bridge: Intel Corporation Tiger Lake-H Thunderbolt 4 PCI Express Root Port #1 (rev 05)
 Subsystem: Lenovo Tiger Lake-H Thunderbolt 4 PCI Express Root Port
 Kernel driver in use: pcieport
00:0d.0 USB controller: Intel Corporation Tiger Lake-H Thunderbolt 4 USB Controller (rev 05)
 Subsystem: Lenovo Tiger Lake-H Thunderbolt 4 USB Controller
 Kernel driver in use: xhci_hcd
 Kernel modules: xhci_pci
00:0d.2 USB controller: Intel Corporation Tiger Lake-H Thunderbolt 4 NHI #0 (rev 05)
 Subsystem: Lenovo Tiger Lake-H Thunderbolt 4 NHI
 Kernel driver in use: thunderbolt
 Kernel modules: thunderbolt
00:14.0 USB controller: Intel Corporation Tiger Lake-H USB 3.2 Gen 2x1 xHCI Host Controller (rev 11)
 Subsystem: Lenovo Tiger Lake-H USB 3.2 Gen 2x1 xHCI Host Controller
 Kernel driver in use: xhci_hcd
 Kernel modules: xhci_pci
00:14.2 RAM memory: Intel Corporation Tiger Lake-H Shared SRAM (rev 11)
 Subsystem: Lenovo Tiger Lake-H Shared SRAM
00:16.0 Communication controller: Intel Corporation Tiger Lake-H Management Engine Interface (rev 11)
 Subsystem: Lenovo Tiger Lake-H Management Engine Interface
 Kernel driver in use: mei_me
 Kernel modules: mei_me
00:16.3 Serial controller: Intel Corporation Device 43e3 (rev 11)
 Subsystem: Lenovo Device 22de
 Kernel driver in use: serial
00:1c.0 PCI bridge: Intel Corporation Device 43b8 (rev 11)
 Subsystem: Lenovo Device 22de
 Kernel driver in use: pcieport
00:1c.4 PCI bridge: Intel Corporation Tiger Lake-H PCI Express Root Port #5 (rev 11)
 Subsystem: Lenovo Tiger Lake-H PCI Express Root Port
 Kernel driver in use: pcieport
00:1c.5 PCI bridge: Intel Corporation Device 43bd (rev 11)
 Subsystem: Lenovo Device 22de
 Kernel driver in use: pcieport
00:1c.7 PCI bridge: Intel Corporation Device 43bf (rev 11)
 Subsystem: Lenovo Device 22de
 Kernel driver in use: pcieport
00:1d.0 PCI bridge: Intel Corporation Tiger Lake-H PCI Express Root Port #9 (rev 11)
 Subsystem: Lenovo Tiger Lake-H PCI Express Root Port
 Kernel driver in us...

Daniel van Vugt (vanvugt) wrote :

Looks like it's working now.

If the bug ever occurs again then please look in /var/lib/dkms/... for the build log and attach it here.

Teh Kok How (khteh) wrote :

No, it doesn't work completely. Please check comment #21.

Daniel van Vugt (vanvugt) wrote :

Please run 'nvidia-smi' and try launching 'NVIDIA Settings'. If those work then comment #21 would be a different issue.

Teh Kok How (khteh) wrote :

`nvidia-smi` runs successfully. How do I launch "Nvidia Settings"?

Daniel van Vugt (vanvugt) wrote :

Nvidia Settings is an app in the apps list (Super+A key). But actually that doesn't matter now because nvidia-smi is working. So there are two separate problems:

1. Bug 2023946: nvidia kernel driver not built. Still requires a DKMS build log showing the failure or we can give up now that it's working.

2. Wrong monitor resolution (presumably in Xorg): Needs a new bug report please.

Teh Kok How (khteh) wrote :

Are you sure this is "resolved" just because we removed the 2 fxxking third party kernels - Oracle and lowlatency?

Daniel van Vugt (vanvugt) wrote :

The bug status here is "Incomplete" awaiting DKMS build logs showing a failure.

The monitor resolution problem should get a separate bug.

Teh Kok How (khteh) wrote :

Teh Kok How (khteh) wrote :

Was away for a week and today things break again after `apt update && apt upgrade` !

$ dpkg -l | grep -i nvidia
ii libaccinj64-11.8:amd64 11.8.87~11.8.0-3 amd64 NVIDIA ACCINJ Library (64-bit)
ii libcu++-dev 1.8.1-2 all NVIDIA C++ Standard Library
ii libcublas11:amd64 11.11.3.6~11.8.0-3 amd64 NVIDIA cuBLAS Library
ii libcublaslt11:amd64 11.11.3.6~11.8.0-3 amd64 NVIDIA cuBLASLt Library
ii libcudart11.0:amd64 11.8.89~11.8.0-3 amd64 NVIDIA CUDA Runtime Library
ii libcufft10:amd64 11.1.1+~10.9.0.58~11.8.0-3 amd64 NVIDIA cuFFT Library
ii libcufftw10:amd64 11.1.1+~10.9.0.58~11.8.0-3 amd64 NVIDIA cuFFTW Library
ii libcuinj64-11.8:amd64 11.8.87~11.8.0-3 amd64 NVIDIA CUINJ Library (64-bit)
ii libcupti-dev:amd64 11.8.87~11.8.0-3 amd64 NVIDIA CUDA Profiler Tools Interface development files
ii libcupti-doc 11.8.87~11.8.0-3 all NVIDIA CUDA Profiler Tools Interface documentation
ii libcupti11.8:amd64 11.8.87~11.8.0-3 amd64 NVIDIA CUDA Profiler Tools Interface runtime library
ii libcurand10:amd64 11.1.1+~10.3.0.86~11.8.0-3 amd64 NVIDIA cuRAND Library
ii libcusolver11:amd64 11.4.1.48~11.8.0-3 amd64 NVIDIA cuSOLVER Library
ii libcusolvermg11:amd64 11.4.1.48~11.8.0-3 amd64 NVIDIA cuSOLVERmg Library
ii libcusparse11:amd64 11.7.5.86~11.8.0-3 amd64 NVIDIA cuSPARSE Library
ii libnppc11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives core runtime library
ii libnppial11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Arithmetic and Logic
ii libnppicc11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Color Conversion
ii libnppidei11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Data Exchange and Initialization
ii libnppif11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Filters
ii libnppig11:amd64 11.8.0.86~11.8.0-3 amd64 NVIDIA Performance Primitives lib for Image Geometry transforms
ii libnppim11:amd64 11.8.0.86~11.8.0-3 ...

Daniel van Vugt (vanvugt) wrote :

That's unfortunate, but useful for debugging this. Please try to find the relevant log file from when it tried to build the kernel graphics driver:

  find /var/lib/dkms/nvidia -name \*.log

Please also attach the system update logs:

  /var/log/apt/history.log
  /var/log/dpkg.log
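
A small wrapper around the `find` command above can make it easier to list every build log in a stable order before attaching them. This is only a sketch; `find_dkms_logs` is a made-up helper name and the default path is the one used above.

```shell
# Print all *.log files under a DKMS tree, sorted by path.
# The first argument overrides the default root (/var/lib/dkms/nvidia).
# (Hypothetical helper for illustration.)
find_dkms_logs() {
    dkms_root=${1:-/var/lib/dkms/nvidia}
    find "$dkms_root" -type f -name '*.log' 2>/dev/null | sort
}

# Usage on a live system:
#   find_dkms_logs
#   find_dkms_logs /var/lib/dkms    # include other DKMS modules as well
```

Each path encodes the driver version and kernel release, so the listing also shows which kernels a build was attempted for.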

Teh Kok How (khteh) wrote :

/var/lib/dkms/nvidia/535.54.03/6.2.0-24-generic/x86_64/log/make.log

Teh Kok How (khteh) wrote :

/var/log/apt/history.log
/var/log/dpkg.log

Daniel van Vugt (vanvugt) wrote :

That's odd. Those logs suggest the nvidia kernel driver built and installed successfully. But your log from bug 2025585 explains the issue:

Jul 03 11:40:12 khteh-p17-2i gdm3[3370]: modprobe: ERROR: could not insert 'nvidia': Key was rejected by service
Jul 03 11:40:12 khteh-p17-2i kernel: Loading of module with unavailable key is rejected

which is also mentioned in bug 1921536.
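
For anyone wanting to check their own journal for the same symptom, a minimal sketch follows. It looks for the two messages quoted above; `module_key_rejected` is a made-up helper name, and the exact wording of the messages is assumed to match what this log shows.

```shell
# Succeed (exit status 0) if the journal text on stdin contains either of the
# module-signature rejection messages quoted in this report.
# (Hypothetical helper; message wording is taken from the log lines above.)
module_key_rejected() {
    grep -E -q "could not insert '[^']+': Key was rejected by service|Loading of module with unavailable key is rejected"
}

# Usage on a live system:
#   journalctl -b0 | module_key_rejected && echo "module signature rejected"
```

A match means the module was built but refused at load time, which points at Secure Boot key enrolment rather than at the DKMS build.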

Daniel van Vugt (vanvugt) wrote :

Bug 2014983 has a suggestion for how to fix it.

no longer affects: dkms (Ubuntu)
Alexander (sturmlocke86) wrote :

@Teh Kok How (khteh), replying to your post #17.

Typing this from my phone rn, so bear with me please. I kept digging into this because it turns out that the computer that my wife uses also had a few Nvidia related issues recently, and she also has the Oracle and low latency kernel installed, even though she didn't install these manually, willingly or knowingly. There is a bunch of information about this out there:

- https://www.reddit.com/r/Kubuntu/comments/14ndb7a/why_do_i_keep_getting_these_oracle_and_lowlatency/

- https://ubuntuforums.org/showthread.php?t=2482558

- https://ubuntuforums.org/showthread.php?t=2473057

- https://ubuntuforums.org/showthread.php?t=2483064

Going by the information at hand, it seems that this is indeed caused either by the Nvidia driver update in Ubuntu, or it's a dependency of Virtualbox or Gnome Boxes, which I mentioned in the comments of the bug report further below. Two things here: a) having the kernel change like this from the default Ubuntu kernel to something from Oracle, without any obvious user notice or confirmation, is something to worry about imo; b) it seems to be tied to some of the GPU driver update issues that a few of us have been experiencing, and it has even rendered some systems unusable, see posts above.

I haven't found the time to further diagnose my own Ubuntu + Nvidia related issues (https://bugs.launchpad.net/ubuntu/+source/mutter/+bug/2023776), and I don't see myself troubleshooting my wife's computer right now, so I leave this comment here just in case someone wants to open up a new or separate bug report about this, which I'll gladly upvote in order to provide whatever little information that I have right now. If there is already an ongoing bug report for this available, a point in the right direction is highly appreciated.

Thanks!

Edit / Update: Found the relevant / corresponding bug report for this issue here -> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2024113

Teh Kok How (khteh) wrote :

Teh Kok How (khteh) wrote :

Works now!

`sudo update-secureboot-policy --enroll-key`

Then reboot and choose "Enroll MOK".
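
To confirm afterwards that Secure Boot is on and the key took effect, something along these lines may help. It is a sketch only: `secure_boot_state` is a made-up helper that reads `mokutil --sb-state` output, and the MOK path in the usage comment is the usual Ubuntu location, assumed here rather than taken from this report.

```shell
# Report the Secure Boot state from `mokutil --sb-state` output on stdin.
# Prints "enabled" when a line starts with "SecureBoot enabled",
# otherwise "not-enabled". (Hypothetical helper for illustration.)
secure_boot_state() {
    if grep -q '^SecureBoot enabled'; then
        echo "enabled"
    else
        echo "not-enabled"
    fi
}

# Usage on a live system:
#   mokutil --sb-state | secure_boot_state
#   mokutil --test-key /var/lib/shim-signed/mok/MOK.der   # assumed key path
#   nvidia-smi                                            # should now reach the driver
```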
