install --gpgpu does not install modules with RTX A5000

Bug #2090502 reported by Joao Andre Simioni
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
ubuntu-drivers-common (Ubuntu)
New
Undecided
Unassigned

Bug Description

[Problem Description]

In a system with RTX A5000, when using the --gpgpu flag, the drivers options
are correctly displayed:

# lspci | grep -i nvidia
3b:00.0 VGA compatible controller: NVIDIA Corporation GA102GL [RTX A5000] (rev a1)
3b:00.1 Audio device: NVIDIA Corporation GA102 High Definition Audio Controller (rev a1)

# ubuntu-drivers list --gpgpu
nvidia-driver-550, (kernel modules provided by linux-modules-nvidia-550-generic)
nvidia-driver-470-server, (kernel modules provided by linux-modules-nvidia-470-server-generic)
nvidia-driver-545, (kernel modules provided by nvidia-dkms-545)
nvidia-driver-535, (kernel modules provided by linux-modules-nvidia-535-generic)
nvidia-driver-535-open, (kernel modules provided by linux-modules-nvidia-535-open-generic)
nvidia-driver-535-server, (kernel modules provided by linux-modules-nvidia-535-server-generic)
nvidia-driver-535-server-open, (kernel modules provided by linux-modules-nvidia-535-server-open-generic)
nvidia-driver-545-open, (kernel modules provided by nvidia-dkms-545-open)
nvidia-driver-470, (kernel modules provided by linux-modules-nvidia-470-generic)
nvidia-driver-550-open, (kernel modules provided by linux-modules-nvidia-550-open-generic)

When installing it, it picks the nvidia-headless-no-dkms-535-server package, but this won't install the signed modules:

# ubuntu-drivers install --gpgpu
nvidia-headless-no-dkms-535-server
None
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
The following packages were automatically installed and are no longer required:
  adwaita-icon-theme at-spi2-core dconf-gsettings-backend dconf-service fontconfig fontconfig-config fonts-dejavu-core gsettings-desktop-schemas gtk-update-icon-cache hicolor-icon-theme
  humanity-icon-theme libatk-bridge2.0-0 libatk1.0-0 libatk1.0-data libatspi2.0-0 libavahi-client3 libavahi-common-data libavahi-common3 libcairo-gobject2 libcairo2 libcolord2 libcups2 libdatrie1
  libdconf1 libdeflate0 libdpkg-perl libdrm-amdgpu1 libdrm-intel1 libdrm-nouveau2 libdrm-radeon1 libegl-mesa0 libegl1 libepoxy0 libfile-fcntllock-perl libfontconfig1 libfontenc1 libgbm1
  libgdk-pixbuf-2.0-0 libgdk-pixbuf2.0-bin libgdk-pixbuf2.0-common libgl1 libgl1-amber-dri libgl1-mesa-dri libglapi-mesa libglvnd0 libglx-mesa0 libglx0 libgraphite2-3 libgtk-3-0 libgtk-3-bin
  libgtk-3-common libharfbuzz0b libice6 libjbig0 libjpeg-turbo8 libjpeg8 liblcms2-2 libllvm15 libpango-1.0-0 libpangocairo-1.0-0 libpangoft2-1.0-0 libpixman-1-0 librsvg2-2 librsvg2-common
  libsensors-config libsensors5 libsm6 libthai-data libthai0 libtiff5 libvdpau1 libwayland-client0 libwayland-cursor0 libwayland-egl1 libwayland-server0 libwebp7 libx11-xcb1 libxaw7 libxcb-dri2-0
  libxcb-dri3-0 libxcb-glx0 libxcb-present0 libxcb-randr0 libxcb-render0 libxcb-shm0 libxcb-sync1 libxcb-xfixes0 libxcomposite1 libxcursor1 libxcvt0 libxdamage1 libxfixes3 libxfont2 libxi6 libxinerama1
  libxkbcommon0 libxkbfile1 libxmu6 libxnvctrl0 libxpm4 libxrandr2 libxrender1 libxshmfence1 libxt6 libxtst6 libxxf86vm1 mesa-vdpau-drivers pkg-config session-migration ubuntu-mono vdpau-driver-all
  x11-common x11-xkb-utils xcvt xfonts-base xfonts-encodings xfonts-utils xserver-common xserver-xorg-core
Use 'apt autoremove' to remove them.
The following additional packages will be installed:
  libnvidia-cfg1-535-server libnvidia-compute-535-server nvidia-compute-utils-535-server nvidia-firmware-535-server-535.216.01 nvidia-kernel-common-535-server nvidia-kernel-source-535-server
The following NEW packages will be installed:
  libnvidia-cfg1-535-server libnvidia-compute-535-server nvidia-compute-utils-535-server nvidia-firmware-535-server-535.216.01 nvidia-headless-no-dkms-535-server nvidia-kernel-common-535-server
  nvidia-kernel-source-535-server
0 upgraded, 7 newly installed, 0 to remove and 5 not upgraded.

# dpkg -l | grep -i nvidia
ii libnvidia-cfg1-535-server:amd64 535.216.01-0ubuntu0.22.04.1 amd64 NVIDIA binary OpenGL/GLX configuration library
ii libnvidia-compute-535-server:amd64 535.216.01-0ubuntu0.22.04.1 amd64 NVIDIA libcompute package
ii nvidia-compute-utils-535-server 535.216.01-0ubuntu0.22.04.1 amd64 NVIDIA compute utilities
ii nvidia-firmware-535-server-535.216.01 535.216.01-0ubuntu0.22.04.1 amd64 Firmware files used by the kernel module
ii nvidia-headless-no-dkms-535-server 535.216.01-0ubuntu0.22.04.1 amd64 NVIDIA headless metapackage - no DKMS
ii nvidia-kernel-common-535-server 535.216.01-0ubuntu0.22.04.1 amd64 Shared files used with the kernel module
ii nvidia-kernel-source-535-server 535.216.01-0ubuntu0.22.04.1 amd64 NVIDIA kernel source package

[Workaround]

Manually installing the proposed package listed in the list options will work:

# apt install linux-modules-nvidia-535-server-generic
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
The following packages were automatically installed and are no longer required:
  adwaita-icon-theme at-spi2-core dconf-gsettings-backend dconf-service fontconfig fontconfig-config fonts-dejavu-core gsettings-desktop-schemas gtk-update-icon-cache hicolor-icon-theme
  humanity-icon-theme libatk-bridge2.0-0 libatk1.0-0 libatk1.0-data libatspi2.0-0 libavahi-client3 libavahi-common-data libavahi-common3 libcairo-gobject2 libcairo2 libcolord2 libcups2 libdatrie1
  libdconf1 libdeflate0 libdpkg-perl libdrm-amdgpu1 libdrm-intel1 libdrm-nouveau2 libdrm-radeon1 libegl-mesa0 libegl1 libepoxy0 libfile-fcntllock-perl libfontconfig1 libfontenc1 libgbm1
  libgdk-pixbuf-2.0-0 libgdk-pixbuf2.0-bin libgdk-pixbuf2.0-common libgl1 libgl1-amber-dri libgl1-mesa-dri libglapi-mesa libglvnd0 libglx-mesa0 libglx0 libgraphite2-3 libgtk-3-0 libgtk-3-bin
  libgtk-3-common libharfbuzz0b libice6 libjbig0 libjpeg-turbo8 libjpeg8 liblcms2-2 libllvm15 libpango-1.0-0 libpangocairo-1.0-0 libpangoft2-1.0-0 libpixman-1-0 librsvg2-2 librsvg2-common
  libsensors-config libsensors5 libsm6 libthai-data libthai0 libtiff5 libvdpau1 libwayland-client0 libwayland-cursor0 libwayland-egl1 libwayland-server0 libwebp7 libx11-xcb1 libxaw7 libxcb-dri2-0
  libxcb-dri3-0 libxcb-glx0 libxcb-present0 libxcb-randr0 libxcb-render0 libxcb-shm0 libxcb-sync1 libxcb-xfixes0 libxcomposite1 libxcursor1 libxcvt0 libxdamage1 libxfixes3 libxfont2 libxi6 libxinerama1
  libxkbcommon0 libxkbfile1 libxmu6 libxnvctrl0 libxpm4 libxrandr2 libxrender1 libxshmfence1 libxt6 libxtst6 libxxf86vm1 mesa-vdpau-drivers pkg-config session-migration ubuntu-mono vdpau-driver-all
  x11-common x11-xkb-utils xcvt xfonts-base xfonts-encodings xfonts-utils xserver-common xserver-xorg-core
Use 'apt autoremove' to remove them.
The following additional packages will be installed:
  linux-modules-nvidia-535-server-5.15.0-126-generic linux-objects-nvidia-535-server-5.15.0-126-generic linux-signatures-nvidia-5.15.0-126-generic
The following NEW packages will be installed:
  linux-modules-nvidia-535-server-5.15.0-126-generic linux-modules-nvidia-535-server-generic linux-objects-nvidia-535-server-5.15.0-126-generic linux-signatures-nvidia-5.15.0-126-generic
0 upgraded, 4 newly installed, 0 to remove and 5 not upgraded.

# modprobe nvidia
# cat /proc/driver/nvidia/version
NVRM version: NVIDIA UNIX x86_64 Kernel Module 535.216.01 Tue Sep 17 16:54:04 UTC 2024
GCC version:

Revision history for this message
Joao Andre Simioni (jasimioni) wrote :
Revision history for this message
Joao Andre Simioni (jasimioni) wrote :
Download full text (4.3 KiB)

I just noticed it can be related to the command line arguments order. The --gpgpu flag is being ignored and I think it's interpreting it as some driver version.

# ubuntu-drivers install
ERROR:root:aplay command not found
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
Recommended packages:
  libnvidia-compute-550:i386 libnvidia-decode-550:i386 libnvidia-encode-550:i386 libnvidia-fbc1-550:i386 libnvidia-gl-550:i386
The following NEW packages will be installed:
  libnvidia-cfg1-550 libnvidia-common-550 libnvidia-compute-550 libnvidia-decode-550 libnvidia-egl-wayland1 libnvidia-encode-550 libnvidia-extra-550 libnvidia-fbc1-550 libnvidia-gl-550
  linux-modules-nvidia-550-5.15.0-126-generic linux-modules-nvidia-550-generic linux-objects-nvidia-550-5.15.0-126-generic linux-signatures-nvidia-5.15.0-126-generic nvidia-compute-utils-550
  nvidia-driver-550 nvidia-firmware-550-550.120 nvidia-kernel-common-550 nvidia-kernel-source-550 nvidia-prime nvidia-settings nvidia-utils-550 screen-resolution-extra xserver-xorg-video-nvidia-550

# ubuntu-drivers --gpgpu install
This is gpgpu mode
ERROR:root:aplay command not found
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
Recommended packages:
  libnvidia-compute-550:i386 libnvidia-decode-550:i386 libnvidia-encode-550:i386 libnvidia-fbc1-550:i386 libnvidia-gl-550:i386
The following NEW packages will be installed:
  libnvidia-cfg1-550 libnvidia-common-550 libnvidia-compute-550 libnvidia-decode-550 libnvidia-egl-wayland1 libnvidia-encode-550 libnvidia-extra-550 libnvidia-fbc1-550 libnvidia-gl-550
  linux-modules-nvidia-550-5.15.0-126-generic linux-modules-nvidia-550-generic linux-objects-nvidia-550-5.15.0-126-generic linux-signatures-nvidia-5.15.0-126-generic nvidia-compute-utils-550
  nvidia-driver-550 nvidia-firmware-550-550.120 nvidia-kernel-common-550 nvidia-kernel-source-550 nvidia-prime nvidia-settings nvidia-utils-550 screen-resolution-extra xserver-xorg-video-nvidia-550
0 upgraded, 23 newly installed, 0 to remove and 5 not upgraded.

Both these formats work. But this fails:

# ubuntu-drivers install --gpgpu
nvidia-headless-no-dkms-535-server
None
Reading package lists... Done
Building dependency tree... Done ...

Read more...

Revision history for this message
Iurii Kondrakov (deezzir) wrote :

# ubuntu-drivers --gpgpu install
This is gpgpu mode
ERROR:root:aplay command not found
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
Recommended packages:
  libnvidia-compute-550:i386 libnvidia-decode-550:i386 libnvidia-encode-550:i386 libnvidia-fbc1-550:i386 libnvidia-gl-550:i386
The following NEW packages will be installed:
  libnvidia-cfg1-550 libnvidia-common-550 libnvidia-compute-550 libnvidia-decode-550 libnvidia-egl-wayland1 libnvidia-encode-550 libnvidia-extra-550 libnvidia-fbc1-550 libnvidia-gl-550
  linux-modules-nvidia-550-5.15.0-126-generic linux-modules-nvidia-550-generic linux-objects-nvidia-550-5.15.0-126-generic linux-signatures-nvidia-5.15.0-126-generic nvidia-compute-utils-550
  nvidia-driver-550 nvidia-firmware-550-550.120 nvidia-kernel-common-550 nvidia-kernel-source-550 nvidia-prime nvidia-settings nvidia-utils-550 screen-resolution-extra xserver-xorg-video-nvidia-550
0 upgraded, 23 newly installed, 0 to remove and 5 not upgraded.

Using the flag before the install arguments fixes the issue, installing the nvidia-driver-550-server

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.