systemd-udevd eats cpu in infinite loop when nvidia card is not plugged

Bug #1774768 reported by Gaétan QUENTIN
This bug affects 4 people
Affects: systemd (Ubuntu)
Status: Invalid
Importance: Undecided
Assigned to: Unassigned

Bug Description

I have a roaming SSD with Ubuntu 18.04 and the proprietary NVIDIA drivers installed from an Ubuntu third-party repository:

ii nvidia-cuda-dev 9.1.85-3ubuntu1 amd64 NVIDIA CUDA development files
ii nvidia-cuda-doc 9.1.85-3ubuntu1 all NVIDIA CUDA and OpenCL documentation
ii nvidia-cuda-gdb 9.1.85-3ubuntu1 amd64 NVIDIA CUDA Debugger (GDB)
ii nvidia-cuda-toolkit 9.1.85-3ubuntu1 amd64 NVIDIA CUDA development toolkit
ii nvidia-dkms-396 396.24-0ubuntu0~gpu18.04.1 amd64 NVIDIA DKMS package
ii nvidia-kernel-common-396 396.24-0ubuntu0~gpu18.04.1 amd64 Shared files used with the kernel module
ii nvidia-kernel-source-396 396.24-0ubuntu0~gpu18.04.1 amd64 NVIDIA kernel source package
ii nvidia-profiler 9.1.85-3ubuntu1 amd64 NVIDIA Profiler for CUDA and OpenCL
ii nvidia-settings 396.24-0ubuntu0~gpu18.04.1 amd64 Tool for configuring the NVIDIA graphics driver
ii nvidia-visual-profiler 9.1.85-3ubuntu1 amd64 NVIDIA Visual Profiler for CUDA and OpenCL

When I boot on my NVIDIA system, there is no problem.

When I boot on another system with a different card, systemd-udevd consumes a lot of CPU (close to 100% or slightly more), and the logs show an endless loop of messages:

Jun 2 11:23:16 localhost [19963]: failed to execute '/usr/bin/nvidia-smi' '/usr/bin/nvidia-smi': No such file or directory
Jun 2 11:23:16 localhost systemd-udevd[492]: Process '/usr/bin/nvidia-smi' failed with exit code 2.
Jun 2 11:23:16 localhost systemd[1]: nvidia-persistenced.service: Start request repeated too quickly.
Jun 2 11:23:16 localhost systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
Jun 2 11:23:16 localhost systemd[1]: Failed to start NVIDIA Persistence Daemon.
Jun 2 11:23:16 localhost kernel: [ 2227.682538] nvidia-nvlink: Nvlink Core is being initialized, major device number 241
Jun 2 11:23:16 localhost kernel: [ 2227.682917] NVRM: No NVIDIA graphics adapter found!
Jun 2 11:23:16 localhost kernel: [ 2227.683149] nvidia-nvlink: Unregistered the Nvlink Core, major device number 241
Jun 2 11:23:16 localhost systemd-udevd[492]: Process '/sbin/modprobe nvidia-modeset' failed with exit code 1.
Jun 2 11:23:16 localhost kernel: [ 2227.756646] nvidia-nvlink: Nvlink Core is being initialized, major device number 241
Jun 2 11:23:16 localhost kernel: [ 2227.756971] NVRM: No NVIDIA graphics adapter found!
Jun 2 11:23:16 localhost kernel: [ 2227.757149] nvidia-nvlink: Unregistered the Nvlink Core, major device number 241
Jun 2 11:23:16 localhost systemd-udevd[492]: Process '/sbin/modprobe nvidia-drm' failed with exit code 1.
Jun 2 11:23:16 localhost kernel: [ 2227.824072] nvidia-nvlink: Nvlink Core is being initialized, major device number 241
Jun 2 11:23:16 localhost kernel: [ 2227.824383] NVRM: No NVIDIA graphics adapter found!
Jun 2 11:23:16 localhost kernel: [ 2227.824596] nvidia-nvlink: Unregistered the Nvlink Core, major device number 241
Jun 2 11:23:16 localhost systemd-udevd[492]: Process '/sbin/modprobe nvidia-uvm' failed with exit code 1.
Jun 2 11:23:16 localhost [19972]: failed to execute '/usr/bin/nvidia-smi' '/usr/bin/nvidia-smi': No such file or directory
Jun 2 11:23:16 localhost systemd-udevd[492]: Process '/usr/bin/nvidia-smi' failed with exit code 2.
Jun 2 11:23:16 localhost systemd[1]: nvidia-persistenced.service: Start request repeated too quickly.
Jun 2 11:23:16 localhost systemd[1]: nvidia-persistenced.service: Failed with result 'exit-code'.
Jun 2 11:23:16 localhost systemd[1]: Failed to start NVIDIA Persistence Daemon.
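To see how often each command fails in a loop like the one above, the log can be tallied with a small script. This is only a sketch: it assumes the syslog lines have the same shape as the excerpt above, and the function name `count_udev_failures` is made up for illustration.

```python
import re
from collections import Counter

# Matches lines such as:
#   ... systemd-udevd[492]: Process '/sbin/modprobe nvidia-drm' failed with exit code 1.
FAILURE_RE = re.compile(
    r"systemd-udevd\[\d+\]: Process '(?P<cmd>[^']+)' failed with exit code (?P<code>\d+)"
)


def count_udev_failures(lines):
    """Count failed udev-spawned processes, keyed by (command, exit_code).

    `lines` is any iterable of log lines (hypothetical helper, not part of udev).
    """
    counts = Counter()
    for line in lines:
        match = FAILURE_RE.search(line)
        if match:
            counts[(match.group("cmd"), int(match.group("code")))] += 1
    return counts
```

Passing it an open log file (for example `count_udev_failures(open("/var/log/syslog"))`, assuming that is where the messages end up) gives a per-command count that makes the repetition easy to quantify.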

The nvidia-uvm module is listed in /etc/modules:

# /etc/modules: kernel modules to load at boot time.
#
# This file contains the names of kernel modules that should be loaded
# at boot time, one per line. Lines beginning with "#" are ignored.
nvidia-uvm

It should just fail at boot time and do nothing else.
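One way to tell whether the machine the disk is currently booted on has an NVIDIA adapter at all is to look for the NVIDIA PCI vendor ID (0x10de) in sysfs. This is a minimal sketch, not something udev or the driver packages do: it assumes sysfs is mounted at /sys, and the helper name `has_nvidia_pci_device` is hypothetical.

```python
from pathlib import Path

NVIDIA_VENDOR_ID = "0x10de"  # PCI vendor ID assigned to NVIDIA


def has_nvidia_pci_device(sysfs_root="/sys/bus/pci/devices"):
    """Return True if any PCI device under sysfs_root reports the NVIDIA vendor ID.

    Hypothetical helper for illustration; sysfs_root is a parameter so the
    function can be pointed at a test directory.
    """
    root = Path(sysfs_root)
    if not root.is_dir():
        return False
    for device in root.iterdir():
        try:
            vendor = (device / "vendor").read_text().strip().lower()
        except OSError:
            # Entry without a readable "vendor" file: skip it.
            continue
        if vendor == NVIDIA_VENDOR_ID:
            return True
    return False
```

A check like this could be run on the non-NVIDIA machine to confirm that the kernel really sees no NVIDIA device there, which matches the "NVRM: No NVIDIA graphics adapter found!" messages in the log.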

When I stop systemd-udevd and start it again, the problem disappears.

I did not have this problem with Ubuntu 16.X/17.X.
It may also be an NVIDIA driver/software problem, so I will file a bug report against the NVIDIA driver too.

Regards

ProblemType: Bug
DistroRelease: Ubuntu 18.04
Package: udev 237-3ubuntu10
ProcVersionSignature: Ubuntu 4.15.0-22.24-generic 4.15.17
Uname: Linux 4.15.0-22-generic x86_64
ApportVersion: 2.20.9-0ubuntu7.1
Architecture: amd64
CurrentDesktop: ubuntu:GNOME
CustomUdevRuleFiles: 60-vlan-network-interface.rules 70-snap.core.rules
Date: Sat Jun 2 11:16:41 2018
EcryptfsInUse: Yes
InstallationDate: Installed on 2017-04-25 (402 days ago)
InstallationMedia: Ubuntu 17.04 "Zesty Zapus" - Release amd64 (20170412)
MachineType: GIGABYTE MRHM3AP
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 XDG_RUNTIME_DIR=<set>
 LANG=fr_FR.UTF-8
 SHELL=/bin/bash
ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-4.15.0-22-generic root=UUID=9ed36b5d-9fea-4544-b33c-a7c75c2cdfa5 ro rootflags=subvol=@ disablemodules=nouveau pci=noaer elevator=noop net.ifnames=0 ipv6.disable=1 quiet splash vt.handoff=1
SourcePackage: systemd
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 01/29/2014
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: F7
dmi.board.asset.tag: To be filled by O.E.M.
dmi.board.name: MRHM3AP
dmi.board.vendor: GIGABYTE
dmi.board.version: To be filled by O.E.M.
dmi.chassis.asset.tag: To Be Filled By O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: To Be Filled By O.E.M.
dmi.chassis.version: To Be Filled By O.E.M.
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvrF7:bd01/29/2014:svnGIGABYTE:pnMRHM3AP:pvrTobefilledbyO.E.M.:rvnGIGABYTE:rnMRHM3AP:rvrTobefilledbyO.E.M.:cvnToBeFilledByO.E.M.:ct3:cvrToBeFilledByO.E.M.:
dmi.product.family: To be filled by O.E.M.
dmi.product.name: MRHM3AP
dmi.product.version: To be filled by O.E.M.
dmi.sys.vendor: GIGABYTE

Gaétan QUENTIN (gaetan-quentin) wrote :
description: updated
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in systemd (Ubuntu):
status: New → Confirmed
Dan Streetman (ddstreet) wrote :

Please reopen if this is still an issue.

Changed in systemd (Ubuntu):
status: Confirmed → Invalid