driver installation cost about 20 mins on 64 cores system
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
HWE Next |
Fix Released
|
Undecided
|
Unassigned | ||
nvidia-graphics-drivers-375 (Ubuntu) |
Fix Released
|
High
|
Alberto Milone |
Bug Description
Ubuntu version: 16.04
Kernel: 4.4.0-67-generic
Issue:
With current makefile setting "make -j$(nproc)".
And on a 64 cores system the nvidia driver[1] installation stucks in "Building initial module for 4.4.0-67-generic" for about 20 mins.
Workaround:
repack driver to change setting to "make -j16" , then it just spend about 3 mins to pass "Building initial module"
Investigation:
from iotop, there are 35 processes were using >90% CPU, 23 processes >50% , it might could be a evidence that too heavy IO accessing by -j$(nproc) caused whole system hangs up when nv driver installation.
htop:
http://
[1] https:/
Changed in nvidia-graphics-drivers-375 (Ubuntu): | |
status: | New → In Progress |
importance: | Undecided → High |
assignee: | nobody → Alberto Milone (albertomilone) |
tags: | added: originate-from-1675061 somerville |
Changed in hwe-next: | |
status: | New → Fix Released |
add verbose debug message to check what it was doing after printing "Building initial module for 4.4.0-67-generic",
And attached tarball include the message for -j$(nproc) and -j16 j16-verbose- complete. log : the complete message for buiding in -j16 jnproc- verbose- complete. log : the complete message for buiding in -j$(proc) jnproc- verbose. log : the message copied when saw "Building initial module for 4.4.0-67-generic" jnproc- verbose- 2.log : the message copied when saw "Building initial module for 4.4.0-67-generic" and stucked for a while.
├── make-nvidia-
├── make-nvidia-
├── make-nvidia-
├── make-nvidia-