build tensorflow from source cause machine reboot

Bug #1957861 reported by james guo
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
kernel-package (Ubuntu)
Undecided
Unassigned

Bug Description

$ lsb_release -rd
Description: Ubuntu 20.04.3 LTS
Release: 20.04

motherboard h11ssl bios 2.0A(for Eng Sample CPU)

cpu epyc 32 core 64 thread
$ cat /proc/cpuinfo
...
processor : 63
vendor_id : AuthenticAMD
cpu family : 23
model : 48
model name : AMD Eng Sample: 2S1705E3VIVG5_20/17_N
stepping : 0
microcode : 0x8300027
cpu MHz : 900.000
cache size : 512 KB
physical id : 0
siblings : 64
core id : 31
cpu cores : 32
apicid : 63
initial apicid : 63
fpu : yes
fpu_exception : yes
cpuid level : 16
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs skinit wdt tce topoext perfctr_core perfctr_nb bpext perfctr_llc mwaitx cpb cat_l3 cdp_l3 hw_pstate sme ssbd mba sev ibpb vmmcall sev_es fsgsbase bmi1 avx2 smep bmi2 cqm rdt_a rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves cqm_llc cqm_occup_llc cqm_mbm_total cqm_mbm_local clzero irperf xsaveerptr rdpru wbnoinvd arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold avic v_vmsave_vmload vgif umip rdpid overflow_recov succor smca
bugs : sysret_ss_attrs spectre_v1 spectre_v2 spec_store_bypass
bogomips : 3399.98
TLB size : 3072 4K pages
clflush size : 64
cache_alignment : 64
address sizes : 43 bits physical, 48 bits virtual
power management: ts ttp tm hwpstate cpb eff_freq_ro [13] [14]

install cuda-repo-ubuntu2004-11-4-local_11.4.3-470.82.01-1_amd64.deb
install nv-tensorrt-repo-ubuntu2004-cuda11.4-trt8.2.2.1-ga-20211214_1-1_amd64.deb
build tensorflow 2.8 from source
machine reboot unexpected

Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1957861/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Revision history for this message
james guo (jinp65) wrote :

From the symptom, fell like the kernel does not handle this cpu very well

james guo (jinp65)
affects: ubuntu → kernel-package (Ubuntu)
Revision history for this message
james guo (jinp65) wrote :

upgrade kernel 5.11.0-46 to 5.13.0-25 fix the problem

To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers