Randomly segfault on BYT with 4.4 kernel

Bug #1586941 reported by Ying-Chun Liu
This bug affects 1 person
Affects               Status     Importance  Assigned to  Milestone
OEM Priority Project  Won't Fix  Medium      Unassigned
upstart               New        Undecided   Unassigned
upstart (Ubuntu)      New        Undecided   Unassigned

Bug Description

We are seeing random crashes when using the 14.04 (trusty) upstart (1.12.1-0ubuntu4.2).

We've tried the workarounds from LP:1447756, but they don't help.
The test case from that bug doesn't crash either, so we think this is a new bug.

The bug also happens at very early stages. Inside the installer it crashes more often than on the installed system. We see it dump core in the installer, but we are not able to get the core file because the squashfs only keeps files in memory.

Also, when running on the installed system, the filesystem seems to be read-only at that point, so there are no logs either.

We are using 4.4 kernel (linux-image-4.4.0-22-generic 4.4.0-22.40~14.04.1)

I'll attach some images and logs later.

Tags: trusty
Ying-Chun Liu (paulliu) wrote :

I tried removing exit(signum) from crash_handler, and I got a core dump (it might not be from the first crash).

Here is the backtrace:
#0 0x00007fa5cef53f90 in sigprocmask () from /lib/x86_64-linux-gnu/libc.so.6
No symbol table info available.
#1 0x0000563682722e92 in ?? ()
No symbol table info available.
#2 <signal handler called>
No symbol table info available.
#3 0x00007fa5cfd6dc9c in nih_list_destroy ()
   from /lib/x86_64-linux-gnu/libnih.so.1
No symbol table info available.
#4 0x00007fa5cfd6c3a9 in nih_discard () from /lib/x86_64-linux-gnu/libnih.so.1
No symbol table info available.
#5 0x0000563682731895 in ?? ()
No symbol table info available.
#6 0x0000563682730ccc in ?? ()
No symbol table info available.
#7 0x0000563682731d0a in ?? ()
No symbol table info available.
#8 0x00005636827369bd in ?? ()
No symbol table info available.
#9 0x00007fa5cfd742ba in nih_main_loop ()
   from /lib/x86_64-linux-gnu/libnih.so.1
No symbol table info available.
#10 0x00005636827225f9 in ?? ()
#11 0x00007fa5cef3eec5 in __libc_start_main ()
   from /lib/x86_64-linux-gnu/libc.so.6
No symbol table info available.
#12 0x0000563682722712 in ?? ()
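
For anyone triaging a trace like the one above, here is a minimal Python sketch that splits a gdb backtrace into frames, picks out the frame that was running when the signal arrived (the one right after "<signal handler called>"), and lists frames gdb could not resolve. The helper names (parse_backtrace, faulting_frame, unresolved_frames) are hypothetical and are not part of gdb or upstart; the parser only handles the line shapes that appear in this report.

```python
import re

# One gdb frame line, e.g.
#   "#3 0x00007fa5cfd6dc9c in nih_list_destroy ()"
#   "#0 0x... in sigprocmask () from /lib/x86_64-linux-gnu/libc.so.6"
#   "#2 <signal handler called>"
FRAME_RE = re.compile(
    r"^#(?P<num>\d+)\s+"
    r"(?:(?P<addr>0x[0-9a-fA-F]+) in )?"
    r"(?P<func>.+?)"
    r"(?: \(\))?"
    r"(?:\s+from (?P<lib>\S+))?$"
)

# First six frames copied from the backtrace in this report.
SAMPLE = """\
#0 0x00007fa5cef53f90 in sigprocmask () from /lib/x86_64-linux-gnu/libc.so.6
No symbol table info available.
#1 0x0000563682722e92 in ?? ()
No symbol table info available.
#2 <signal handler called>
No symbol table info available.
#3 0x00007fa5cfd6dc9c in nih_list_destroy ()
   from /lib/x86_64-linux-gnu/libnih.so.1
No symbol table info available.
#4 0x00007fa5cfd6c3a9 in nih_discard () from /lib/x86_64-linux-gnu/libnih.so.1
No symbol table info available.
#5 0x0000563682731895 in ?? ()
"""


def parse_backtrace(text):
    """Return a list of dicts with keys num, addr, func, lib."""
    frames = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("No symbol table"):
            continue
        # gdb wraps long frames: the "from <library>" part may be on its own line.
        if line.startswith("from ") and frames:
            frames[-1]["lib"] = line[len("from "):]
            continue
        m = FRAME_RE.match(line)
        if m:
            frames.append({
                "num": int(m.group("num")),
                "addr": m.group("addr"),
                "func": m.group("func"),
                "lib": m.group("lib"),
            })
    return frames


def faulting_frame(frames):
    """Frame that was executing when the signal was delivered, or None."""
    for i, frame in enumerate(frames):
        if frame["func"] == "<signal handler called>" and i + 1 < len(frames):
            return frames[i + 1]
    return None


def unresolved_frames(frames):
    """Frames gdb printed as '??' (no symbol found for the address)."""
    return [f for f in frames if f["func"] == "??"]


if __name__ == "__main__":
    frames = parse_backtrace(SAMPLE)
    print("faulting frame:", faulting_frame(frames))
    print("unresolved:", [f["num"] for f in unresolved_frames(frames)])
```

On the sample, the frame following the signal handler is nih_list_destroy in libnih.so.1, called from nih_discard; the "??" frames have no library attached in the trace, so resolving them would need debug symbols for whichever binary those addresses belong to.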

Ying-Chun Liu (paulliu) wrote :

The bug happens randomly. In the installer it occurs about 90% of the time; on the installed system it is less frequent. The point at which the kernel panics is therefore also random.

Jamie Chang (jamie315)
Changed in oem-priority:
importance: Undecided → High
Ara Pulido (ara)
Changed in oem-priority:
status: New → Incomplete
Ying-Chun Liu (paulliu) wrote :

We've tried upstart 1.13.2 from xenial. I downloaded 1.13.2-0ubuntu16 from Xenial and rebuilt it in a Trusty chroot (also rebuilding all of the reverse dependencies). However, the resulting package doesn't provide /sbin/init, so we cannot use it.

Ara Pulido (ara)
Changed in oem-priority:
status: Incomplete → Confirmed
tags: added: trusty
Changed in oem-priority:
importance: High → Medium
Yuan-Chen Cheng (ycheng-twn) wrote :

Too old.

Changed in oem-priority:
status: Confirmed → Won't Fix