Comment 68 for bug 1569925

Rafael David Tinoco (rafaeldtinoco) wrote :

Okay, besides everything I posted earlier (which is definitely correct, will possibly be addressed in other bugs, and also shows other problems that could make systemd hang because of iSCSI disks), I was able to reproduce the same symptom as this bug:

[ OK ] Stopped LSB: ebtables ruleset management.
[ OK ] Stopped Login to default iSCSI targets.
         Stopping iSCSI initiator daemon (iscsid)...
[ OK ] Stopped iSCSI initiator daemon (iscsid).
[ OK ] Stopped target Network is Online.
[ OK ] Stopped target Network.
         Stopping Raise network interfaces...
         Stopping ifup for bond0.20...
[ OK ] Stopped ifup for bond0.20.
         Stopping ifup for bond0.10...
[ OK ] Stopped ifup for bond0.10.
         Stopping ifup for bond0...
[ OK ] Stopped ifup for bond0.
         Stopping ifup for internal...
[ OK ] Stopped ifup for internal.
[ OK ] Stopped Raise network interfaces.
[ OK ] Stopped target Network (Pre).
[ OK ] Stopped Apply Kernel Variables.
[ OK ] Stopped Load Kernel Modules.
[ OK ] Stopped target Local File Systems.
         Unmounting /run/cgmanager/fs...
         Unmounting /run/user/1000...
         Unmounting /xfs...
         Unmounting /ext4...
[ OK ] Unmounted /run/cgmanager/fs.
[ OK ] Unmounted /run/user/1000.

[ 522.972114] connection3:0: ping timeout of 5 secs expired, recv timeout 5, last rx 4295020535, last ping 4295021788, now 4295023040
[ 522.974085] connection4:0: ping timeout of 5 secs expired, recv timeout 5, last rx 4295020535, last ping 4295021788, now 4295023040
[ 522.975905] connection2:0: ping timeout of 5 secs expired, recv timeout 5, last rx 4295020535, last ping 4295021788, now 4295023040
[ 522.976087] connection1:0: ping timeout of 5 secs expired, recv timeout 5, last rx 4295020535, last ping 4295021788, now 4295023040

[ 720.156456] INFO: task umount:11860 blocked for more than 120 seconds.
[ 720.157477] Not tainted 4.4.0-93-generic #116-Ubuntu
[ 720.158282] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 720.159695] INFO: task umount:11861 blocked for more than 120 seconds.
[ 720.160680] Not tainted 4.4.0-93-generic #116-Ubuntu
[ 720.161587] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 733.180181] sd 2:0:0:1: timing out command, waited 180s
[ 733.181191] blk_update_request: I/O error, dev sdc, sector 0
[ 733.182164] sd 4:0:0:1: timing out command, waited 180s
[ 733.183084] blk_update_request: I/O error, dev sdb, sector 0

[ OK ] Unmounted /ext4.
[ OK ] Unmounted /xfs.
[ OK ] Reached target Unmount All Filesystems.
[ OK ] Stopped target Local File Systems (Pre).
[ OK ] Stopped Create Static Device Nodes in /dev.
[ OK ] Stopped Remount Root and Kernel File Systems.
[ OK ] Reached target Shutdown.
[ 805.018886] systemd-shutdown[1]: Failed to finalize DM devices, ignoring

<system hangs forever>

And that is likely because I added 2 paths to each of my NICs (using multipath, likely unrelated) and configured bonding + VLAN within "ifupdown", which is likely what causes this. After that change, I was able to make systemd freeze with (likely) the same output as in this bug (and in the one I'm taking care of). I'll investigate further tomorrow, but this could be related to ifupdown and its locking mechanism for configuring/de-configuring bond/VLAN/slave interfaces.
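
To make it easier to compare console captures like the one above between reproductions, here is a minimal sketch that pulls out the hung-task and iSCSI ping-timeout events. The helper name `summarize_console_log` and the regular expressions are my own assumptions based only on the message formats shown in this log; this is not part of any existing tool.

```python
import re

# Hypothetical helper: patterns are derived only from the log lines quoted above.
HUNG_TASK = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\] INFO: task (?P<name>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)
PING_TIMEOUT = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\] (?P<conn>connection\d+:\d+): "
    r"ping timeout of (?P<secs>\d+) secs expired"
)


def summarize_console_log(text):
    """Return (hung_tasks, ping_timeouts) found in a console capture.

    hung_tasks is a list of (timestamp, task_name, pid) tuples;
    ping_timeouts is a list of (timestamp, connection) tuples.
    """
    hung = [
        (float(m["ts"]), m["name"], int(m["pid"]))
        for m in HUNG_TASK.finditer(text)
    ]
    pings = [
        (float(m["ts"]), m["conn"])
        for m in PING_TIMEOUT.finditer(text)
    ]
    return hung, pings
```

Feeding it the text of a saved serial console capture gives two short lists, which should be enough to see whether the same umount tasks and iSCSI connections are involved in each hang.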

Matt,

If it's not too much trouble, could you share the /etc/network/interfaces you were using? You introduced yourself, and I was so focused on working this case that I forgot to mention: I'm one of the Sustaining Engineering Tech Leads @ Canonical. Usually these kinds of "partnership" product enablements/fixes are done four-handed in other channels, which is why I felt lost in the beginning about your role here. I'm sorry if that bothered you.

Now that I was able to reproduce it, I won't bother you much; I just want to compare your "interfaces" file with the ones I have (which face a similar problem).

Thanks

-Rafael