Compute node got kernel panic after instance creation

Bug #1418687 reported by Artem Panchenko
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Won't Fix
Low
Pavel Boldin

Bug Description

Fuel version info (6.1 #98): http://paste.openstack.org/show/167554/

This issue was produced during system tests on CI:

http://jenkins-product.srt.mirantis.net:8080/view/6.1_swarm/job/6.1.system_test.centos.cluster_actions/25/testReport/junit/%28root%29/deploy_stop_reset_on_ha/deploy_stop_reset_on_ha/

OSTF test 'Launch instance, create snapshot, launch instance from snapshot (failure)' failed by timeout, because compute node on which the instance was scheduled to boot got kernel panic (see screenshot) and became inaccessible.

Steps to reproduce:

1. Create new env: CentOS, HA, NovaFlatDHCP, Cinder for volumes
2. Add 3 controllers and 2 computes
3. Deploy changes
4. Run OSTF

Expected result:

- all health checks passed

Actual:

- 'Launch instance, create snapshot, launch instance from snapshot (failure)' test failed by timeout

Here is the part of OSTF logs:

http://paste.openstack.org/show/167564/
http://paste.openstack.org/show/167569/

As you can see from timestamps compute node gone offline when test was running (2015-02-05 05:18). After revert from snapshot crashed node (node-2) was inaccessible via network, VNC console screeshot is attached. When I reset that node it booted correctly and I was able to create instance on it without problems, so I believe that the bug is floating.

Diagnostic snapshot and logs from node-2 are attached.

Revision history for this message
Artem Panchenko (apanchenko-8) wrote :
Revision history for this message
Artem Panchenko (apanchenko-8) wrote :
Revision history for this message
Artem Panchenko (apanchenko-8) wrote :
Changed in fuel:
assignee: nobody → MOS Linux (mos-linux)
Revision history for this message
Aleksander Mogylchenko (amogylchenko) wrote :

Low priority, because:
- panic occurred in nested virtualization;
- panic occurred only once (according to Artem);

I was able to find only one somewhat similar bug:
https://bugzilla.redhat.com/show_bug.cgi?id=1076294

Changed in fuel:
importance: Undecided → Low
Changed in fuel:
status: New → Confirmed
Revision history for this message
Michael Semenov (msemenov) wrote :

Postponing to 7.0 due to low priority.

Changed in fuel:
milestone: 6.1 → 7.0
Changed in fuel:
assignee: MOS Linux (mos-linux) → Pavel Boldin (pboldin)
Revision history for this message
Pavel Boldin (pboldin) wrote :

Impossible to fix without debug information.

Please, next time this happens make a dump of the VM memory using QEMU `dump'.

Changed in fuel:
status: Confirmed → Won't Fix
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.