Fuel for OpenStack

[8.0] Pacemaker may be destroyed on one of the controllers after corosync restart

Bug #1576749 reported by Vadim Rovachev on 2016-04-29

This bug affects 1 person

	Status	Importance	Assigned to	Milestone
Fuel for OpenStack	Invalid	Medium	Oleksiy Molchanov	Fuel for OpenStack 10.0
8.0.x	Won't Fix	Medium	Fuel Sustaining	Fuel for OpenStack 8.0-updates
Mitaka	Won't Fix	Medium	Fuel Sustaining	Fuel for OpenStack 9.0

Bug Description

Detailed bug description:
In the swarm test:
https://github.com/openstack/fuel-qa/blob/stable/8.0/fuelweb_test/tests/tests_strength/test_failover_base.py#L1210-L1238
we kill and restart corosync on the controllers 500 times in a row. Sometimes this test fails.
Point of failure in the test logs:
http://paste.openstack.org/show/495777/
Point of failure on node-1 (the node that failed):
http://paste.openstack.org/show/495778/

Failed job:
https://patching-ci.infra.mirantis.net/job/8.0.system_test.ubuntu.ha_neutron_destructive/8/

All snapshots are attached.
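
For readers without access to the linked test, the kill-and-restart loop described above can be sketched roughly as follows. This is a minimal sketch, not the fuel-qa code: `run_restart_cycles` and its `kill`, `start`, and `is_healthy` callables are hypothetical names, and the actual commands the test runs on the controllers are not reproduced here.

```python
def run_restart_cycles(kill, start, is_healthy, cycles=500):
    """Kill and restart a service repeatedly; report the first failing cycle.

    kill, start and is_healthy are hypothetical callables supplied by the
    caller (for example, wrappers around remote commands on a controller).
    Returns the 1-based number of the first cycle after which is_healthy()
    was false, or None if every cycle passed.
    """
    for attempt in range(1, cycles + 1):
        kill()
        start()
        if not is_healthy():
            return attempt
    return None
```

Returning the cycle number instead of raising keeps the sketch neutral about how a real test would report the failure. An intermittent failure like this one would show up as a cycle number that differs between runs.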

Reproducibility:
Intermittent (the test fails only sometimes)
https://mirantis.testrail.com/index.php?/cases/results/271655

Tags:

Vadim Rovachev (vrovachev) wrote on 2016-04-29:

fail.tar.xz (55.6 MiB, application/octet-stream)

description:	updated
Changed in fuel:
milestone:	none → 8.0-updates
importance:	Undecided → Medium

Oleksiy Molchanov (omolchanov) on 2016-05-04

Changed in fuel:
assignee:	nobody → Fuel Sustaining (fuel-sustaining-team)
tags:	added: area-library

Matthew Mosesohn (raytrac3r) on 2016-05-18

tags:	added: ha

Oleksiy Molchanov (omolchanov) wrote on 2016-05-23:

So, in the logs we can see that node-1 actually returned to the cluster; this is also visible on node-2 (the node from which we run 'pcs status nodes' in the tests). The pcs_status command logs in the diagnostic snapshot show that node-1 was in the cluster. But the test logs indicate that node-1 was offline for 20 seconds, so the test failed.

I tried to reproduce it but did not manage to. So I am marking this as Incomplete until we have an environment to revert.
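
For illustration, the kind of check discussed in this comment (poll 'pcs status nodes' and give up after a fixed window) can be sketched as below. This is a minimal sketch, not the fuel-qa implementation. The output layout in the docstring (a "Pacemaker Nodes:" header followed by indented "Online:", "Standby:" and "Offline:" lines) is an assumption about the pcs version in use, and `parse_pcs_nodes`, `wait_node_online` and `get_output` are hypothetical names.

```python
import time


def parse_pcs_nodes(output):
    """Map each state label to the node names listed after it.

    Assumed input layout (may differ between pcs versions):
        Pacemaker Nodes:
         Online: node-2 node-3
         Standby:
         Offline: node-1
    """
    states = {}
    for line in output.splitlines():
        label, sep, names = line.strip().partition(":")
        # Skip lines without a colon and section headers such as "Pacemaker Nodes:".
        if not sep or label.endswith("Nodes"):
            continue
        states[label] = names.split()
    return states


def wait_node_online(node, get_output, timeout=20, interval=1, sleep=time.sleep):
    """Poll until `node` is listed as Online or `timeout` seconds have passed.

    get_output is a hypothetical callable returning the text of
    'pcs status nodes' as seen from some controller.
    """
    deadline = time.monotonic() + timeout
    while True:
        if node in parse_pcs_nodes(get_output()).get("Online", []):
            return True
        if time.monotonic() >= deadline:
            return False
        sleep(interval)
```

A check of this shape reports failure whenever the node takes longer than `timeout` to appear as Online, even if it rejoins shortly afterwards. That would be consistent with a snapshot that shows node-1 back in the cluster while the test logs record it as offline for the whole window.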

Dmitry Pyzhov (dpyzhov) wrote on 2016-06-27:

The bug has been in Incomplete status for a month. Changing status to Invalid. Please reopen the issue if you have more data.

no longer affects:	fuel/newton