Pacemaker turned neutron-ovs-agent into unmanaged state because proc_kill sends wrong signals to pkill
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
Critical
|
Bogdan Dobrelya | ||
6.1.x |
Won't Fix
|
Critical
|
MOS Maintenance | ||
7.0.x |
Won't Fix
|
Critical
|
MOS Maintenance |
Bug Description
As result of some failover circumstances neutron-ovs-agent became unmanaged.
According to agent logs:
2015-12-23T14:06:41 -- ovs agent started
2015-12-23T14:07:19 -- Agent initialized successfully, now running...
2015-12-23T14:08:55 -- Error while processing VIF ports due to MessagingTimeout: Timed out waiting for a reply to message
2015-12-23T14:09:01 -- the last message
In pacemaker logs:
Dec 23 14:08:51 [11810] node-4.domain.tld pacemaker_remoted: warning: child_timeout_
Dec 23 14:08:51 [11810] node-4.domain.tld pacemaker_remoted: warning: operation_finished: p_neutron-
Dec 23 14:08:51 [11813] node-4.domain.tld crmd: warning: update_failcount: Updating failcount for p_neutron-
Dec 23 14:08:51 [11812] node-4.domain.tld pengine: info: native_print: p_neutron-
It appears that the agent was started slowly and pacemaker decided to stop it forever.
Changed in fuel: | |
status: | New → Confirmed |
tags: | added: on-verification |
VERSION: version: "2015.1.0-7.0" 5b37608c787944d 1983f543aa8" fuelclient_ sha: "486bde57cda1ba db68f915f66c61b 544108606f3" e9085ff71d2950c fbcca91af67" nailgun- agent_sha: "d7027952870a35 db8dc52f185bb11 58cdd3d1ebd" 781c809db915992 7655ced5012" 0dc53b43825dc4c 8f7780be9dd" c3a0abd6af9f31e 5b4d150a11c" 284a2e4761be7a1 56bb5627677"
feature_groups:
- mirantis
production: "docker"
release: "7.0"
openstack_
api: "1.0"
build_number: "301"
build_id: "301"
nailgun_sha: "4162b0c15adb42
python-
fuel-agent_sha: "50e90af6e3d560
fuel-
astute_sha: "6c5b73f93e24cc
fuel-library_sha: "5d50055aeca1dd
fuel-ostf_sha: "2cd967dccd66cf
fuelmain_sha: "a65d453215edb0
DEPLOYMENT:
Neutron VXLAN + KVM