Pacemaker service is unable to be started in time when zabbix plugins enabled
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Fuel Plugins | Invalid | Undecided | Unassigned |
Fuel for OpenStack | Fix Released | Critical | Ksenia Svechnikova |
Bug Description
build_number: "286"
Scenario:
TUN HW environment 3x(controller, mongo)+
1. Create cluster with neutron + TUN
2. Configure emc and zabbix plugins (emc_vnx, zabbix_
3. Configure networks with bonds and create VLAN tagged interfaces
4. Start deploy
Deployment fails; controllers are in the error state:
id | status | name | cluster | ip | mac | roles | pending_roles | online | group_id
---|---|---|---|---|---|---|---|---|---
4 | provisioned | Untitled (45:54) | 2 | 192.168.5.113 | ec:f4:bb:cd:45:54 | cinder, compute | | True | 2
1 | ready | Untitled (42:94) | 2 | 192.168.5.110 | ec:f4:bb:cd:42:94 | controller, mongo | | True | 2
5 | provisioned | Untitled (45:4c) | 2 | 192.168.5.114 | ec:f4:bb:cd:45:4c | cinder, compute | | True | 2
3 | error | Untitled (43:00) | 2 | 192.168.5.111 | ec:f4:bb:cd:43:00 | controller, mongo | | True | 2
2 | error | Untitled (41:20) | 2 | 192.168.5.112 | ec:f4:bb:cd:41:20 | controller, mongo | | True | 2
Deployment on the controllers fails with Puppet errors related to the pacemaker service:
pacemaker.log:
Sep 08 15:53:46 [104918] node-3.domain.tld pacemakerd: error: pcmk_child_exit: Child process cib (104920) exited: Network is down (100)
Sep 08 15:53:46 [104918] node-3.domain.tld pacemakerd: warning: pcmk_child_exit: Pacemaker child process cib no longer wishes to be respawned. Shutting ourselves down.
Sep 08 15:53:46 [104918] node-3.domain.tld pacemakerd: error: pcmk_child_exit: Child process attrd (104923) exited: Network is down (100)
Sep 08 15:53:46 [104918] node-3.domain.tld pacemakerd: warning: pcmk_child_exit: Pacemaker child process attrd no longer wishes to be respawned. Shutting ourselves down.
...
2015-09-08 15:53:51 +0000 /Stage[
2015-09-08 15:53:51 +0000 /Stage[
2015-09-08 15:53:51 +0000 /Stage[
2015-09-08 15:53:51 +0000 /Stage[
2015-09-08 15:53:51 +0000 /Stage[
2015-09-08 15:53:51 +0000 Pcmk_nodes[
2015-09-08 15:53:51 +0000 Puppet (debug): Waiting 600 seconds for Pacemaker to become online
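For triage, the pcmk_child_exit errors in the pacemaker.log excerpt above can be pulled out with a short script. This is only a sketch: it assumes each log entry sits on a single line in the format shown in this report, and the names CHILD_EXIT and child_exits are illustrative, not part of Pacemaker or Fuel.

```python
import re

# Matches pacemakerd error lines of the form seen in this report, e.g.
#   ... pacemakerd: error: pcmk_child_exit: Child process cib (104920) exited: Network is down (100)
# Assumption: one log entry per line, with no wrapping.
CHILD_EXIT = re.compile(
    r"pacemakerd:\s+error:\s+pcmk_child_exit:\s+"
    r"Child process (?P<child>\S+) \((?P<pid>\d+)\) exited: "
    r"(?P<reason>.+?) \((?P<code>\d+)\)\s*$"
)


def child_exits(lines):
    """Return (child, pid, reason, exit_code) for every child-exit error line."""
    events = []
    for line in lines:
        match = CHILD_EXIT.search(line)
        if match:
            events.append(
                (
                    match["child"],
                    int(match["pid"]),
                    match["reason"],
                    int(match["code"]),
                )
            )
    return events


if __name__ == "__main__":
    sample = [
        "Sep 08 15:53:46 [104918] node-3.domain.tld pacemakerd: error: "
        "pcmk_child_exit: Child process cib (104920) exited: Network is down (100)",
    ]
    for event in child_exits(sample):
        print(event)
```

Grouping the results by exit code shows quickly whether all children stopped for the same reason.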
summary: |
- Pacemaker service is unable to started in time
+ Pacemaker service is unable to be started in time |
Changed in fuel: | |
status: | Incomplete → In Progress |
tags: | added: on-verification |
Changed in fuel-plugins: | |
status: | New → Invalid |
Regarding the "Network is down" message: manual checks show that ping between nodes on the mgmt network actually works.
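One possible reading, offered as an assumption rather than a confirmed diagnosis: the text "Network is down" may simply be the C library's description of the numeric code 100 that the child processes exited with, not evidence of a real network outage. On Linux, errno 100 is ENETDOWN, which a quick check illustrates (describe_exit_code is a hypothetical helper name):

```python
import errno
import os


def describe_exit_code(code: int) -> str:
    """Return the C library's errno description for a numeric code."""
    return os.strerror(code)


if __name__ == "__main__":
    # Assumption: a Linux host, where errno.ENETDOWN has the value 100.
    # Other platforms number this errno differently.
    print(errno.ENETDOWN, describe_exit_code(errno.ENETDOWN))
```

If that reading holds, the exit code would be worth investigating as a daemon-level status rather than as a connectivity failure, which fits the observation that ping works.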