Deployment failure due to ERROR: Could not determine galera name from pacemaker node <galera-bundle-0>.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Invalid
|
Critical
|
Unassigned |
Bug Description
We see overcloud deployment failures in the gate because the galera-ready task fails. The error from galera/pacemaker is:
Oct 7 02:44:53 localhost journal: #033[1;31mError: /usr/bin/
Oct 7 02:44:53 localhost journal: #033[1;31mError: /Stage[
Oct 7 02:44:53 localhost journal: #033[0;32mInfo: Class[Tripleo:
Oct 7 02:44:53 localhost journal: #033[0;32mInfo: Creating state file /var/lib/
Oct 7 02:44:53 localhost journal: #033[1;31mError: Failed to apply catalog: Execution of '/usr/bin/mysql --defaults-
Oct 7 02:44:53 localhost galera(
Oct 7 02:44:53 localhost pacemaker_
Ok so on http:// logs.openstack. org/12/ 510212/ 3/check/ gate-tripleo- ci-centos- 7-scenario004- multinode- oooq-container/ 54ca592/ logs/subnode- 2/var/log/ cluster/ corosync. log.txt. gz I see this: 7-2-node- inap-mtl01- 11266984- 941703 cib: info: cib_perform_op: ++ <nvpair id="galera- meta_attributes -container- attribute- target" name="container -attribute- target" value="host"/>
Oct 07 02:14:31 [28250] centos-
This should only happen only in one situation: 5f18914c887dc4f a4bad4d620 (which was reverted because we did not have the proper process for updating the mariadb/ rabbitmq/ redis containers with the latest pacemaker/ resource- agents rpms) resource- agents combo
A) puppet-tripleo does have the patch 6bcb011723ad7b7
B) the containers have an old pacemaker/
We know about B) which hopefully will be either solved with a master promotion or with manual fixing of the containers. A) is quite surprising since it was reverted on: attribute- target= host attribute"" <Jenkins>
091f92d6f0e8 - (2017-10-07 03:36:52 +0000) Merge "Revert "Set meta container-
So the reason for A) is we have an old puppet-tripleo in this job (i.e. it predates the revert)? tripleo- 8.0.0-0. 20171006214736. 5e54b7e. el7.centos. noarch
puppet-
The way out of this is to either fix B) or make sure we have a newer puppet-tripleo which contains the revert.
I will recap B) in an email tomorrow (am totally knackered atm)