M/N upgrade: blockstorage fails to converge.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
Critical
|
Unassigned |
Bug Description
Hi,
During a full upgrade of Mitaka to Newton the convergence step on the blockstorage fails with:
Error: Could not find dependency Exec[wait-
This is on the blockstorage node! It believes that it's the bootstrap
node and tries to create the pacemaker resource.
The reason seems to be linked to the commit
b345dbea16ad3ed
The bootstrap_nodeid, now looks like that:
[stack@instack ~]$ nova list
+------
| ID | Name | Status | Task State | Power State | Networks |
+------
| 750b6855-
| 522c9203-
| afdb238d-
| 365f459d-
| 8d23bc79-
| 5c752ac3-
| 4801bde7-
+------
==> 192.0.2.11
bootstrap_
bootstrap_
==> 192.0.2.7
bootstrap_
bootstrap_
==> 192.0.2.13
bootstrap_
bootstrap_
==> 192.0.2.10
bootstrap_
bootstrap_
==> 192.0.2.12
bootstrap_
bootstrap_
==> 192.0.2.9
bootstrap_
bootstrap_
==> 192.0.2.8
bootstrap_
bootstrap_
So on all roles in puppet-tripleo that check the bootstrap_node,
thinking it will be a pacemaker master fails on everything but the
controller nodes.
In the case of this bug this is the kind of code I'm talking about:
$bootstrap_node = hiera('
if $::hostname == downcase(
$pacemaker_
} else {
$pacemaker_
}
...
if $step >= 5 and $pacemaker_master {
pacemaker:
op_params => 'start timeout=200s stop timeout=200s',
}
}
in manifests/
So the bootstrap_nodeip is associated with the pacemaker_master, which
is not true anymore.
The tripleo code base is filled with this idiom.
Changed in tripleo: | |
importance: | Undecided → Critical |
milestone: | none → newton-rc3 |
This is a duplicate of https:/ /bugs.launchpad .net/tripleo/ +bug/1628912 which has been fixed.