deadlock can occur in between clustercheck vs ceph-mon setup in pacemaker scenario
Bug #1598907 reported by
Giulio Fidente
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
Medium
|
Giulio Fidente | ||
Mitaka |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
During step2, in pacemaker scenario, all nodes will try to perform the ceph-mon setup which blocks puppet until all initial members are available when forming the ceph cluster.
On the bootstrap node this can happen *before* galera setup is initialized but if in the same deployment, on non-bootstrap nodes clustercheck is launched first, the deployment will stop in a deadlock where:
1 ceph-mon on bootstrap node is waiting for the non-bootstrap nodes
2 clustercheck on non-bootstrap nodes is waiting galera to come up on the bootstrap nodes
The clustercheck resource should really only be used on bootstrap nodes.
Changed in tripleo: | |
status: | Triaged → In Progress |
Changed in tripleo: | |
milestone: | none → newton-2 |
To post a comment you must log in.
I have a WIP patch at https:/ /review. openstack. org/#/c/ 337302/ 1
More infos on the BZ can also be found in https:/ /bugzilla. redhat. com/show_ bug.cgi? id=1349456