Race condition in ZK node locking
Bug #1556063 reported by
Proskurin Kirill
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
kolla-mesos |
Fix Released
|
High
|
Proskurin Kirill |
Bug Description
Right now we check if ZK node exist first and if it's not - we lock node and do stuff.
So, in multinode deployment, several nodes do this and all of them try to get lock. So we end with situation, then 1 node got lock and run bootstrap for example and other waits for lock, after this first node done with bootstrap, it releases the lock and second node get it and start to do the same thing, since it didnt re-check if node flag is already ".done". Which lead to multiple runs of "run_once" commands.
Changed in kolla-mesos: | |
assignee: | nobody → Proskurin Kirill (kproskurin) |
Changed in kolla-mesos: | |
importance: | Undecided → High |
To post a comment you must log in.
Fix proposed to branch: master /review. openstack. org/291693
Review: https:/