When doing upgrade recoverable checks are leaving the cluster in a unknown state
Bug #1614907 reported by
Sofer Athlan-Guyot
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
High
|
Sofer Athlan-Guyot |
Bug Description
Hi,
when doing the upgrade, numerous static checks are done during the major pacemaker upgrade step. They arrive in the script at various moments, like the check on rpm-python package, the disk size left of the bootstrap node and so on.
All those checks, if they fail, leave the cluster in more or less an unknown state. One has to go to the controller check what happen and put the cluster back into shape, fix the detected error and then maybe be able to upgrade again.
This is less than optimal situation.
A better way would be for all those tests to happen at the beginning of the upgrade. Then the operator would only have to fix the detected issue and re-run the upgrade again.
Changed in tripleo: | |
milestone: | none → newton-3 |
importance: | Undecided → High |
Changed in tripleo: | |
milestone: | newton-3 → newton-rc1 |
tags: | removed: update-bugs |
Changed in tripleo: | |
status: | In Progress → Fix Released |
To post a comment you must log in.
After re-reading the code again, they all happen before any serious change. But it would be nice to refactor them to make it obvious.