N->O Upgrade, ochestration is broken.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
High
|
Sofer Athlan-Guyot |
Bug Description
Hi,
deploying a custom role Novacontrol, with the nova related API failed with this error:
TASK [Setup cell_v2 (map cell0)] *******
fatal: [localhost]: FAILED! => {"changed": true, "cmd": ["nova-manage", "cell_v2", "map_cell0"], "delta": "0:02:12.569490", "end": "2017-02-28 15:41:23.802908", "failed": true, "rc": 1, "start": "2017-02-28 15:39:11.233418", "stderr": "", "stdout": "An error has occurred:
"DBConnectionError: (pymysql.
to retry, use: --limit @/var/lib/
We discovered that the Novacontrol node was at step5, while the Controller was at step3. From the logs:
I prefix Novacontrol with N and controller logs with C:
- C: step0: Apr 03 09:08:55
- N: step0: Apr 03 09:07:36
- N: step1: Apr 03 09:08:27
- N: step2: Apr 03 09:08:52
- C: step1: Apr 03 09:13:56
- N: step3: Apr 03 09:14:24
- N: step4: Apr 03 09:14:40
- C: step2: Apr 03 09:15:01
- N: step5: Apr 03 09:17:28
- C: step3: Apr 03 09:20:48
- C: step4: never happened
- C: step5: never happened
So, it seems that contrary to what we claim there https:/
It may be that it "happened" to work in our test because we just using Controller/
1. compute are upgraded with their own mechanism;
2. Ceph are batched upgrade;
3. controller are step guaranty as they belong to the same role.
Changed in tripleo: | |
assignee: | nobody → Sofer Athlan-Guyot (sofer-athlan-guyot) |
status: | Confirmed → In Progress |
Changed in tripleo: | |
assignee: | Sofer Athlan-Guyot (sofer-athlan-guyot) → Marios Andreou (marios-b) |
Changed in tripleo: | |
importance: | Critical → High |
Changed in tripleo: | |
assignee: | Marios Andreou (marios-b) → Giulio Fidente (gfidente) |
Changed in tripleo: | |
assignee: | Giulio Fidente (gfidente) → Marios Andreou (marios-b) |
assignee: | Marios Andreou (marios-b) → Sofer Athlan-Guyot (sofer-athlan-guyot) |
Originally reported there https:/ /bugzilla. redhat. com/show_ bug.cgi? id=1427569