Mcollective can be restarted after the deployment was started
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Artem Roma |
Bug Description
Hi,
I have analyzed the log files in the job #148. Since the environment is down, some of log files are lost.
https:/
Astute starts deployment tasks _before_ mcollective is configured on the slave nodes.
1. Deployment is started
2. The node is set up and rebooted
3. Astute starts ntpdate update task (https:/
4. Fuel agent is started by cron, FA gets node id from the master and restarts mcollective (https:/
5. Deployment fails.
Mcollective is reconfigured after 10 minutes in average: http://
Best regarsd,
Kyrylo
Changed in fuel: | |
status: | New → Confirmed |
importance: | Undecided → Critical |
assignee: | nobody → Fuel Core Team (fuel-core) |
milestone: | none → 8.0 |
tags: | added: area-python |
Changed in fuel: | |
assignee: | Fuel Core Team (fuel-core) → Fuel Python Team (fuel-python) |
importance: | Critical → High |
tags: | added: team-bugfix |
tags: |
added: area-library removed: area-python |
tags: |
added: area-python removed: area-library |
Changed in fuel: | |
assignee: | Fuel Python Team (fuel-python) → Artem Roma (aroma-x) |
Changed in fuel: | |
status: | Confirmed → In Progress |
Changed in fuel: | |
status: | In Progress → Fix Committed |
I believe that we are missing fuel nailgun agent run on every server boot (we should start it right away and not to wait for cron to start our script).
We whether can use @reboot clause in cron script (if it's supported and does what's needed), or have script run by /etc/rc.d. We should prevent two copies of script running at the same time though.