Pacemaker 'vip__public' can be stopped for a while when deploy finishes.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Vladimir Kuklin | ||
6.0.x |
Invalid
|
High
|
Unassigned |
Bug Description
Reproduced on different CI jobs: [1], [2]
When deploy finished (cluster marked as 'ready'), system tests are failed on first access to the cluster because slave nodes are inaccessible:
---------
Authorization Failed: Unable to establish connection to http://
---------
Also, Nailgun marks slaves as 'offline':
http://
But since several seconds after revert, nodes are 'online' and they weren't rebooted:
http://
Hosts where the tests were running were not under load.
Need to investigate this issue.
[1] smoke test 'deploy_
[2] system test 'deploy_
Changed in fuel: | |
importance: | Undecided → High |
Changed in fuel: | |
assignee: | MOS QA Team (mos-qa) → Dennis Dmitriev (ddmitriev) |
status: | New → In Progress |
tags: | added: non-release system-tests |
summary: |
- Pacemaker 'vip__public' can be stopped for a while when deploy finishes. + [system-tests] Pacemaker 'vip__public' can be stopped for a while when + deploy finishes. |
Changed in fuel: | |
assignee: | Dennis Dmitriev (ddmitriev) → Fuel Library Team (fuel-library) |
tags: | removed: non-release |
tags: | removed: fuel-ci system-tests |
summary: |
- [system-tests] Pacemaker 'vip__public' can be stopped for a while when - deploy finishes. + Pacemaker 'vip__public' can be stopped for a while when deploy finishes. |
Changed in fuel: | |
assignee: | Vladimir Kuklin (vkuklin) → Bogdan Dobrelya (bogdando) |
Changed in fuel: | |
assignee: | Bogdan Dobrelya (bogdando) → Vladimir Kuklin (vkuklin) |
tags: | added: on-verification |
Looks like issue was caused because pacemaker restart 'vip__public' :
=== /node-1. test.domain. local/pengine. log : 17T12:14: 30.186762+ 00:00 notice: notice: LogActions: Stop vip__public (node-1. test.domain. local)
2015-05-
=== CI 6.1/job/ 6.1.centos. smoke_nova/ 357/console : client. py:765 -- Get ID of a last created cluster 10.109. 6.2:5000/ v2.0/tokens
2015-05-17 12:15:59,477 - INFO fuel_web_
...
Authorization Failed: Unable to establish connection to http://
=== /node-1. test.domain. local/pengine. log : 17T12:16: 16.363780+ 00:00 notice: notice: LogActions: Start vip__public (node-1. test.domain. local)
2015-05-
We should cover this in system tests.