[bvt]After deployment nailgun says that some controllers are offline, that caused ostf failures
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Fuel for OpenStack | Fix Released | High | Aleksandr Didenko |
Bug Description
{"build_id": "2015-03-
Sometimes the OSTF test "Check RabbitMQ is available" fails with the error:
Number of controllers is not equal to number of cluster nodes
It seems reasonable to add a timeout on:
if len(self.
in fuel-ostf/
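The timeout suggested above could look roughly like the sketch below. It is only an illustration of polling before failing the check: `wait_for`, `check_rabbit_cluster`, `get_running_nodes` and the timeout values are hypothetical names and numbers, not the actual fuel-ostf code.

```python
import time


def wait_for(predicate, timeout=60, interval=5):
    """Poll predicate() until it returns True or `timeout` seconds pass."""
    deadline = time.monotonic() + timeout
    while True:
        if predicate():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


def check_rabbit_cluster(get_running_nodes, controllers, timeout=60, interval=5):
    """Hypothetical check: succeed once the number of running RabbitMQ
    nodes equals the number of controllers, instead of failing on the
    first mismatch."""
    expected = len(controllers)
    return wait_for(
        lambda: len(get_running_nodes()) == expected,
        timeout=timeout,
        interval=interval,
    )
```

Here `get_running_nodes` stands for whatever callable returns the current list of running RabbitMQ nodes. The check fails only if the counts still differ after the timeout.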
Paste of error:
http://
Sometimes after deployment nailgun says that some controllers are offline. As a result we fail on different tests: sometimes with an error like "can not set proxy" with the exception "There is no online controllers", sometimes like here:
[{disc,['rabbit@node-2','rabbit@node-4','rabbit@node-5']}]}, nodes,['rabbit@node-4','rabbit@node-5','rabbit@node-2']}, name,<<"<email address hidden>">>},
jenkins-product.srt.mirantis.net:8080/job/6.1.centos.bvt_1/176/console
id | status | name | cluster | ip | mac | roles | pending_roles | online | group_id
---|---|---|---|---|---|---|---|---|---
3 | ready | slave-04_compute | 1 | 10.109.20.6 | 64:db:c0:0d:71:34 | compute | | True | 1
1 | ready | slave-05_compute | 1 | 10.109.20.7 | 64:be:a4:d3:47:73 | compute | | True | 1
4 | ready | slave-01_controller | 1 | 10.109.20.3 | 64:33:fe:e0:61:aa | controller | | False | 1
5 | ready | slave-02_controller | 1 | 10.109.20.4 | 64:2b:8e:a4:2a:66 | controller | | True | 1
2 | ready | slave-03_controller | 1 | 10.109.20.5 | 64:29:63:f4:91:13 | controller | | True | 1
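To spot this state quickly, the node listing above can be filtered for controllers that nailgun reports as offline. The following is a minimal sketch that assumes the pipe-separated layout shown in the table. The function name `offline_controllers` is made up for illustration and is not part of Fuel.

```python
def offline_controllers(table_text):
    """Return names of nodes that have the controller role and online == False.

    Expects pipe-separated text whose first non-empty line is the header
    (id | status | name | ... | roles | ... | online | group_id).
    """
    lines = [line for line in table_text.splitlines() if line.strip()]
    header = [cell.strip() for cell in lines[0].split("|")]
    result = []
    for line in lines[1:]:
        # Skip separator rows made only of dashes and pipes.
        if set(line.strip()) <= set("-|"):
            continue
        cells = [cell.strip() for cell in line.split("|")]
        row = dict(zip(header, cells))
        roles = [r.strip() for r in row.get("roles", "").split(",")]
        if "controller" in roles and row.get("online") == "False":
            result.append(row.get("name"))
    return result
```

Applied to the listing above, this returns only `slave-01_controller`, the one controller marked `False` in the `online` column.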
[root@nailgun ostf]# ssh node-1
Warning: Permanently added 'node-1' (RSA) to the list of known hosts.
Last login: Wed Mar 11 13:03:57 2015 from 10.109.20.2
[root@node-1 ~]# rabbitmqctl cluster_status
-bash: rabbitmqctl: command not found
[root@node-1 ~]# exit
logout
Connection to node-1 closed.
[root@nailgun ostf]# ssh node-2
Warning: Permanently added 'node-2' (RSA) to the list of known hosts.
Last login: Wed Mar 11 13:03:29 2015 from 10.109.20.2
[root@node-2 ~]# rabbitmqctl cluster_status
Cluster status of node 'rabbit@node-2' ...
[{nodes,
{running_
{cluster_
{partitions,[]}]
...done.
[root@node-2 ~]# exit
logout
Connection to node-2 closed.
[root@nailgun ostf]# ssh node-5
Warning: Permanently added 'node-5' (RSA) to the list of known hosts.
Last login: Wed Mar 11 13:03:49 2015 from 10.109.20.2
[root@node-5 ~]# exit
http://
There was no revert, just a deployment and an OSTF run, so it is not clear why nailgun marks some controllers as offline.