SM provision failures on the computes due to rabbitmq error
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Juniper Openstack | Status tracked in Trunk | | |
Trunk | Fix Committed | High | Ranjeet R |
Bug Description
Observed with mainline-3064 (Mitaka) on an OpenStack/Contrail HA setup.
root@servermana
+------
| id | status | ip_address | mac_address |
+------
| server1 | provision_completed | 10.0.0.4 | 02:A7:E6:35:4B:B3 |
| server4 | provision_completed | 10.0.0.7 | 02:E3:91:A6:9A:16 |
| server5 | provision_completed | 10.0.0.8 | 02:50:D8:3D:B2:B8 |
| server8 | provision_failed | 10.0.0.11 | 02:DE:FF:F8:2C:01 |
| server9 | provision_failed | 10.0.0.12 | 02:D1:98:5C:63:7A |
| server2 | provision_completed | 10.0.0.5 | 02:3A:3E:7D:18:64 |
| server6 | provision_completed | 10.0.0.9 | 02:F3:80:97:DC:DA |
| server7 | provision_completed | 10.0.0.10 | 02:1F:A5:19:FD:DC |
| server3 | provision_completed | 10.0.0.6 | 02:4B:F2:39:B6:71 |
+------
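A minimal sketch for pulling the failed nodes out of a status table like the one above. It assumes the table has been saved as plain text; the function name `failed_servers` and the `SAMPLE` string are illustrative only and are not part of Server Manager.

```python
def failed_servers(table_text):
    """Return (id, ip_address) pairs for rows whose status is provision_failed."""
    failed = []
    for line in table_text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # skip "+------" borders and shell prompt lines
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        if len(cells) < 3 or cells[0] == "id":
            continue  # skip the header row and malformed rows
        server_id, status, ip_address = cells[0], cells[1], cells[2]
        if status == "provision_failed":
            failed.append((server_id, ip_address))
    return failed


# Hypothetical sample: three rows copied from the table in this report.
SAMPLE = """\
+------
| id | status | ip_address | mac_address |
+------
| server7 | provision_completed | 10.0.0.10 | 02:1F:A5:19:FD:DC |
| server8 | provision_failed | 10.0.0.11 | 02:DE:FF:F8:2C:01 |
| server9 | provision_failed | 10.0.0.12 | 02:D1:98:5C:63:7A |
+------
"""

if __name__ == "__main__":
    for server_id, ip_address in failed_servers(SAMPLE):
        print(server_id, ip_address)
```

In this report the two failed nodes are server8 and server9.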
root@servermana
+------
| id | fixed_ip_address | floating_ip_address | port_id |
+------
| 344338c9-
| 854b2880-
| 0cbaf9d9-
| 45444a52-
| 85094b65-
| acc94765-
| 72d25453-
| 8b354010-
| 69853b1e-
| f5d263bf-
+------
Please see /cs-shared/
RabbitMQ clustering appears to be failing. It looks like the "hamon" script attempts a restart, but this brings the services down and the cluster is not formed:
=INFO REPORT==== 1-May-2017::17:06:36 ===
accepting AMQP connection <0.1069.0> (192.168.10.22:51604 -> 192.168.10.21:5672)

=ERROR REPORT==== 1-May-2017::17:06:51 ===
** Generic server rabbit_node_monitor terminating
** Last message in was {'DOWN',#Ref<0.0.0.811>,process,
                           {rabbit,rabbit@server3ctrl},
                           normal}
** When Server state == {state,
    {state,
        {dict,2,16,16,8,80,48,
            {[],[],[],[],[],[],[],[],[],[],[],[],[],
             [],[],[]},
            {{[],[],[],[],[],[],[],[],[],[],
              [[{rabbit,rabbit@server2ctrl}|
                #Ref<0.0.0.798>],
               [{rabbit,rabbit@server3ctrl}|
                #Ref<0.0.0.811>]],
              [],[],[],[],[]}}},
        erlang},
    [],
    {state,
        {dict,0,16,16,8,80,48,
            {[],[],[],[],[],[],[],[],[],[],[],[],[],
             [],[],[]},
            {{[],[],[],[],[],[],[],[],[],[],[],[],[],
              [],[],[]}}},
        erlang},
    undefined,
    {erlang,#Ref<0.0.0.48590>},
    not_healing,
    <<231,60,193,202,20,180,42,137,93,180,223,195,105,
      11,7,107>>,
    [{rabbit@server2ctrl,
      <<58,225,231,48,0,209,95,224,239,221,27,58,
        246,135,70,143>>},
     {rabbit@server3ctrl,
      <<176,117,24,211,8,44,247,194,82,161,181,
        115,170,245,238,63>>}]}
** Reason for termination ==
** {bad_return_value,
    {error,
        {{badmatch,
            {error,
                [{<<19,139,74,182,204,170,56,54,219,165,24,35,209,214,81,
                    223>>,
                  "consoleauth"},
                 {root,none}],
                ["server2"]}},
         [{rabbit_exchange_type_topic,follow_down_get_path,2,[]},
          {rabbit_exchange_type_topic,'-remove_bindings/3-lc$^1/1-1-',1,[]},
          {rabbit_exchange_type_topic,remove_bindings,3,[]},
          {rabbit_binding,x_callback,4,[]},
          {rabbit_binding,'-process_deletions/1-fun-0-',2,[]},
          {dict,map_bucket,2,[{file,"dict.erl"},{line,460}]},
          {dict,map_bkt_list,2,[{file,"dict.erl"},{line,456}]},
          {dict,map_bkt_list,2,[{file,"dict.erl"},{line,456}]}]}}}
/<email address hidden>
=CRASH REPORT==== 1-May-2017::17:04:23 ===
cr...
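A rough triage sketch for a RabbitMQ log like the excerpt above: it counts report headers by severity and lists which generic servers terminated. It assumes header lines of the form `=ERROR REPORT==== <timestamp> ===` with an unbroken timestamp, as RabbitMQ writes them to its log file; the function name `summarize_reports` and the `SAMPLE` text are hypothetical.

```python
import re
from collections import Counter

# Header of each report, e.g. "=ERROR REPORT==== 1-May-2017::17:06:51 ==="
REPORT_HEADER = re.compile(r"^=(\w+) REPORT==== (\S+) ===$")
# First line of a gen_server crash, e.g. "** Generic server foo terminating"
TERMINATING = re.compile(r"^\*\* Generic server (\S+) terminating$")


def summarize_reports(log_text):
    """Return (Counter of report kinds, list of terminated server names)."""
    counts = Counter()
    terminated = []
    for raw_line in log_text.splitlines():
        line = raw_line.strip()
        header = REPORT_HEADER.match(line)
        if header:
            counts[header.group(1)] += 1
            continue
        crash = TERMINATING.match(line)
        if crash:
            terminated.append(crash.group(1))
    return counts, terminated


# Hypothetical sample built from the header lines quoted in this report.
SAMPLE = """\
=INFO REPORT==== 1-May-2017::17:06:36 ===
accepting AMQP connection <0.1069.0> (192.168.10.22:51604 -> 192.168.10.21:5672)

=ERROR REPORT==== 1-May-2017::17:06:51 ===
** Generic server rabbit_node_monitor terminating

=CRASH REPORT==== 1-May-2017::17:04:23 ===
"""

if __name__ == "__main__":
    counts, terminated = summarize_reports(SAMPLE)
    print(dict(counts))
    print(terminated)
```

Running this against the full log from each controller would show whether rabbit_node_monitor is crashing on every node or only on one.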