nailgun-agent hangs and node show false positive offline with fuel node command
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Invalid
|
Medium
|
Fuel Python (Deprecated) |
Bug Description
After issue command "fuel node", one ceph node was stated as "online False", but I can ssh to this node, and ceph health, ceph osd tree are all fine.
what I found is that the nailgun agent hangs:
root@node-
I, [2015-10-
I, [2015-10-
I, [2015-10-
I, [2015-10-
I, [2015-10-
I, [2015-10-
I, [2015-10-
I, [2015-10-
I, [2015-10-
I, [2015-10-
and some XFS, I/O error on this ceph node:
root@node-
[1062182.525661] XFS (nbd11): SB validate failed with error 22.
[1062182.839223] XFS (nbd11): SB validate failed with error 22.
[1062381.143688] XFS (nbd9): SB validate failed with error 22.
[1062381.455537] XFS (nbd9): SB validate failed with error 22.
[2646674.388851] end_request: I/O error, dev nbd1, sector 0
[2646674.528707] end_request: I/O error, dev nbd5, sector 0
[2646674.668831] end_request: I/O error, dev nbd6, sector 0
[2646674.810431] end_request: I/O error, dev nbd9, sector 0
[2646674.952581] end_request: I/O error, dev nbd10, sector 0
[2646675.096139] end_request: I/O error, dev nbd11, sector 0
[2646675.239707] end_request: I/O error, dev nbd12, sector 0
tags: | added: area-python |
Changed in fuel: | |
milestone: | none → 8.0 |
assignee: | nobody → Fuel Python Team (fuel-python) |
importance: | Undecided → Medium |
For more information, mcollective is working fine: 4:/var/ log# tail -f /var/log/ mcollective. log 28T15:31: 24.516765 #8291] DEBUG -- : rabbitmq.rb:66:in `on_hbfire' Publishing heartbeat to stomp:/ /mcollective@ 10.14.20. 2:61613: send_fire, {:curt= >1446046284. 5165255, :last_sleep= >30.49964284896 8506} 28T15:31: 47.750883 #8291] DEBUG -- : rabbitmq.rb:64:in `on_hbfire' Received heartbeat from stomp:/ /mcollective@ 10.14.20. 2:61613: receive_fire, {:curt= >1446046307. 7507} 28T15:31: 55.016810 #8291] DEBUG -- : rabbitmq.rb:66:in `on_hbfire' Publishing heartbeat to stomp:/ /mcollective@ 10.14.20. 2:61613: send_fire, {:curt= >1446046315. 016635, :last_sleep= >30.49959111213 684} 28T15:32: 17.251278 #8291] DEBUG -- : rabbitmq.rb:64:in `on_hbfire' Received heartbeat from stomp:/ /mcollective@ 10.14.20. 2:61613: receive_fire, {:curt= >1446046337. 2510948} 28T15:32: 25.516946 #8291] DEBUG -- : rabbitmq.rb:66:in `on_hbfire' Publishing heartbeat to stomp:/ /mcollective@ 10.14.20. 2:61613: send_fire, {:curt= >1446046345. 5167692, :last_sleep= >30.49960017204 2847} 28T15:32: 46.751646 #8291] DEBUG -- : rabbitmq.rb:64:in `on_hbfire' Received heartbeat from stomp:/ /mcollective@ 10.14.20. 2:61613: receive_fire, {:curt= >1446046366. 7514794} 28T15:32: 56.017065 #8291] DEBUG -- : rabbitmq.rb:66:in `on_hbfire' Publishing heartbeat to stomp:/ /mcollective@ 10.14.20. 2:61613: send_fire, {:curt= >1446046376. 0168877, :last_sleep= >30.49965715408 3252} 28T15:33: 16.252112 #8291] DEBUG -- : rabbitmq.rb:64:in `on_hbfire' Received heartbeat from stomp:/ /mcollective@ 10.14.20. 2:61613: receive_fire, {:curt= >1446046396. 2519305} 28T15:33: 26.517184 #8291] DEBUG -- : rabbitmq.rb:66:in `on_hbfire' Publishing heartbeat to stomp:/ /mcollective@ 10.14.20. 2:61613: send_fire, {:curt= >1446046406. 5170097, :last_sleep= >30.49966502189 6362} 28T15:33: 45.752556 #8291] DEBUG -- : rabbitmq.rb:64:in `on_hbfire' Received heartbeat from stomp:/ /mcollective@ 10.14.20. 2:61613: receive_fire, {:curt= >1446046425. 752374} 28T15:33: 57.017319 #8291] DEBUG -- : rabbitmq.rb:66:in `on_hbfire' Publishing heartbeat to stomp:/ /mcollective@ 10.14.20. 2:61613: send_fire, {:curt= >1446046437. 0171432, :last_sleep= >30.49962067604 065}
root@node-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-
D, [2015-10-