{"build_id": "2014-05-12_11-37-35", "mirantis": "yes", "build_number": "194", "ostf_sha": "cdb075090b752246a9c43db3e918c42f645b5873", "nailgun_sha": "4477ba3a6efc4379a6509386e7a9e2e6ae832041", "production": "docker", "api": "1.0", "fuelmain_sha": "97d7f6d5461db3afc27f58160cf9f6985230d255", "astute_sha": "5813d9b537ba6ac95f668321c682f339aac57e05", "release": "5.0", "fuellib_sha": "ff4e0182a94f9b17e5a02bcc65faaf4452a0ad35"}
Steps to reproduce:
1. Create env - Centos, 3 Controllers, Neutron Vlan.
2. Shutdown primary controller
3. Verify networks
Network Verification failed:
2014-05-12 13:17:25 INFO
[398] Casting message to fuel: {"method"=>"verify_networks_resp", "args"=>{"task_uuid"=>"77387ab2-2412-4261-9f3e-ab7b4874f27a", "status"=>"error", "error"=>"Error occurred while running method 'verify_networks'. Inspect Orchestrator logs for the details."}}
2014-05-12 13:17:25 ERR
[398] Error running RPC method verify_networks: 77387ab2-2412-4261-9f3e-ab7b4874f27a: MCollective agents '1' didn't respond within the allotted time.
, trace: ["/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/mclient.rb:114:in `check_results_with_retries'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/mclient.rb:62:in `method_missing'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/network.rb:78:in `block in start_frame_listeners'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/network.rb:71:in `each'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/network.rb:71:in `start_frame_listeners'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/network.rb:39:in `check_network'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/orchestrator.rb:173:in `verify_networks'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/dispatcher.rb:114:in `verify_networks'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/server.rb:126:in `dispatch_message'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/server.rb:89:in `block in dispatch'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/task_queue.rb:64:in `call'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/task_queue.rb:64:in `block in each'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/task_queue.rb:56:in `each'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/task_queue.rb:56:in `each'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/server.rb:87:in `each_with_index'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/server.rb:87:in `dispatch'", "/usr/lib64/ruby/gems/2.1.0/gems/astute-0.0.2/lib/astute/server/server.rb:72:in `block in perform_main_job'"]
2014-05-12 13:17:25 ERR
[398] MCollective agents '1' didn't respond within the allotted time.
2014-05-12 13:15:23 DEBUG
[398] Retry #5 to run mcollective agent on nodes: '1'
2014-05-12 13:13:20 DEBUG
[398] Retry #4 to run mcollective agent on nodes: '1'
2014-05-12 13:11:17 DEBUG
[398] Retry #3 to run mcollective agent on nodes: '1'
2014-05-12 13:09:15 DEBUG
[398] Retry #2 to run mcollective agent on nodes: '1'
2014-05-12 13:07:13 DEBUG
[398] Retry #1 to run mcollective agent on nodes: '1'
It will happen not only after primary controller shutdown, but also if any node goes offline.
I think about adding warning about that some nodes of cluster if offline, but do not send them to network verification task.
Also it would be cool if titles for bugs was a bit more descriptive.