Im having this problem with the deployment of Kolla-Ansible Openstack
TASK [nova : Waiting for nova-compute service up] *****************************************************************
FAILED - RETRYING: Waiting for nova-compute service up (20 retries left).
...
FAILED - RETRYING: Waiting for nova-compute service up (1 retries left).
fatal: [172.30.220.3 -> 172.30.220.3]: FAILED! => {"attempts": 20, "changed": false, "cmd": ["docker", "exec", "kolla_toolbox", "openstack", "--os-interface", "internal", "--os-auth-url", "http://172.30.230.3:35357", "--os-identity-api-version", "3", "--os-project-domain-name", "default", "--os-tenant-name", "admin", "--os-username", "admin", "--os-password", "3PxtKnvjKDTbPg2QT3llwig08efLoAgkdEY5VVoY", "--os-user-domain-name", "default", "compute", "service", "list", "-f", "json", "--service", "nova-compute"], "delta": "0:00:02.405043", "end": "2017-10-04 16:21:35.742909", "failed": true, "rc": 0, "start": "2017-10-04 16:21:33.337866", "stderr": "", "stderr_lines": [], "stdout": "[]", "stdout_lines": ["[]"]}
In nova-api.log it seems that nova-api cant get a response from the nova-compute service because i execute the same command for nova-conductor and it works fine.
---
docker exec kolla_toolbox openstack --os-interface internal --os-auth-url http://172.30.230.3:35357 --os-identity-api-version 3 --os-project-domain-name default --os-tenant-name admin --os-username admin --os-password 3PxtKnvjKDTbPg2QT3llwig08efLoAgkdEY5VVoY --os-user-domain-name default compute service list -f json --service nova-compute
[]
docker exec kolla_toolbox openstack --os-interface internal --os-auth-url http://172.30.230.3:35357 --os-identity-api-version 3 --os-project-domain-name default --os-tenant-name admin --os-username admin --os-password 3PxtKnvjKDTbPg2QT3llwig08efLoAgkdEY5VVoY --os-user-domain-name default compute service list -f json --service nova-conductor
[
{
"Status": "enabled",
"Binary": "nova-conductor",
"Zone": "internal",
"State": "down",
"Host": "server1.domain.local",
"Updated At": null,
"ID": 10
},
{
"Status": "enabled",
"Binary": "nova-conductor",
"Zone": "internal",
"State": "down",
"Host": "server2.domain.local",
"Updated At": null,
"ID": 12
}
]
---
In the nova-api.log
---
## nova-conductor ##
2017-10-04 17:09:34.938 39 DEBUG nova.osapi_compute.wsgi.server [req-5a3572f7-6ce6-42e2-8715-6c10b9ede9db - - - - -] (39) accepted ('172.30.230.3', 44344) server /usr/lib/python2.7/site-packages/eventlet/wsgi.py:883
2017-10-04 17:09:35.160 39 DEBUG nova.api.openstack.wsgi [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Calling method '<bound method ServiceController.index of <nova.api.openstack.compute.services.ServiceController object at 0x7145650>>' _process_stack /usr/lib/python2.7/site-packages/nova/api/openstack/wsgi.py:612
2017-10-04 17:09:35.165 39 DEBUG oslo_concurrency.lockutils [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "00000000-0000-0000-0000-000000000000" acquired by "nova.context.get_or_set_cached_cell_and_set_connections" :: waited 0.000s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:270
2017-10-04 17:09:35.165 39 DEBUG oslo_concurrency.lockutils [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "00000000-0000-0000-0000-000000000000" released by "nova.context.get_or_set_cached_cell_and_set_connections" :: held 0.001s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:282
2017-10-04 17:09:35.193 39 DEBUG oslo_concurrency.lockutils [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "02be5acb-a4d9-4b89-8245-8a1fc689012c" acquired by "nova.context.get_or_set_cached_cell_and_set_connections" :: waited 0.000s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:270
2017-10-04 17:09:35.194 39 DEBUG oslo_concurrency.lockutils [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "02be5acb-a4d9-4b89-8245-8a1fc689012c" released by "nova.context.get_or_set_cached_cell_and_set_connections" :: held 0.001s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:282
2017-10-04 17:09:35.221 39 DEBUG nova.servicegroup.drivers.db [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Seems service nova-conductor on host server1.domain.local is down. Last heartbeat was 2017-10-04 19:16:42. Elapsed time is 3173.221375 is_up /usr/lib/python2.7/site-packages/nova/servicegroup/drivers/db.py:79
2017-10-04 17:09:35.222 39 DEBUG nova.servicegroup.drivers.db [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Seems service nova-conductor on host server2.domain.local is down. Last heartbeat was 2017-10-04 19:16:42. Elapsed time is 3173.222094 is_up /usr/lib/python2.7/site-packages/nova/servicegroup/drivers/db.py:79
2017-10-04 17:09:35.223 39 INFO nova.osapi_compute.wsgi.server [req-9605b348-0260-4c49-9826-9ff726b75138 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] 172.30.230.3 "GET /v2.1/b75d457f41d848858717c7ab443820ee/os-services?binary=nova-conductor HTTP/1.1" status: 200 len: 766 time: 0.2835221
## nova-compute ##
2017-10-04 17:11:05.550 39 DEBUG nova.osapi_compute.wsgi.server [req-5a3572f7-6ce6-42e2-8715-6c10b9ede9db - - - - -] (39) accepted ('172.30.230.3', 44446) server /usr/lib/python2.7/site-packages/eventlet/wsgi.py:883
2017-10-04 17:11:05.625 39 DEBUG nova.api.openstack.wsgi [req-fcc097d5-a1ac-4a55-9bfb-3d5b6f42d0c5 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Calling method '<bound method ServiceController.index of <nova.api.openstack.compute.services.ServiceController object at 0x7145650>>' _process_stack /usr/lib/python2.7/site-packages/nova/api/openstack/wsgi.py:612
2017-10-04 17:11:05.630 39 DEBUG oslo_concurrency.lockutils [req-fcc097d5-a1ac-4a55-9bfb-3d5b6f42d0c5 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "00000000-0000-0000-0000-000000000000" acquired by "nova.context.get_or_set_cached_cell_and_set_connections" :: waited 0.000s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:270
2017-10-04 17:11:05.631 39 DEBUG oslo_concurrency.lockutils [req-fcc097d5-a1ac-4a55-9bfb-3d5b6f42d0c5 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "00000000-0000-0000-0000-000000000000" released by "nova.context.get_or_set_cached_cell_and_set_connections" :: held 0.001s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:282
2017-10-04 17:11:05.656 39 DEBUG oslo_concurrency.lockutils [req-fcc097d5-a1ac-4a55-9bfb-3d5b6f42d0c5 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "02be5acb-a4d9-4b89-8245-8a1fc689012c" acquired by "nova.context.get_or_set_cached_cell_and_set_connections" :: waited 0.000s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:270
2017-10-04 17:11:05.656 39 DEBUG oslo_concurrency.lockutils [req-fcc097d5-a1ac-4a55-9bfb-3d5b6f42d0c5 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] Lock "02be5acb-a4d9-4b89-8245-8a1fc689012c" released by "nova.context.get_or_set_cached_cell_and_set_connections" :: held 0.001s inner /usr/lib/python2.7/site-packages/oslo_concurrency/lockutils.py:282
2017-10-04 17:11:05.682 39 INFO nova.osapi_compute.wsgi.server [req-fcc097d5-a1ac-4a55-9bfb-3d5b6f42d0c5 e3c98600a43841b88aad3ff06808a36f b75d457f41d848858717c7ab443820ee - default default] 172.30.230.3 "GET /v2.1/b75d457f41d848858717c7ab443820ee/os-services?binary=nova-compute HTTP/1.1" status: 200 len: 414 time: 0.1309791
It looks that im not getting heartbeats from nova-compute service
This uses to be an issue with nova-compute connecting to rabbitmq or cells not created properly, is normal that conductor appears in service-list and compute not if is not synced into main cell.
please share logs from nova-compute. Also question, what release? 5.0.0 from pip or master | stable/pike from github? Also what distros/version using?