Comment 0 for bug 1572968

Revision history for this message
Leontii Istomin (listomin) wrote :

Detailed bug description:
during boot_attach_and_delete_server_with_secgroups (http://paste.openstack.org/show/494970/) rally scenario we got the following error: http://paste.openstack.org/show/494926/
uuid of the instance: http://paste.openstack.org/show/494973/
We have found that nova-conductor wait responce from nova-scheduller during 60 seconds: http://paste.openstack.org/show/494957/
But nova-sheduller received the request only after 40 minutes and processed the request during 5 seconds: http://paste.openstack.org/show/494968/
Steps to reproduce:
1. deploy Fuel 9.0-217
2. apply fixes for succsessfull deployment due the following bugs:
https://bugs.launchpad.net/fuel/+bug/1543233 (regen repo and build butstrap image)
https://bugs.launchpad.net/fuel/+bug/1566974
https://bugs.launchpad.net/fuel/+bug/1570509
https://bugs.launchpad.net/fuel/+bug/1569859
restart nailgun and receiverd
3. boot 201 nodes in bootstrap
4. deploy env with 3 controllers, 20 computes+Ceph, 168 computes
5. replace rabbitmq to zeromq (http://perestroika-repo-tst.infra.mirantis.net/review/CR-19836/mos-repos/ubuntu/9.0/pool/main/p/python-oslo.messaging/)
6. perform rally tests

Expected results:
All vms will be booted successfully during the test
Actual result:
Some instances was failed to boot (43 from 1000)
Reproducibility:
didn't try to repro
Workaround:
not found yet
Impact:
Some instances can';t be started
Description of the environment:
- Operation system: ubuntu
- Versions of components: mos9.0
- Reference architecture: 3 controllers, 20 computes+Ceph, 168 computes
- Network model: vxlan+DVR
- Related projects installed: replaced rabbitmq by zeromq
Additional information:
Diagnostic snapshot will be soon