[nailgun] It is not possible to rerun deployment after exception in deployment task
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Roman Prykhodchenko |
Bug Description
VERSION:
feature_groups:
- mirantis
production: "docker"
release: "6.1"
api: "1.0"
build_number: "229"
build_id: "2015-03-
nailgun_sha: "64a3d380cefde4
python-
astute_sha: "631f96d5a09cc4
fuellib_sha: "345a98b34dd0cd
ostf_sha: "a4cf5f218c6aea
fuelmain_sha: "320b5f46fc1b27
Steps to reproduce:
1. deploy KVM Fuel master node
2. Start 3 KVM slaves
3. Reboot Supermicro
All KVMs and Supermicro in one admin network
Also KVMs and Supermicro have networks for other
4. Wait while all KVM slaves and Supermicro connected to Fuel Master
5. Create env with parameters:
3 KVMs: controller + mongo
networks on KVMs:
eth0: admin
eth1(All networks on this interface have vlan): public, management, storage
Supermicro: Compute + Ceph
eth0: empty
eth1(All networks on this interface have vlan, for admin network eth1 interface has native vlan): admin, public, management, storage
parameters:
virt_type=kvm
config_
release_name=centos
net_provider=
net_segment_
provision_
debug=true
nova_quota=true
volumes_lvm=false,
volumes_ceph=true,
images_ceph=true,
ephemeral_
objects_ceph=true,
osd_pool_size=1,
sahara=true,
murano=true,
ceilometer=true
6. Start deploy.
The progress of env deployment does not change(awlays equals to 0%). If provision method is cobbler, slave nodes (KVMs and Supermicro) don't reboot.
Changed in fuel: | |
importance: | Undecided → High |
milestone: | none → 6.1 |
Changed in fuel: | |
assignee: | nobody → Fuel Python Team (fuel-python) |
Changed in fuel: | |
assignee: | Fuel Python Team (fuel-python) → Aleksey Kasatkin (alekseyk-ru) |
Changed in fuel: | |
importance: | High → Medium |
summary: |
- Environment deployment does not started + Deployment hangs if there are not enought public IPs |
tags: | added: module-serialization |
Changed in fuel: | |
assignee: | Aleksey Kasatkin (alekseyk-ru) → Fuel Python Team (fuel-python) |
milestone: | 6.1 → 7.0 |
Changed in fuel: | |
importance: | Medium → High |
milestone: | 7.0 → 6.1 |
Changed in fuel: | |
assignee: | Roman Prykhodchenko (romcheg) → Kamil Sambor (ksambor) |
Changed in fuel: | |
status: | Fix Committed → Fix Released |
It was an error during deployment task. Seems smth was wrong with public IPs assigning:
2015-03-26 11:44:46.557 INFO [7f63a46a9740] (manager) Assigning IP for node '1' in network 'management' 1174-45fa- b09a-324706e7d1 35 1174-45fa- b09a-324706e7d1 35 cluster_id: 1 status: error neutron- gre-ha (id=1, mode=ha_compact)) status: from new to error
2015-03-26 11:44:46.557 INFO [7f63a46a9740] (manager) Assigning IP for node '2' in network 'management'
2015-03-26 11:44:46.557 INFO [7f63a46a9740] (manager) Assigning IP for node '3' in network 'management'
2015-03-26 11:44:46.557 INFO [7f63a46a9740] (manager) Assigning IP for node '4' in network 'management'
2015-03-26 11:44:46.584 DEBUG [7f63a46a9740] (task) Updating task: 770d10d1-
2015-03-26 11:44:46.585 DEBUG [7f63a46a9740] (task) Updating cluster status: 770d10d1-
2015-03-26 11:44:46.585 DEBUG [7f63a46a9740] (task) Updating cluster (centos-
Although, it's not clear yet what is the reason of the error and why tasks were not deleted from DB.