I tried to test more complicated case, it looks that images storage did not survived interruption of re-balancing.
Steps:
1. Created Environment:
Ubuntu 14.04.1, Neutron VLAN
1 Controller + Ceph OSD
1 Compute + Ceph OSD
Replication Factor = 2
3 nodes left not allocated.
All this were launched on Virtualbox on my desktop.
2. Filled Glance by Images (they took 70% of Ceph storage)
# df -h
...
/dev/sdc3 64G 44G 21G 69% /var/lib/ceph/osd/ceph-0
/dev/sdb3 64G 45G 19G 71% /var/lib/ceph/osd/ceph-1
Also calculated and saved md5 sums of uploaded images.
3. Then i added 3 new OSD nodes.
4. Deployment of additional nodes passed OK.
5. After new OSD nodes were deployed - Ceph rebalancing started. It heavily loaded hardware resources host machine.
6. For test after some time of re-balancing i have switched-off additional OSD nodes.
7. Check Ceph status
# ceph -s
cluster 1f5161b7-a500-4757-a486-0b6d44430a15
health HEALTH_WARN 481 pgs degraded; 25 pgs down; 25 pgs peering; 734 pgs stale; 25 pgs stuck inactive; 734 pgs stuck stale; 506 pgs stuck unclean; recovery 1525/20738 objects degraded (7.354%)
monmap e1: 1 mons at {node-1=192.168.0.3:6789/0}, election epoch 1, quorum 0 node-1
osdmap e131: 10 osds: 4 up, 4 in
pgmap v5033: 2496 pgs, 12 pools, 82212 MB data, 10369 objects
133 GB used, 119 GB / 253 GB avail 1525/20738 objects degraded (7.354%) 253 stale+active+clean 1737 active+clean 25 down+peering 481 stale+active+degraded
8. List images - ok.
Try to download image:
# glance image-download --file ~/dummy${i}.iso dummy${i}.iso
<html><body><h1>504 Gateway Time-out</h1>
The server didn't respond in time.
</body></html>
(HTTP N/A)
Try to list images one more time:
# glance image-list
<html><body><h1>503 Service Unavailable</h1>
No server is available to handle this request.
</body></html>
(HTTP 503)
Horizon also refuses to retrieve images list.
Does it works properly in you environment?
I tried to test more complicated case, it looks that images storage did not survived interruption of re-balancing.
Steps:
1. Created Environment:
Ubuntu 14.04.1, Neutron VLAN
1 Controller + Ceph OSD
1 Compute + Ceph OSD
Replication Factor = 2
3 nodes left not allocated.
All this were launched on Virtualbox on my desktop.
2. Filled Glance by Images (they took 70% of Ceph storage) ceph/osd/ ceph-0 ceph/osd/ ceph-1
# df -h
...
/dev/sdc3 64G 44G 21G 69% /var/lib/
/dev/sdb3 64G 45G 19G 71% /var/lib/
Also calculated and saved md5 sums of uploaded images.
3. Then i added 3 new OSD nodes. a500-4757- a486-0b6d44430a 15 192.168. 0.3:6789/ 0}, election epoch 1, quorum 0 node-1
1525/ 20738 objects degraded (7.354%)
253 stale+active+clean
1737 active+clean
25 down+peering
481 stale+active+ degraded
4. Deployment of additional nodes passed OK.
5. After new OSD nodes were deployed - Ceph rebalancing started. It heavily loaded hardware resources host machine.
6. For test after some time of re-balancing i have switched-off additional OSD nodes.
7. Check Ceph status
# ceph -s
cluster 1f5161b7-
health HEALTH_WARN 481 pgs degraded; 25 pgs down; 25 pgs peering; 734 pgs stale; 25 pgs stuck inactive; 734 pgs stuck stale; 506 pgs stuck unclean; recovery 1525/20738 objects degraded (7.354%)
monmap e1: 1 mons at {node-1=
osdmap e131: 10 osds: 4 up, 4 in
pgmap v5033: 2496 pgs, 12 pools, 82212 MB data, 10369 objects
133 GB used, 119 GB / 253 GB avail
8. List images - ok.
Try to download image:
# glance image-download --file ~/dummy${i}.iso dummy${i}.iso
<html><body><h1>504 Gateway Time-out</h1>
The server didn't respond in time.
</body></html>
(HTTP N/A)
Try to list images one more time:
# glance image-list
<html><body><h1>503 Service Unavailable</h1>
No server is available to handle this request.
</body></html>
(HTTP 503)
Horizon also refuses to retrieve images list.
VERSION: version: "2014.2.2-6.1" 24_10-19- 44" 20cc7819427662f a244e150177" fuelclient_ sha: "e19f1b65792f84 c4a18b5a9473f85 ef3ba172fce" 3376864e8e8dabf e873d40450a" afd5d1540c1cadb d913c0c8c14" 6c840c8333d8f32 ec8cfa65c1f" 000af2de3ccdb4a a8bb766ce93"
feature_groups:
- mirantis
production: "docker"
release: "6.1"
openstack_
api: "1.0"
build_number: "461"
build_id: "2015-05-
nailgun_sha: "76441596e4fe64
python-
astute_sha: "0bd72c72369e74
fuel-library_sha: "889c2534ceadf8
fuel-ostf_sha: "9a5f55602c260d
fuelmain_sha: "5c8ebddf64ea93