Vanilla cluster can't decommission node if replication factor < 3
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Sahara |
Invalid
|
Undecided
|
Unassigned |
Bug Description
Steps to reproduce:
1. Start vanilla cluster with replication factor 2 and with next structure:
NN+JT x 1 node
SNN x 1 node
TT+DN x 2 nodes
TT x 1 node
DN x 1 node
2. Start scaling with next parameters:
decommission TT x 1 node
decommission DN x 1 node
add TT x 1 node
add DN x 1 node
Cluster receives state "Error".
log in master node:
2014-02-20 11:57:44,558 INFO org.apache.
2014-02-20 11:57:44,558 INFO org.apache.
Changed in sahara: | |
status: | New → Incomplete |
I am unable to reproduce this error. I am using a trunk version of Sahara (commit c1d6d02ab7b6c8e 50bbe637d9f6883 2f19dbcdc6) with a stable/icehouse version of devstack. My image is ubuntu vanilla hdp1 created using sahara- image-elements.
Step 1, no problems. Step 2, I am able to remove and then add all the nodes listed and the cluster returns to "Active" status.