Unable to scale cluster beyond 49 nodes
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Magnum | Expired | Undecided | Unassigned |
Bug Description
I've got a cluster with enough resources to theoretically deploy 100 nodes. Every time I try to scale past 49, however, my cluster goes into UPDATE_FAILED status. The error in magnum-
Limits modified in OpenStack:
Heat maximum resources set to -1
Open file limit set to 99999 on all nodes/containers
Nova volume quota increased to 200
Nova instance quota increased to 150
Nova RAM quota increased to 5TB
Nova cores quota increased to 500
Neutron floating ip quota increased to 150
Neutron port quota increased to 200
>200 IP addresses available in networks used
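As a rough sanity check on the limits listed above, the node ceiling implied by each quota can be estimated with a few lines of Python. This is only a sketch: `max_nodes` is a hypothetical helper, not part of Magnum, Heat, or any OpenStack client, and the per-node figures (4 cores, 8 GB RAM, one volume, one floating IP, one port) are assumptions, since the report does not state the flavor or template used. Resources consumed by master nodes or other projects are ignored unless passed in as `reserved`.

```python
# Hypothetical helper for illustration; not an OpenStack API.
def max_nodes(quotas, per_node, reserved=None):
    """Return (ceiling, limiting_resource) implied by the given quotas.

    quotas   -- total quota per resource
    per_node -- ASSUMED consumption of each resource by one cluster node
    reserved -- optional amounts already used elsewhere (masters, other stacks)
    """
    reserved = reserved or {}
    ceilings = {}
    for name, need in per_node.items():
        if need <= 0:
            continue  # resource not consumed per node, so it cannot limit scaling
        available = quotas[name] - reserved.get(name, 0)
        ceilings[name] = available // need
    limiting = min(ceilings, key=ceilings.get)
    return ceilings[limiting], limiting


# Quotas as listed in the report (RAM converted to MB: 5 TB = 5 * 1024 * 1024 MB).
quotas = {
    "instances": 150,
    "cores": 500,
    "ram_mb": 5 * 1024 * 1024,
    "volumes": 200,
    "floating_ips": 150,
    "ports": 200,
}

# ASSUMED per-node usage; replace with the real flavor and template values.
per_node = {
    "instances": 1,
    "cores": 4,
    "ram_mb": 8192,
    "volumes": 1,
    "floating_ips": 1,
    "ports": 1,
}

ceiling, resource = max_nodes(quotas, per_node)
print(ceiling, resource)  # 125 cores, under the assumptions above
```

If the real per-node values give a ceiling close to 49, a quota is the likely cause. If the ceiling stays well above 49, as it does with these assumed values, the quotas are probably not the limiting factor.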
Cluster deploys successfully and all update operations are successful until attempting to scale from 48 to 49 nodes. Strangely, even though the cluster goes into UPDATE_FAILED status, the 49th VM does get created, and it does get assigned an IP address from the network pool.
The error appears to be a database error, but I'm not sure what to make of the error message. I've attached the outputs of the openstack stack resource commands that pinpoint the failure for review.
The heat-engine log might be more helpful. Do you also have enough floating IPs?