Cannot Scale to more than 116 VMS and subnets
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Fix Released
|
High
|
Sai Sindhur Malleni |
Bug Description
Description:
During scale testing, we are unable to create more than 116 subnet and VMs due to some kernel params limitations on the dnsmasq processes.
We are using an OpenStack setup with 1 controller and 11 compute nodes. We are executing the following usecase
1. Create a network
2. Create a subnet
3. Boot an instance on this subnet
We do the above sequence of operations 500 times at a concurrency of 8.
Even after several attempts we are unable to scale past 116 VMs (each VM is on its own subnet). 116 seems to be the hard limit. The port never transitionas into active as even though the VIF Plugging happens, it fails the provioning block (DHCP), Since Ml2/ODL makes use of the neutron DHCP agent for DHCP, on looking in the DHCP agent logs we see
2017-09-13 21:45:38.124 91663 ERROR neutron.
dnsmasq: failed to create inotify: Too many open files
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.124 91663 ERROR neutron.
2017-09-13 21:45:38.260 91663 ERROR neutron.
dnsmasq: failed to create inotify: Too many open files
Based on https:/
Environment:
Pike
RHEL 7.4
Additional info:
Additional details can be found in:
https:/
https:/
Changed in tripleo: | |
milestone: | none → queens-1 |
importance: | Undecided → High |
status: | New → Triaged |
tags: | added: networking pike-backport-potential |
Fix proposed to branch: master /review. openstack. org/505381
Review: https:/