Default ulimit value for neutron containers (DHCP, L3 and OVS agents) limits a number of SDN entities in production environment

Bug #1824020 reported by Slawek Kaplonski
14
This bug affects 2 people
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
High
Slawek Kaplonski

Bug Description

Bug originally reported by Alex Stupnikov:

Description of problem:

TripleO sets ulimits for different containers since RHOSP 13 [1].
Default value for neutron services and nova-compute is 1024. This
default doesn't work well for nova-compute with huge number of attached
disks, so I have reported bug [2].

Today customer informed me that neutron services are ALSO AFFECTED:
he faced issues with DHCP agents after creating 250 tenant networks
(quite low number) and was able to solve them after increasing default
ulimit value by defining DockerNeutronDHCPAgentUlimit.

I have asked neutron developers to provide recommendations for defaults for the following parameters and was told to report the bug:

DockerNeutronDHCPAgentUlimit
DockerNeutronL3AgentUlimit
DockerOpenvswitchUlimit

[1] https://review.openstack.org/#/c/560991/
[2] https://bugzilla.redhat.com/show_bug.cgi?id=1693667

Changed in tripleo:
assignee: nobody → Slawek Kaplonski (slaweq)
status: New → In Progress
Revision history for this message
Alex Schultz (alex-schultz) wrote :

So for the record we lowered it to 1024 because rootwrap didn't like the larger default value. See Bug 1760471 So we to figure out a happy medium between 131072 and 1024

Changed in tripleo:
importance: Undecided → Medium
milestone: none → train-1
Revision history for this message
Slawek Kaplonski (slaweq) wrote :

Hi Alex,

Yes, it seems that we lowered it too much when fixing rootwrap issue.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (master)

Reviewed: https://review.openstack.org/651137
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=5d3fe630db336b54284a05720f5ef0d3fc8bd729
Submitter: Zuul
Branch: master

commit 5d3fe630db336b54284a05720f5ef0d3fc8bd729
Author: Slawek Kaplonski <email address hidden>
Date: Tue Apr 9 10:26:27 2019 +0200

    Increase default ulimit values for Neutron agents containers

    For containers where Neutron DHCP, L3 and OVS agents are running
    there is limit for number of opened files set.
    This limit was configured by default to 1024 which looks that is
    quite low and can cause problems even with relatively small number
    of networks or routers handled by agent.

    This patch increase default ulimit values for those containers to be
    16x higher than it was before. Such value should allow to handle much
    more routers or networks and shouldn't be too high for any system.

    Change-Id: Ibc8fa0c1ccd4f60ec7c876623a7262003197f67d
    Closes-bug: #1824020

Changed in tripleo:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/rocky)

Fix proposed to branch: stable/rocky
Review: https://review.openstack.org/651709

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/queens)

Fix proposed to branch: stable/queens
Review: https://review.openstack.org/651710

Changed in tripleo:
importance: Medium → High
tags: added: queens-backport-potential
tags: added: edge
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/rocky)

Reviewed: https://review.openstack.org/651709
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=178773f92b0baf28f9cec93e5e7527c510a10ac7
Submitter: Zuul
Branch: stable/rocky

commit 178773f92b0baf28f9cec93e5e7527c510a10ac7
Author: Slawek Kaplonski <email address hidden>
Date: Tue Apr 9 10:26:27 2019 +0200

    Increase default ulimit values for Neutron agents containers

    For containers where Neutron DHCP, L3 and OVS agents are running
    there is limit for number of opened files set.
    This limit was configured by default to 1024 which looks that is
    quite low and can cause problems even with relatively small number
    of networks or routers handled by agent.

    This patch increase default ulimit values for those containers to be
    16x higher than it was before. Such value should allow to handle much
    more routers or networks and shouldn't be too high for any system.

    Conflicts:
        deployment/neutron/neutron-ovs-agent-container-puppet.yaml

    Change-Id: Ibc8fa0c1ccd4f60ec7c876623a7262003197f67d
    Closes-bug: #1824020
    (cherry picked from commit 5d3fe630db336b54284a05720f5ef0d3fc8bd729)

tags: added: in-stable-rocky
tags: added: in-stable-queens
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/queens)

Reviewed: https://review.openstack.org/651710
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=bc30fb036947473c53515ff96e800d8e4e3468da
Submitter: Zuul
Branch: stable/queens

commit bc30fb036947473c53515ff96e800d8e4e3468da
Author: Slawek Kaplonski <email address hidden>
Date: Tue Apr 9 10:26:27 2019 +0200

    Increase default ulimit values for Neutron agents containers

    For containers where Neutron DHCP, L3 and OVS agents are running
    there is limit for number of opened files set.
    This limit was configured by default to 1024 which looks that is
    quite low and can cause problems even with relatively small number
    of networks or routers handled by agent.

    This patch increase default ulimit values for those containers to be
    16x higher than it was before. Such value should allow to handle much
    more routers or networks and shouldn't be too high for any system.

    Conflicts:
        deployment/neutron/neutron-ovs-agent-container-puppet.yaml

    Change-Id: Ibc8fa0c1ccd4f60ec7c876623a7262003197f67d
    Closes-bug: #1824020
    (cherry picked from commit 5d3fe630db336b54284a05720f5ef0d3fc8bd729)

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 10.5.0

This issue was fixed in the openstack/tripleo-heat-templates 10.5.0 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 9.4.0

This issue was fixed in the openstack/tripleo-heat-templates 9.4.0 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 8.4.0

This issue was fixed in the openstack/tripleo-heat-templates 8.4.0 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.