openvswitch-agent eating 100% cpu

Bug #1755605 reported by Ben Nemec
This bug affects 2 people
Affects   Status         Importance   Assigned to   Milestone
tripleo   Fix Released   High         Unassigned

Bug Description

I'm seeing this in my recent undercloud installs. For some reason the Neutron openvswitch-agent process is pegged at 100% CPU as soon as the undercloud is installed, before any overcloud has been deployed. I'll attach the log. I didn't notice anything obviously wrong in it, but maybe someone else will spot an issue.

Ben Nemec (bnemec) wrote :

I see this in the CI logs as well, so it is almost certainly affecting our job runtimes and resource usage:

system_u:system_r:neutron_t:s0 neutron 4115 ***98.0*** 1.2 313944 99980 ? Rs 20:02 74:08 /usr/bin/python2 /usr/bin/neutron-openvswitch-agent ...
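
For anyone who wants to scan their own logs for the same symptom, here is a minimal sketch that flags high-CPU processes in ps output like the line above. It assumes the column order LABEL USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND (as the line above appears to use) and that any emphasis markers around the %CPU value have been removed. The function names and the 90% threshold are made up for illustration.

```python
# Sketch: flag processes whose %CPU exceeds a threshold in ps output.
# Assumed column order (not confirmed by the report):
# LABEL USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND


def parse_ps_line(line):
    """Split one ps line into a dict; the command keeps its embedded spaces."""
    # maxsplit=11 leaves everything after the TIME column in one field.
    fields = line.split(None, 11)
    if len(fields) < 12:
        raise ValueError("unexpected ps line: %r" % line)
    _label, user, pid, cpu, _mem, _vsz, _rss, _tty, _stat, _start, cpu_time, command = fields
    return {
        "user": user,
        "pid": int(pid),
        "cpu_percent": float(cpu),
        "cpu_time": cpu_time,
        "command": command,
    }


def busy_processes(lines, threshold=90.0):
    """Return parsed records whose %CPU is at or above the threshold."""
    records = [parse_ps_line(line) for line in lines if line.strip()]
    return [rec for rec in records if rec["cpu_percent"] >= threshold]


# The line from the CI log above, with the emphasis markers stripped.
SAMPLE = (
    "system_u:system_r:neutron_t:s0 neutron 4115 98.0 1.2 313944 99980 ? "
    "Rs 20:02 74:08 /usr/bin/python2 /usr/bin/neutron-openvswitch-agent ..."
)

if __name__ == "__main__":
    for rec in busy_processes([SAMPLE]):
        print("%(pid)d %(cpu_percent).1f%% %(command)s" % rec)
```

Note that the %CPU column in ps is CPU time divided by process lifetime, so a high value here means the process has been busy for most of its life, not just at the moment of the snapshot.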

tags: added: ci
Bogdan Dobrelya (bogdando) wrote :

I can observe the same behavior in a containerized undercloud.

Sergii Golovatiuk (sgolovatiuk) wrote :

Pike doesn't have such problems. I have just checked on my env.

OpenStack Infra (hudson-openstack) wrote : Related fix proposed to paunch (master)

Related fix proposed to branch: master
Review: https://review.openstack.org/554539

Ben Nemec (bnemec) wrote : Re: Undercloud openvswitch-agent eating 100% cpu

I also did not see this on Pike.

However, I am seeing the same thing on the overcloud. It appears to be a problem with openvswitch-agent itself or our configuration of it, not something specific to the undercloud.

summary: - Undercloud openvswitch-agent eating 100% cpu
+ openvswitch-agent eating 100% cpu
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to tripleo-heat-templates (master)

Related fix proposed to branch: master
Review: https://review.openstack.org/554863

OpenStack Infra (hudson-openstack) wrote :

Related fix proposed to branch: master
Review: https://review.openstack.org/554869

Bogdan Dobrelya (bogdando) wrote :

At the very least, I have proposed a related CI workaround to save some hardware resources until the bug is fixed.

Harald Jensås (harald-jensas) wrote :
OpenStack Infra (hudson-openstack) wrote : Related fix merged to paunch (master)

Reviewed: https://review.openstack.org/554539
Committed: https://git.openstack.org/cgit/openstack/paunch/commit/?id=4a4f43ac36c3edc2645c8fff5cf783415ea3f1cf
Submitter: Zuul
Branch: master

commit 4a4f43ac36c3edc2645c8fff5cf783415ea3f1cf
Author: Bogdan Dobrelya <email address hidden>
Date: Tue Mar 20 14:22:36 2018 +0100

    Allow to limit cgroup cpu shares

    Related-bug: #1755605

    Change-Id: I1f24866ea2ec01e3ef22571145efa25ab219bb3a
    Signed-off-by: Bogdan Dobrelya <email address hidden>
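
As background on what limiting cgroup CPU shares does: shares are relative weights, so they only take effect when several cgroups compete for the same CPU, and an otherwise idle host will still let a low-share process use a full core. A minimal sketch of the arithmetic, assuming the common default weight of 1024 per container; the value 512 is hypothetical and is not taken from the commit:

```python
# Sketch: CPU fraction a cgroup receives under full contention,
# given its cpu shares and the shares of its competing siblings.
# The numbers used below are illustrative, not taken from the paunch change.


def contended_cpu_fraction(shares, other_shares):
    """Return this cgroup's fraction of CPU when every sibling is busy."""
    total = float(shares + sum(other_shares))
    if total <= 0:
        raise ValueError("total shares must be positive")
    return shares / total


if __name__ == "__main__":
    # One container at a hypothetical 512 shares against one at 1024.
    print(contended_cpu_fraction(512, [1024]))
    # The same container against three busy containers at 1024 each.
    print(contended_cpu_fraction(512, [1024, 1024, 1024]))
```

This is why the change works as a CI workaround rather than a fix: it reduces how much a spinning agent can take from other busy services, but the agent still spins.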

OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-heat-templates (master)

Change abandoned by Bogdan Dobrelya (<email address hidden>) on branch: master
Review: https://review.openstack.org/554863

OpenStack Infra (hudson-openstack) wrote :

Change abandoned by Bogdan Dobrelya (<email address hidden>) on branch: master
Review: https://review.openstack.org/554869
Reason: Ping it if we might want this in the bug scope

Changed in tripleo:
milestone: rocky-1 → rocky-2
Changed in tripleo:
milestone: rocky-2 → rocky-3
Changed in tripleo:
milestone: rocky-3 → rocky-rc1
Changed in tripleo:
milestone: rocky-rc1 → stein-1
Changed in tripleo:
milestone: stein-1 → stein-2
Ben Nemec (bnemec) wrote :

I don't recall how or when this was fixed, but I don't see this behavior anymore.

Changed in tripleo:
status: Triaged → Fix Released
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to paunch (stable/queens)

Related fix proposed to branch: stable/queens
Review: https://review.opendev.org/701064

OpenStack Infra (hudson-openstack) wrote : Related fix merged to paunch (stable/queens)

Reviewed: https://review.opendev.org/701064
Committed: https://git.openstack.org/cgit/openstack/paunch/commit/?id=2ddf40d833f905353b9ca43a34e40f0118f0f898
Submitter: Zuul
Branch: stable/queens

commit 2ddf40d833f905353b9ca43a34e40f0118f0f898
Author: Bogdan Dobrelya <email address hidden>
Date: Tue Mar 20 14:22:36 2018 +0100

    Allow to limit cgroup cpu shares

    Related-bug: #1755605

    Change-Id: I1f24866ea2ec01e3ef22571145efa25ab219bb3a
    Signed-off-by: Bogdan Dobrelya <email address hidden>
    (cherry picked from commit 4a4f43ac36c3edc2645c8fff5cf783415ea3f1cf)

tags: added: in-stable-queens