stx-openstack: neutron netns cleanup job is deadlocking the cpu
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Thiago Paiva Brito |
Bug Description
Brief Description
-----------------
The neutron-
Severity
--------
<Major: System/Feature is usable but degraded>
Steps to Reproduce
------------------
Just apply stx-openstack and look at the neutron-
Expected Behavior
------------------
Job should run and sleep for a whole day.
Actual Behavior
----------------
Job is looping out and consuming more than 50% of cpu
Reproducibility
---------------
100% reproducible
System Configuration
-------
All configurations
Branch/Pull Time/Commit
-------
N/A
Last Pass
---------
N/A
Timestamp/Logs
--------------
N/A
Test Activity
-------------
Developer Testing
Workaround
----------
system helm-override-
system application-apply stx-openstack
Changed in starlingx: | |
importance: | Undecided → Medium |
tags: | added: stx.7.0 stx.distro.openstack |
The bug on the script itself that will prevent this from happening in case of any future misconfiguration is dealt with on https:/ /review. opendev. org/c/openstack /openstack- helm/+/ 833160
This bug only covers openstack- armada- app. The aim is to remove the override that is causing the error. We override conf.neutron. agent.root_ helper with just "sudo", but calling "ip netns list" with sudo causes a problem the current version of our base OS for the neutron image:
```
sh-4.2$ sudo ip netns list
sudo: unable to mkdir /run/sudo: Read-only file system
We trust you have received the usual lecture from the local System
Administrator. It usually boils down to these three things:
#1) Respect the privacy of others.
#2) Think before you type.
#3) With great power comes great responsibility.
[sudo] password for neutron:
```
Using the default call with the rootwrap that osh uses by default [1] solves the problem, so I'll remove the override.
[1] https:/ /github. com/openstack/ openstack- helm/blame/ master/ neutron/ values. yaml#L1903