OpenStack pods were not recovered soon after force reboot active controller
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Invalid
|
High
|
zhipeng liu |
Bug Description
Brief Description
-----------------
It usually takes around 10 minutes to get all pods ready again after force rebooting the active/standby controller
Severity
--------
Major
Steps to Reproduce
------------------
- Install and configure system, apply stx-openstack application
- Lock/unlock of standby controller
- reset (ie: reboot -f) of the standby controller
- reset (ie: reboot -f) of the active controller
- reapply of stx-openstack after the above scenarios
Expected Behavior
------------------
- All OpenStack pods recover to Running or Completed states soon.
Actual Behavior
----------------
- From send "reboot -f" to the time when we see all pods are ready, it takes around 10 minutes.
Reproducibility
---------------
100% performance issue?
System Configuration
-------
Duplex
Branch/Pull Time/Commit
-------
stx master daily build 20200530T013359Z
Changed in starlingx: | |
assignee: | nobody → zhipeng liu (zhipengs) |
status: | New → In Progress |
tags: | added: stx.distro.openstack |
Changed in starlingx: | |
status: | In Progress → Confirmed |
Hi all,
I have done too much test on 4 different set up including daily build 0530.
It usually takes 8~10min.
mariadb pod and ovs-db pod take some time to get them ready.
Below is the time statistics. run/openvswitch /db.sock)
4min for host restart and get ready.
3min for mariadb terminating, initialization, get ready. (then configmap sync is ready)
2min for ovs-db ready (reduce probe live/ready timer can improve a little, as it can retry quickly to connect ovs-vsctl: unix:/var/
1min for other pods ready, like neutron-ovs-agent which depends on ovs-db. )
Thanks!
Zhipeng