standby controller not going active after forced reboot of active controller
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Invalid
|
High
|
Bin Qian |
Bug Description
Brief Description
-----------------
In a two-node system, when forcing the active controller to go into an uncontrolled reboot (by crashing the kernel via sysrq) the standby controller sometimes takes a long time (multiple minutes) to go active.
On possible issue is that I think the system in question has direct-connected mgmt/infra links.
Severity
--------
Major
Steps to Reproduce
------------------
On active controller, as root, run:
echo 1 > /proc/sys/
Expected Behavior
------------------
The standby controller should go active.
Actual Behavior
----------------
The standby controller stayed standby for multiple minutes.
Reproducibility
---------------
Intermittent but frequent
System Configuration
-------
Two node system, and I think the mgmt/infra links are direct connect
Branch/Pull Time/Commit
-------
<email address hidden>"
BUILD_NUMBER="6"
BUILD_HOST=
BUILD_DATE=
Timestamp/Logs
--------------
Timestamp is somewhere around 19:46:56
tags: |
added: stx.2.0 removed: stx.2019.05 |
Changed in starlingx: | |
status: | Triaged → Invalid |
controller-0 sm logs