100.104 alarm raised after Lock/Reboot/Unlock Standby Controller operation
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Daniel Safta |
Bug Description
Brief Description
-----------------
The 100.104 alarm (see below) was raised after Lock/Reboot/Unlock operation on the Standby Controller (Controller-0 in this case).
100.104', 'host=controlle
+------
| UUID | Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| f137b20e-
+------
Severity
--------
Major
Steps to Reproduce
------------------
Install an app that should react to incoming SIGTERM from kubelet and run:
Run testcases/
def test_host_
Verify that host-reboot to an unlock node is rejected and host-reboot cmd working on locked hosts
Test Steps:
- Attempt to host-reboot from the active controller
- Verify that the host-reboot was rejected unlocked hosts
- lock host
- host reboot
- unlock host
Expected Behavior
------------------
No alarms were raised after the test execution and the pods stops properly.
Actual Behavior
----------------
100.104 alarm raised after the test execution because the pod generate a huge coredump.
Reproducibility
---------------
Reproducible
System Configuration
-------
DX
Branch/Pull Time/Commit
-------
2021-06-09_18-58-11
Last Pass
---------
-
Timestamp/Logs
--------------
-
Test Activity
-------------
Regression testing
Workaround
----------
-
Changed in starlingx: | |
assignee: | nobody → Daniel Safta (dsafta) |
importance: | Undecided → Medium |
tags: | added: stx.7.0 stx.containers |
Fix proposed to branch: master /review. opendev. org/c/starlingx /integ/ +/832519
Review: https:/