StarlingX bug reporting guidelines:
Please use the template below when opening StarlingX bugs.
Brief Description
-----------------
When a multi-version Kubernetes upgrade is performed through orchestration, the first control-plane and kubelet upgrade succeeds, but the upgrade to the next kubelet version fails because alarm 200.006 is raised.
Severity
--------
Major
Steps to Reproduce
------------------
Upgrade kubelet as part of an orchestrated multi-version Kubernetes upgrade.
Expected Behavior
------------------
isolcpu_plugin is running after each kubelet upgrade, and the orchestrated upgrade continues to the next Kubernetes version without alarm 200.006.
Actual Behavior
----------------
isolcpu_plugin is not running after kubelet upgrade.
Reproducibility
---------------
100%
System Configuration
--------------------
All controllers and workers
Branch/Pull Time/Commit
-----------------------
Master
Last Pass
---------
Timestamp/Logs
--------------
[sysadmin@controller-0 ~(keystone_admin)]$ sw-manager kube-upgrade-strategy show
Strategy Kubernetes Upgrade Strategy:
  strategy-uuid: 88e0b598-6abe-480f-972e-56b59dde0f4f
  controller-apply-type: serial
  storage-apply-type: serial
  worker-apply-type: serial
  default-instance-action: stop-start
  alarm-restrictions: strict
  current-phase: abort
  current-phase-completion: 100%
  state: aborted
  apply-result: failed
  apply-reason: alarms ['200.006'] from platform are present
  abort-result: success
  abort-reason:
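The apply-reason above names alarm 200.006, and the Actual Behavior section reports that isolcpu_plugin is not running. A minimal sketch for confirming both on an affected host follows. It assumes the StarlingX `fm alarm-list` command is available and that the systemd unit is named `isolcpu_plugin`; `has_alarm` is a hypothetical helper introduced only for this illustration.

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the alarm ID given as $1
# appears anywhere in the text read from stdin.
has_alarm() {
    grep -q -F -- "$1"
}

# Only run the check where the fault management CLI exists.
if command -v fm >/dev/null 2>&1; then
    if fm alarm-list | has_alarm "200.006"; then
        # Assumption: the unit name matches the process name in this report.
        systemctl status isolcpu_plugin --no-pager
    fi
fi
```

If the alarm is present and the unit is reported as masked or inactive, that matches the failure described in this report.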
Test Activity
-------------
Testing
Workaround
----------
Use systemd to unmask and start isolcpu_plugin. Use pmon-start to monitor isolcpu_plugin.
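The workaround can be sketched as the command sequence below. This is a hedged illustration, not a verified procedure: it assumes the systemd unit and the pmon process name are both `isolcpu_plugin`, and that the commands are run with sudo on each affected host. The `workaround_cmds` function is hypothetical and only prints the commands so they can be reviewed first.

```shell
#!/bin/sh
# Assumption: unit name and pmon process name are both "isolcpu_plugin".
SERVICE="isolcpu_plugin"

# Print the workaround commands in order; nothing is executed here.
workaround_cmds() {
    echo "sudo systemctl unmask $SERVICE"   # clear the masked state
    echo "sudo systemctl start $SERVICE"    # start the plugin again
    echo "sudo pmon-start $SERVICE"         # resume process monitoring
}

workaround_cmds
```

After reviewing the printed commands, they can be run one at a time on the host, or the output can be piped to `sh`.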