BnR: SX - restore succeeded but many pods are evicted
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Low
|
Joshua Kraitberg |
Bug Description
Brief Description
-----------------
After successful run of restore playbook, lots of pods from every namespace are evicted w/ no alarms raised. System is able to reschedule after and operation is green.
Eviction occurred during the restore playbook, before the unlock.
Severity
--------
Minor
Steps to Reproduce
------------------
Run optimized restore
Expected Behavior
------------------
No evicted pods
Actual Behavior
----------------
Evicted pods
Reproducibility
---------------
Unknown
System Configuration
-------
AIO-SX
Branch/Pull Time/Commit
-------
11-11-2023
Last Pass
---------
N/A
Timestamp/Logs
--------------
N/A
Test Activity
-------------
Automated Testing
Workaround
----------
Manual clean up evicted pods
Changed in starlingx: | |
status: | New → In Progress |
Changed in starlingx: | |
assignee: | nobody → Joshua Kraitberg (jkraitbe-wr) |
Changed in starlingx: | |
importance: | Undecided → Low |
tags: | added: stx.9.0 stx.update |
Reviewed: https:/ /review. opendev. org/c/starlingx /ansible- playbooks/ +/900849 /opendev. org/starlingx/ ansible- playbooks/ commit/ 6b3566a358bf076 77e8449a9f4f334 aaad09aa34
Committed: https:/
Submitter: "Zuul (22348)"
Branch: master
commit 6b3566a358bf076 77e8449a9f4f334 aaad09aa34
Author: Joshua Kraitberg <email address hidden>
Date: Mon Nov 13 22:40:09 2023 -0500
Fix: LV sizes not restored
This change did not work as stated in test plan: /review. opendev. org/c/starlingx /ansible- playbooks/ +/873377
https:/
LV sizes were not being restored pre-unlock.
To restore LV sizes, the original sizes are added to the runtime
configuration after being pulled from controller0 puppet hieradata,
which is currently not being used when doing the puppet apply step.
TEST PLAN distribution= 21
PASS: Optimized upgrade on AIO-SX, stx6 to stx8
PASS: Optimized upgrade on AIO-SX subcloud, stx6 to stx8
PASS: Optimized restore on AIO-SX, stx8
** For all tests confirm LV's are sized correctly before and after
unlock
** Before all tests increase the size of all partitions:
- system host-fs-modify controller-0 kubelet=22
- system controllerfs-modify docker-
- etc.
Closes-Bug: 2043491 5db0f7d0564dd34 e561a470ae7
Signed-off-by: Joshua Kraitberg <email address hidden>
Change-Id: Ic3fcc341371b16