Patch orchestration failed on unlocking controllers
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX | Fix Released | High | Eric MacDonald | | |
Bug Description
Brief Description
-----------------
While applying a test patch through patch orchestration, the controller unlock failed. Unlocking the controller manually succeeded. Further investigation by Bart produced the following findings:
2020-06-11 17:06:43 – The VIM issues a host-install request to patching for controller-0. This seems to complete immediately.
The VIM then waits 15 seconds (intentionally).
2020-06-11 17:06:59 – The VIM issues a host-unlock request for controller-0 to sysinv:
Sysinv queries patching to see if the host is patch current.
The patcher seems to say the host is not patch current:
sysinv 2020-06-11 17:07:00.289 254793 WARNING wsme.api [-] Client-side error: host-unlock rejected: Not patch current. 'sw-patch host-install controller-0' is required.: ClientSideError: host-unlock rejected: Not patch current. 'sw-patch host-install controller-0' is required.
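To make the timing concrete, the timestamps quoted above can be lined up in a short Python sketch. Only the timestamps come from the findings and the sysinv log line; the event labels and the `gaps` helper are illustrative names, not part of any StarlingX tool.

```python
from datetime import datetime

# Timestamps copied from the timeline above; the labels are illustrative.
events = [
    ("VIM host-install request to patching", "2020-06-11 17:06:43.000"),
    ("VIM request to sysinv for controller-0", "2020-06-11 17:06:59.000"),
    ("sysinv rejects host-unlock", "2020-06-11 17:07:00.289"),
]

FMT = "%Y-%m-%d %H:%M:%S.%f"

def gaps(evts):
    """Return (label, seconds since the previous event) for each event after the first."""
    parsed = [(label, datetime.strptime(ts, FMT)) for label, ts in evts]
    return [
        (label, (when - parsed[i][1]).total_seconds())
        for i, (label, when) in enumerate(parsed[1:])
    ]

for label, delta in gaps(events):
    print(f"{label}: +{delta:.3f}s")
```

The rejection arrives roughly 17 seconds after the host-install request, so the intentional 15-second wait had already elapsed when the patcher still reported the host as not patch current.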
Patch orchestration details
apply-phase:
total-stages: 2
current-stage: 0
stop-at-stage: 2
timeout: 10173 seconds
completion-
start-
end-date-time: 2020-06-11 17:07:00
result: failed
reason: host unlock failed
stages:
stage-id: 0
stage-name: sw-patch-
timeout: 5536 seconds
result: failed
reason: host unlock failed
steps:
result: success
reason:
result: success
reason:
result: success
reason:
result: success
reason:
result: success
reason:
result: failed
reason: host unlock failed
result: initial
reason:
stage-id: 1
stage-name: sw-patch-
timeout: 4636 seconds
result: initial
reason:
steps:
result: initial
reason:
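When triaging a dump like the one above, it can help to pull out only the failed entries and their reasons. The following is a minimal sketch, assuming the output has been saved as plain `key: value` lines in which each `result:` line is followed by its `reason:` line; `failed_entries` is a hypothetical helper, not part of any StarlingX tool.

```python
def failed_entries(text):
    """Return the reason attached to each 'result: failed' line.

    Assumes the 'reason:' line, if present, directly follows its 'result:' line.
    """
    lines = [ln.strip() for ln in text.splitlines()]
    reasons = []
    for i, ln in enumerate(lines):
        if ln == "result: failed":
            nxt = lines[i + 1] if i + 1 < len(lines) else ""
            if nxt.startswith("reason:"):
                reasons.append(nxt.partition(":")[2].strip())
            else:
                reasons.append("")
    return reasons

# A short excerpt in the same shape as the step results above.
sample = """\
result: success
reason:
result: failed
reason: host unlock failed
result: initial
reason:
"""

print(failed_entries(sample))
```

Run against the full dump, the same helper would report one entry per level (apply-phase, stage, step), since the failure reason is repeated at each level.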
Severity
--------
Major
System Configuration
-------
wcp-71-75
Expected Behavior
------------------
No failure on unlock
Actual Behavior
----------------
As described above, the unlock issued by patch orchestration fails.
Reproducibility
---------------
Tried only once with this load.
Load
-------
2020-06-10_20-00-00
Last Pass
---------
It passed on 2020-06-10_20-00-00 with a different rebootable patch. The failure above occurred with a large test patch.
Timestamp/Logs
--------------
2020-06-11 17:06:43
Test Activity
-------------
Regression test
summary: |
- patch orchestration failed on unlocking controllers + Patch orchestration failed on unlocking controllers |
tags: | added: stx.4.0 |
Changed in starlingx: | |
assignee: | Eric MacDonald (rocksolidmtce) → Bart Wensley (bartwensley) |
assignee: | Bart Wensley (bartwensley) → Eric MacDonald (rocksolidmtce) |
status: | Triaged → In Progress |
tags: | added: stx.4.0 stx.nfv |
This issue was reproduced in wcp-78-79 (duplex lab) with load 2020-06-10 22:43:29.
percentage: 100%
date-time: 2020-06-12 13:44:15
worker-hosts
total-steps: 7
current-step: 5
start-date-time: 2020-06-12 13:44:15
end-date-time: 2020-06-12 13:50:34
apply-phase:
total-stages: 2
current-stage: 0
stop-at-stage: 2
timeout: 11073 seconds
completion-
start-
end-date-time: 2020-06-12 13:50:34
result: failed
reason: host unlock failed
stages:
stage-id: 0
stage-name: sw-patch-
timeout: 5536 seconds
result: failed
reason: host unlock failed
steps: