Debian: cert-mon PriorityQueue regression in python3
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Kyle MacLeod |
Bug Description
Brief Description
-----------------
DC Debian - subcloud was with dc-cert in "out-of-sync" after coming online, cert-manager didn't audit subcloud
Severity
--------
<Major: System/Feature is usable but degraded>
Steps to Reproduce
50 HW subclouds were deployed in parallel
16 HW subclouds were deployed successfully (went to online state)
Ran subcloud manage command for the 16 HW subclouds
1 HW subcloud kept in "out-of-sync" state
| 9 | subcloud2007 | managed | online | complete | in-sync | None | None |
| 10 | subcloud2008 | managed | online | complete | in-sync | None | None |
| 11 | subcloud2009 | managed | online | complete | in-sync | None | None |
| 28 | subcloud2026 | managed | online | complete | in-sync | None | None |
| 29 | subcloud2027 | managed | online | complete | in-sync | None | None |
| 30 | subcloud2028 | managed | online | complete | in-sync | None | None |
| 31 | subcloud2029 | managed | online | complete | in-sync | None | None |
| 32 | subcloud2030 | managed | online | complete | in-sync | None | None |
| 45 | subcloud2043 | managed | online | complete | out-of-sync | None | None |
| 46 | subcloud2044 | managed | online | complete | in-sync | None | None |
| 47 | subcloud2045 | managed | online | complete | in-sync | None | None |
| 48 | subcloud2046 | managed | online | complete | in-sync | None | None |
| 49 | subcloud2047 | managed | online | complete | in-sync | None | None |
| 50 | subcloud2048 | managed | online | complete | in-sync | None | None |
| 51 | subcloud2049 | managed | online | complete | in-sync | None | None |
| 52 | subcloud2050 | managed | online | complete | in-sync | None | None |
| Field | Value |
| id | 45 |
| name | subcloud2043 |
| description | None |
| location | None |
| software_version | 22.12 |
| management | managed |
| availability | online |
| deploy_status | complete |
| management_subnet | fdff:719a:
| management_start_ip | fdff:719a:
| management_end_ip | fdff:719a:
| management_
| systemcontrolle
| group_id | 1 |
| created_at | 2022-10-10 17:41:14.917949 |
| updated_at | 2022-10-10 20:08:05.963986 |
| backup_status | None |
| backup_datetime | None |
| dc-cert_sync_status | unknown |
| firmware_
| identity_
| kubernetes_
| kube-rootca_
| load_sync_status | in-sync |
| patching_
| platform_
Expected Behavior
-----------------
All subclouds should go to "in-sync" state after managing them
Actual Behavior
---------------
1 subcloud kept in "out-of-sync" state
Reproducibility
---------------
intermittent, seen in one subcloud
System Configuration
-------
DC
Load info (eg: 2022-03-
STX Master
----------
Last Pass
----------
New test scenario - Debian Scale testing
Timestamp/Logs
Unexpected behavior:
dcmanager.log
2022-10-10 18:38:41.651 115387 INFO dcmanager.
cert-mon.log (DEPLOY PHASE)
2022-10-
2022-10-
2022-10-
dcmanager.log (SUBCLOUD MANAGE COMMAND)
2022-10-10 20:08:05.978 115387 INFO dcmanager.
cert-mon.log (AFTER RUNNING SUBCLOUD MANAGE COMMAND)
2022-10-
expexted behavior (subcloud2050)
cert-mon.log (deploy phase)
2022-10-
2022-10-
2022-10-
2022-10-
2022-10-
2022-10-
2022-10-
dcmanager.log
2022-10-10 20:08:22.745 115387 INFO dcmanager.
cert-mon.log (after running subcloud manage command)
2022-10-
2022-10-
2022-10-
2022-10-
2022-10-
n
Alarms
Test Activity
Feature Testing
Workaround
No workaround identified
tags: | added: stx.8.0 stx.debian stx.distcloud stx.security |
Changed in starlingx: | |
importance: | Undecided → Medium |
assignee: | nobody → Kyle MacLeod (kmacleod) |
Fix proposed to branch: master /review. opendev. org/c/starlingx /config/ +/861099
Review: https:/