Collect tool times out collecting from large fully loaded stressed system
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Low
|
Eric MacDonald |
Bug Description
Brief Description
-----------------
The 'containerizati
Severity
--------
Minor: This is not in-service product affecting but does prevent getting a collect bundle that includes the active controller in large systems under stress.
Steps to Reproduce
------------------
Run 'collect -a' while deploying 1500 pods, 30 pods per node in a 2+2+50 standard system
Expected Behavior
------------------
Collect completes just fine
Actual Behavior
----------------
Collect times out.
Reproducibility
---------------
100%
System Configuration
-------
2+2+50 Standard System
Branch/Pull Time/Commit
-------
Any
Last Pass
---------
Never passed under this type of stress loading.
Timestamp/Logs
--------------
Error: operation timeout ; failed to collect from controller-0 [target] (reason:10)
Test Activity
-------------
Stress testing
Workaround
----------
Reduce pod loading and retry
Changed in starlingx: | |
importance: | Undecided → Low |
tags: | added: stx.tools |
tags: | added: stx.9.0 |
Changed in starlingx: | |
assignee: | nobody → Eric MacDonald (rocksolidmtce) |
Fix proposed to branch: master /review. opendev. org/c/starlingx /utilities/ +/873473
Review: https:/