Multiple Local registry: 500 Server Error cause application-apply errors
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Invalid
|
High
|
Abraham Arce |
Bug Description
Brief Description
-----------------
During application-apply of platform-integ-apps or stx-openstack, Local registry fails and return 500 errors, causing application-apply to fail.
Severity
--------
Provide the severity of the defect.
Minor/Major: System is usable, but it takes several extra minutes (from ~30mins to ~90mins) to do the install, each failure restarts application-apply process.
Steps to Reproduce
------------------
Follow up wiki/docs procedure. During application-apply, either of platform-integ-apps or stx-openstack, this error can be observed.
Expected Behavior
------------------
Docker images are successfully downloaded from Local registry.
Actual Behavior
----------------
Local registry fails to fulfill requests and returns 500 errors. System is not able to download the images required to complete the apply.
Reproducibility
---------------
100% reproducible, eventually, after two or three applies, all images are downloaded and application-apply completes.
System Configuration
-------
Observed on all configs, Simplex, Duplex, Standard, Standard-External, Baremetal.
Branch/Pull Time/Commit
-------
###
### StarlingX
### Built from master
###
OS="centos"
SW_VERSION="19.01"
BUILD_TARGET="Host Installer"
BUILD_TYPE="Formal"
BUILD_ID=
JOB="STX_
<email address hidden>"
BUILD_NUMBER="207"
BUILD_HOST=
BUILD_DATE=
Last Pass
---------
These errors started on build from 08/06, confirmed with logs on 08/05, no 500 errors were detected:
http://
Timestamp/Logs
--------------
This was observed on all 4 configs, Here are the errors from a Standard (2+2) collect attached:
http://
A collect is attached. This is what we have on our sanity logs from robot framework:
20190809 14:38:07.782 - INFO - +------ START KW: SSHLibrary.Write [ ${cmd} ]
-integ-apps|awk '{print $10}'- system application-
20190809 14:38:07.793 - INFO - +------ END KW: SSHLibrary.Write (11)
20190809 14:38:07.793 - INFO - +------ START KW: SSHLibrary.Read Until Prompt [ ]
20190809 14:38:08.879 - INFO - apply-failed
--
20190809 15:24:08.618 - INFO - +------- START KW: SSHLibrary.Write [ ${cmd} ]
stack|awk '{print $10}' INFO - system application-
20190809 15:24:08.629 - INFO - +------- END KW: SSHLibrary.Write (11)
20190809 15:24:08.629 - INFO - +------- START KW: SSHLibrary.Read Until Prompt [ ]
20190809 15:24:09.757 - INFO - apply-failed
As you can see, both, platform-integ-apps and also, stx-openstack failed to apply. This was observed on all 4 configurations. Our sanity suite automatically retries the application-apply, eventually it succeeds on applying it.
Test Activity
-------------
Sanity.
description: | updated |
Changed in starlingx: | |
status: | Triaged → Invalid |
Marking as high priority as application apply should pass and the apply should not take 60-90 minutes.
Assigning to Yong to identify a prime to lead the investigation.