apps failed to upload or apply due to armada pod not ready
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Jim Gauld |
Bug Description
Brief Description
-----------------
During system installation, 3 apps (cert-manager, oidc-auth-apps and platform-
Severity
--------
Major
Steps to Reproduce
------------------
Installing a system and apply applications on it
Expected Behavior
------------------
Application applied success
Actual Behavior
----------------
Application apply-failed or upload-failed
Reproducibility
---------------
Intermittent
1/10 It seems to happen once in two weeks.
System Configuration
-------
One node system
Two node system
Lab-name: AIO-Plus
Branch/Pull Time/Commit
-------
2020-07-02_22-00-00
Last Pass
---------
2020-07-01_22-00-00
Timestamp/Logs
--------------
[2020-07-03 06:37:27,166] 314 DEBUG MainThread ssh.send :: Send 'system --os-endpoint-type internalURL --os-region-name RegionOne application-list'
[2020-07-03 06:37:28,608] 436 DEBUG MainThread ssh.expect :: Output:
+------
| application | version | manifest name | manifest file | status | progress |
+------
| cert-manager | 1.0-5 | cert-manager-
| nginx-ingress-
| oidc-auth-apps | 1.0-27 | oidc-auth-manifest | manifest.yaml | uploading | extracting application tar file |
| platform-integ-apps | 1.0-9 | platform-
[2020-07-03 07:18:08,570] 314 DEBUG MainThread ssh.send :: Send 'system --os-endpoint-type internalURL --os-region-name RegionOne application-list'
[2020-07-03 07:18:09,969] 436 DEBUG MainThread ssh.expect :: Output:
+------
| application | version | manifest name | manifest file | status | progress |
+------
| cert-manager | 1.0-5 | cert-manager-
| nginx-ingress-
| | | | yaml | | |
| | | | | | |
| oidc-auth-apps | 1.0-27 | oidc-auth-manifest | manifest.yaml | upload-failed | Upload of application oidc-auth-apps (1.0-27) failed: Failed to validate application manifest. |
| platform-integ-apps | 1.0-9 | platform-
+------
[sysadmin@
+------
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 750.002 | Application Apply Failure | k8s_application
| 750.001 | Application Upload Failure | k8s_application
| 750.001 | Application Upload Failure | k8s_application
+------
sysinv 2020-07-03 06:37:13.618 104562 INFO sysinv.
sysinv 2020-07-03 06:37:13.788 104562 ERROR sysinv.
Command: kubectl --kubeconfig /etc/kubernetes
Exit code: 1
Stdout: ''
Stderr: 'error: unable to upgrade connection: container not found ("armada-api")\n'
sysinv 2020-07-03 06:37:13.789 104562 INFO sysinv.
[sysadmin@
armada armada-
kubectl describe above armada pod:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 20h default-scheduler Successfully assigned armada/
Normal Pulled 20h kubelet, controller-0 Container image "registry.
Normal Created 20h kubelet, controller-0 Created container init
Normal Started 20h kubelet, controller-0 Started container init
Normal Pulled 20h kubelet, controller-0 Container image "registry.
Normal Created 20h kubelet, controller-0 Created container armada-api
Normal Started 20h kubelet, controller-0 Started container armada-api
Normal Pulled 20h kubelet, controller-0 Container image "registry.
Normal Created 20h kubelet, controller-0 Created container tiller
Normal Started 20h kubelet, controller-0 Started container tiller
Warning NodeAffinity 20h kubelet, controller-0 Predicate NodeAffinity failed
Warning FailedMount 20h kubelet, controller-0 MountVolume.SetUp failed for volume "armada-etc" : failed to sync configmap cache: timed out waiting for the condition
Warning FailedMount 20h kubelet, controller-0 MountVolume.SetUp failed for volume "armada-
Warning FailedMount 83m (x96 over 20h) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=
Warning FailedMount 58m (x129 over 19h) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=
Warning FailedMount 13m (x107 over 20h) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=
Warning FailedMount 8m15s (x607 over 20h) kubelet, controller-0 MountVolume.SetUp failed for volume "armada-etc" : stat /var/lib/
Warning FailedMount 3m30s (x252 over 20h) kubelet, controller-0 Unable to attach or mount volumes: unmounted volumes=
Sat Jul 4 03:10:07 UTC 2020
Test Activity
-------------
installation
Changed in starlingx: | |
importance: | Undecided → Critical |
importance: | Critical → High |
status: | New → Confirmed |
tags: | added: stx.4.0 stx.containers |
Changed in starlingx: | |
assignee: | nobody → Bob Church (rchurch) |
tags: | added: not-yet-in-r-stx40 |
Changed in starlingx: | |
assignee: | Jim Gauld (jgauld) → Suvro Ghosh (suvr0) |
tags: | added: stx.5.0 |
Changed in starlingx: | |
assignee: | Suvro Ghosh (suvr0) → Jim Gauld (jgauld) |
Collect log added at /files. starlingx. kube.cengn. ca/launchpad/ 1886429
https:/