Containers: worker nodes gets stuck at ContainerCreating for some time due to no network connectivity to floating IP
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Invalid
|
Medium
|
Joseph Richard |
Bug Description
Brief Description
-----------------
After install and unlock worker nodes, they stuck at ContainerCreating for extended amount of time (> 40m) due to they failed to pull images from external repo
Severity
--------
Minor
Steps to Reproduce
------------------
- Install and configure controller-0
- Install controller-1 and worker nodes from controller-0, and unlock them
Expected Behavior
------------------
- Worker nodes should pull images from internal registry and in Ready states shortly after unlock completes
Actual Behavior
----------------
- worker nodes were trying to pull images from external repo and got stuck at NotReady - ContainerCreating for 40 minutes plus
Reproducibility
---------------
Intermittent
System Configuration
-------
Multi-node system
Branch/Pull Time/Commit
-------
f/stein as of 2019-02-25
Timestamp/Logs
--------------
NAME STATUS ROLES AGE VERSION
compute-0 NotReady <none> 45m v1.12.3
compute-1 NotReady <none> 44m v1.12.3
controller-0 Ready master 117m v1.12.3
controller-1 Ready master 65m v1.12.3
[wrsroot@
NAMESPACE NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE
kube-system calico-node-m6znx 0/2 ContainerCreating 0 41m 192.168.204.91 compute-0 <none>
kube-system calico-node-w9nlk 0/2 ContainerCreating 0 40m 192.168.204.185 compute-1 <none>
kube-system kube-proxy-66j88 0/1 ContainerCreating 0 40m 192.168.204.185 compute-1 <none>
kube-system kube-proxy-86jn4 0/1 ContainerCreating 0 41m 192.168.204.91 compute-0 <none>
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 43m default-scheduler Successfully assigned kube-system/
Warning FailedCreatePod
tags: | added: stx.containers |
Changed in starlingx: | |
assignee: | Angie Wang (angiewang) → Joseph Richard (josephrichard) |
tags: |
added: stx.2.0 removed: stx.2019.05 |
tags: | added: stx.retestneeded |
tags: | removed: stx.containers |
Marking as release gating; medium priority as the timeout is intermittent. This should be used to make the container download more robust and less dependent on the external registry.