SNMP docker images are not being downloaded correctly
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Takamasa Takenaka |
Bug Description
Brief Description
-----------------
The system fails to apply snmp application. The snmp pod gets stuck on ImagePullBackOff trying to pull trap-agent and subagent images.
When container registry url is modified for docker.io, we expect all snmp containers from configured url.
Severity
--------
<Critical: System/Feature is not usable after the defect>
Steps to Reproduce
------------------
1. Upload snmp application:
system application-upload /usr/local/
2. Apply the snmp application:
system application-apply snmp
Expected Behavior
------------------
Application should reach 'applied' status.
Actual Behavior
----------------
Application reaches 'apply-failed' status
Reproducibility
---------------
<Reproducible/
100% reproducible on ipv6 systems. (it could happen in IPv4)
System Configuration
-------
Two node system (But it could happen any type of configuration)
Timestamp/Logs
--------------
Based on sysinv.log, I can see that one of the 3 images was pulled from configured registry:
sysinv 2021-11-25 12:05:44.169 1163158 INFO sysinv.
But the other two are being pulled from docker.io instead:
kubectl describe pod -n kube-system ns-snmp-
Warning Failed 4m43s kubelet Failed to pull image "docker.
Warning Failed 4m30s kubelet Failed to pull image "docker.
Warning Failed 4m18s (x2 over 4m43s) kubelet Error: ErrImagePull
Warning Failed 4m18s kubelet Failed to pull image "docker.
Warning Failed 4m17s kubelet Error: ImagePullBackOff
Normal BackOff 4m17s kubelet Back-off pulling image "docker.
Warning Failed 4m17s kubelet Error: ImagePullBackOff
Normal Pulling 4m3s (x3 over 5m7s) kubelet Pulling image "docker.
Warning Failed 3m51s (x3 over 4m55s) kubelet Error: ErrImagePull
Normal Pulling 3m51s (x3 over 4m55s) kubelet Pulling image "docker.
Warning Failed 3m51s kubelet Failed to pull image "docker.
Warning Failed 3m39s kubelet Failed to pull image "docker.
Normal BackOff 15s (x11 over 4m17s) kubelet Back-off pulling image "docker.
And it's failing to retrieve this images if there is no connection to docker.io.
Test Activity
-------------
Regression Testing
Workaround
----------
Update snmp helm chart directly with configured url for stx-fm-subagent and stx-trap-subagent
system helm-override-
system helm-override-
system application-apply snmp
Changed in starlingx: | |
assignee: | nobody → Takamasa Takenaka (ttakenak) |
status: | New → In Progress |
Changed in starlingx: | |
importance: | Undecided → Critical |
importance: | Critical → Medium |
tags: | added: stx.6.0 stx.apps |
tags: | added: stx.fault |
Fix proposed to branch: master /review. opendev. org/c/starlingx /snmp-armada- app/+/819698
Review: https:/