CNI goes useless if the watcher loses connectivity with the API for longer than the retry timeout
Bug #1776676 reported by
Antoni Segura Puimedon
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
kuryr-kubernetes |
Fix Released
|
Critical
|
Michal Dulko |
Bug Description
When CNI probes are not enabled, if the API becomes unavailable for longer than the watch retry timeout, the pod watcher gracefully exits but the CNI daemon continues running. In containerized deployments specially this means that the CNI daemon pod will not be restarted and will never go back to working status without manual intervention (deleting the pod).
We should make sure that if no watchers remain, we sys exit the controller/CNI daemon, since it can't be doing anything useful anyway and this way we'll get back to working condition eventually should the API become reachable again.
Changed in kuryr-kubernetes: | |
status: | New → Triaged |
importance: | Undecided → Critical |
assignee: | nobody → Antoni Segura Puimedon (celebdor) |
Changed in kuryr-kubernetes: | |
assignee: | Antoni Segura Puimedon (celebdor) → Michal Dulko (michal-dulko-f) |
To post a comment you must log in.
Fix proposed to branch: master /review. openstack. org/575119
Review: https:/