Transitive failure in units during upgrade/refresh
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
Canonical Juju | Triaged | Medium | Heather Lanigan | |
Bug Description
I'm currently testing an upgrade process on a deployment with a machine charm (zookeeper, from rev 114 to latest), running on Juju 3.1.6 (I have also tested on 3.1.7).
First, I deploy rev 114 with:
```
juju deploy zookeeper --channel 3/edge --revision 114 -n 3
```
When I refresh the charm with `juju refresh zookeeper`, the units fail at first. This is shown both in the `juju status` output and in the debug log, which contains a message like the following:
```
juju.worker.
```
After some time (one or two minutes), the failure self-heals and resolves. However, the messages can be misleading, especially during upgrade processes, which are very delicate.
Although this was observed with the zookeeper charm, it is not specific to it: the behaviour has proven to be general and applies to all other charms as well.
summary: |
- Transiently failure in units during upgrade/refresh
+ Transitive failure in units during upgrade/refresh |
Changed in juju: | |
assignee: | nobody → Heather Lanigan (hmlanigan) |
Changed in juju: | |
milestone: | 3.3.3 → 3.3.4 |
Changed in juju: | |
milestone: | 3.3.4 → 3.3.5 |
Changed in juju: | |
milestone: | 3.3.5 → 3.3.6 |
Changed in juju: | |
assignee: | Heather Lanigan (hmlanigan) → Caner Derici (cderici) |
milestone: | 3.3.6 → 3.4.4 |
Changed in juju: | |
status: | Triaged → In Progress |
Changed in juju: | |
milestone: | 3.4.4 → 3.4.5 |
Changed in juju: | |
milestone: | 3.4.5 → 3.4.6 |
Changed in juju: | |
importance: | High → Medium |
milestone: | 3.4.6 → none |
status: | In Progress → Triaged |
This issue can also happen with deploy, and has been seen in large models.
Because of the async charm download feature introduced in Juju 3.0, the uniter may try to get the charm from the controller before the download has completed, at which point it errors and retries later.
Instead, the controller could return a "pending download, try again later" error, so the uniter knows the charm is queued for download or that the download is in progress.
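To illustrate the idea, here is a minimal Python sketch of a client that treats a dedicated "pending download" error as a signal to wait and retry rather than as a unit failure. This is not Juju code: `CharmPendingDownload`, `FakeController`, `get_charm`, and `fetch_charm` are all hypothetical names invented for the example, and the real uniter and controller API may look quite different.

```python
import time


class CharmPendingDownload(Exception):
    """Hypothetical error: the charm is queued or still downloading."""


class FakeController:
    """Stand-in for a controller whose charm download finishes after N polls."""

    def __init__(self, polls_until_ready):
        self._remaining = polls_until_ready

    def get_charm(self, url):
        # While the async download is incomplete, report "pending"
        # instead of a generic failure.
        if self._remaining > 0:
            self._remaining -= 1
            raise CharmPendingDownload(url)
        return f"charm-archive:{url}"


def fetch_charm(controller, url, retries=5, delay=1.0, sleep=time.sleep):
    """Fetch a charm, retrying quietly while the download is pending.

    Returns (archive, attempts). Re-raises CharmPendingDownload only if
    the charm is still pending after the last allowed attempt.
    """
    for attempt in range(1, retries + 1):
        try:
            return controller.get_charm(url), attempt
        except CharmPendingDownload:
            if attempt == retries:
                raise
            # Pending is an expected state, not a unit error: wait and retry.
            sleep(delay)


if __name__ == "__main__":
    archive, attempts = fetch_charm(FakeController(2), "ch:zookeeper", delay=0)
    print(archive, attempts)
```

With a distinct error type, the caller can keep the unit in a waiting state during the retries and only surface an error if the charm is still unavailable once the retry budget is exhausted.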