Controller in HA stays down after upgrade

Bug #1999642 reported by Joseph Phillips
This bug affects 1 person
Affects: Canonical Juju
Status: Triaged
Importance: Medium
Assigned to: Unassigned
Milestone: (none)

Bug Description

After upgrading a controller (using --build-agent) in a running HA setup, one of the agents remained down.

The cause turned out to be that the API server worker could not start. The following error occurred repeatedly:

2022-12-14 12:58:12 DEBUG juju.worker.dependency engine.go:618 "api-server" manifold worker stopped: duplicate metrics collector registration attempted
stack trace:
duplicate metrics collector registration attempted
github.com/juju/juju/worker/apiserver.ManifoldConfig.start:211:
2022-12-14 12:58:12 ERROR juju.worker.dependency engine.go:695 "api-server" manifold worker returned unexpected error: duplicate metrics collector registration attempted

Joseph Phillips (manadart) wrote :

Restarting the agent restored operation.

Changed in juju:
status: New → Triaged
importance: Undecided → Medium
Simon Richardson (simonrichardson) wrote :

There was a proposal some time ago that we would unregister every time before doing a new register. That seems like overkill, but we could adopt a pattern where you always defer an unregister and just ignore the error if there is one, similar to the one in worker/modelcache/worker.go:201 (I think it was you who did that). Then at least we're always unregistering.
