[2.6.4] Manual Provider: Workers seems to crash on both controller and model machines
Bug #1833282 reported by
Pedro Guimarães
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Triaged
|
Low
|
Unassigned |
Bug Description
Xenial deployment with manual provider. No enable-ha feature used.
Juju 2.6.4 (2.6/Candidate)
Running on top of VMWare
Each VM has multiple networks connected into it. VMs can reach each other on multiple networks.
Machines crash and go to "down" state just after being added to model.
I can still reach machines using "juju ssh" command though.
Also, deploying 2.6.3/stable on this same environment worked fine last week.
juju crashdumps for both "kubernetes" and "controller" models and also /var/log from juju controller here: https:/
description: | updated |
Changed in juju: | |
status: | New → Incomplete |
Changed in juju: | |
status: | In Progress → Incomplete |
Changed in juju: | |
status: | New → Triaged |
milestone: | none → 2.7-beta1 |
Changed in juju: | |
milestone: | 2.7-beta1 → 2.7-rc1 |
Changed in juju: | |
milestone: | 2.7-rc1 → none |
importance: | Undecided → Wishlist |
To post a comment you must log in.
So it looks like the controller comes up, but when it goes to start the peergrouper worker things start to get into error loops. The machines have multiple networks but are manual provider and don't support spaces. This causes things to end up not coming up as expected. Here's a repeating section of the peergrouper trying to start, causing a series of errors, raft then starts to error, and eventually we get a restart.
https:/ /pastebin. canonical. com/p/bdt2fJR7D 4/