MAAS temporarily reporting no instance while bringing up a new instance.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Incomplete
|
Low
|
Unassigned | ||
MAAS |
Expired
|
Undecided
|
Unassigned |
Bug Description
#1731108 might be related!
Just performed a bootstrap to a KVMs on MAAS v2.3.3
added a new model "deployment"
installed postgresql-HA
then returned to controller model and enable-ha I got this many times:
machine-0: 10:46:03 ERROR juju.worker.
u@MAAS:~# juju status
Model Controller Cloud/Region Version SLA
controller jujumaas maaslab 2.3.7 unsupported
Machine State DNS Inst id Series AZ Message
0 started x.x.x.42 4s33ky xenial default Deployed
1 started x.x.x.40 efdqxc xenial default Deployed
2 started x.x.x.41 fw3dw7 xenial default Deployed
u@MAAS:~# juju controllers --refresh
Controller Model User Access Cloud/Region Models Machines HA Version
jujumaas* controller admin superuser maaslab 3 6 3 2.3.7
More log:
machine-0: 10:49:28 ERROR juju.worker.
machine-0: 10:49:28 DEBUG juju.worker.
cannot get network interfaces of "4s33ky": instance "4s33ky" not found (not found)
github.
github.
github.
github.
machine-2: 10:49:29 DEBUG juju.worker.
machine-0: 10:49:30 DEBUG juju.worker.
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 1.249135ms {"request-
machine-0: 10:49:31 WARNING juju.cmd.jujud determining kvm support: INFO: Your CPU does not support KVM extensions
KVM acceleration can NOT be used
: exit status 1
no kvm containers possible
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 1.100905ms {"request-
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 61.834742ms {"request-
machine-0: 10:49:31 DEBUG juju.worker start "0-container-
machine-0: 10:49:31 INFO juju.worker start "0-container-
machine-0: 10:49:31 DEBUG juju.worker "0-container-
machine-0: 10:49:31 DEBUG juju.cmd.jujud upgrades done, starting worker "0-container-
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 2.5669ms {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 2.609106ms {"request-
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.provider.maas opening model "controller".
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 488.675989ms {"request-
machine-0: 10:49:31 DEBUG juju.worker killing runner 0xc4203bb040
machine-0: 10:49:31 INFO juju.worker runner is dying
machine-0: 10:49:31 DEBUG juju.worker killing "0-container-
machine-0: 10:49:31 DEBUG juju.cmd.jujud stopping so killing worker "0-container-
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 352.256µs {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 490.115139ms {"request-
machine-0: 10:49:31 INFO juju.worker stopped "0-container-
machine-0: 10:49:31 DEBUG juju.worker "0-container-
machine-0: 10:49:31 DEBUG juju.worker no restart, removing "0-container-
machine-0: 10:49:31 DEBUG juju.worker.
machine-0: 10:49:31 ERROR juju.worker.
machine-0: 10:49:31 DEBUG juju.worker.
cannot get network interfaces of "4s33ky": instance "4s33ky" not found
github.
github.
github.
github.
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 1.536168ms {"request-
machine-0: 10:49:31 DEBUG juju.worker.
machine-0: 10:49:31 DEBUG juju.apiserver <- [1] machine-0 {"request-
machine-0: 10:49:31 DEBUG juju.apiserver -> [1] machine-0 1.283401ms {"request-
machine-0: 10:49:31 DEBUG juju.network no lxc bridge addresses to filter for machine
machine-0: 10:49:31 DEBUG juju.network "lxdbr0" has addresses [fe80::
machine-0: 10:49:31 DEBUG juju.network cannot get "virbr0" addresses: route ip+net: no such network interface (ignoring)
machine-0: 10:49:31 DEBUG juju.network including address local-machine:
machine-0: 10:49:31 DEBUG juju.network including address local-cloud:
machine-0: 10:49:31 DEBUG juju.network including address local-machine:::1 for machine
machine-0: 10:49:31 DEBUG juju.network addresses after filtering: [local-
machine-0: 10:49:31 INFO juju.worker.
description: | updated |
Changed in juju: | |
importance: | Undecided → Medium |
status: | New → Triaged |
summary: |
- ERROR cannot update observed network config + MAAS temporarily reporting no instance while bringing up a new instance. |
Changed in juju: | |
status: | Triaged → Incomplete |
tags: | removed: maas |
Is this something that eventually worked, or is this something that
completely blocked enable-ha from continuing?
That looks like we're getting a failure asking MAAS about the details of
"4s33ky" which seems very odd given that it is exactly the machine that we
are running on.
I don't see the error trace in here, but it seems like it would be part of:
instId, err := m.InstanceId()
if err != nil {
return nil, errors.Trace(err)
}
interfaceInfos, err := netEnviron. NetworkInterfac es(api. callContext, instId)
which ultimately leads down to: acquiredInstanc es(ctx, []instance. Id{instId} ) Cause(err) .(gomaasapi. ServerError) ; ok && NotFoundf( "instance %q", instId) Annotatef( err, "getting instance %q", instId) NotFoundf( "instance %q", instId)
instances, err := environ.
if err != nil {
// This path can never trigger on MAAS 2, but MAAS 2 doesn't
// return an error for a machine not found, it just returns
// empty results. The clause below catches that.
if maasErr, ok := errors.
maasErr.StatusCode == http.StatusNotFound {
return nil, errors.
}
return nil, errors.
}
if len(instances) == 0 {
return nil, errors.
}
inst := instances[0]
return inst, nil
... MachinesArgs{ stemIDs( ids),
args := gomaasapi.
AgentName: environ.uuid,
SystemIDs: instanceIdsToSy
}
... maasController. Machines( args)
machines, err := environ.
And I'm guessing somewhere in there is where we are getting "not found". config= juju.apiserver. common= TRACE on the controller model, but
(You should be able to see the call stack if you used
logging-
there is enough context in those messages that we probably know where the
error is coming from. (most likely MAAS is returning an empty list when we
are probing for the machine, and I don't know why that would be.)
On Thu, May 24, 2018 at 12:54 PM, MichaelEino <email address hidden>
wrote:
> Public bug reported: dependency "machiner" manifold dependency "machiner" manifold
>
> #1731108 might be related!
>
> Just performed a bootstrap to a KVMs on MAAS v2.3.3
> added a new model "deployment"
> installed postgresql-HA
> then returned to controller model and enable-ha I got this many times:
>
> machine-0: 10:46:03 ERROR juju.worker.
> worker returned unexpected error: cannot update observed network config:
> cannot get network interfaces of "4s33ky": instance "4s33ky" not found
> (not found)
>
> u@MAAS:~# juju status
> Model Controller Cloud/Region Version SLA
> controller jujumaas maaslab 2.3.7 unsupported
> Machine State DNS Inst id Series AZ Message
> 0 started x.x.x.42 4s33ky xenial default Deployed
> 1 started x.x.x.40 efdqxc xenial default Deployed
> 2 started x.x.x.41 fw3dw7 xenial default Deployed
>
> u@MAAS:~# juju controllers --refresh
> Controller Model User Access Cloud/Region Models Machines
> HA Version
> jujumaas* controller admin superuser maaslab 3 6
> 3 2.3.7
>
> More log:
>
> machine-0: 10:49:28 ERROR juju.worker.
> worker returned unexpected error: cannot update observed network config:
> cannot get network interfaces of "4s33ky": instance "4s33ky" not found (not
> found)
> m...