Containers fail to get ip when non-maas dhcp/dns is used
| Affects | Status | Importance | Assigned to | Milestone | |
|---|---|---|---|---|---|
| | juju-core |
High
|
Unassigned | ||
Bug Description
When non-maas dhcp/dns is used, services deployed to containers fail to get ip addresses and fail deployment. This completely blocks deploying any openstack services on containers.
I am running 14.04.2 with juju version 1.24.0-
My juju status shows: http://
Issue created with openstack-
| description: | updated |
| Changed in juju-core: | |
| status: | New → Triaged |
| importance: | Undecided → High |
| Tim Penhey (thumper) wrote : | #1 |
| james beedy (jamesbeedy) wrote : | #2 |
Tim,
The only log files I have on my container host are:
/var/log/
/var/lib/
/var/lib/
| james beedy (jamesbeedy) wrote : | #3 |
There are another 7 log files for the other 7 containers, if needed I can post those too....although they show no difference from the two I posted other than the machine and service name.
| james beedy (jamesbeedy) wrote : | #4 |
I guess there aren't even log files in /var/lib/
| Changed in juju-core: | |
| milestone: | none → 1.25.0 |
| Dimiter Naydenov (dimitern) wrote : | #5 |
If MAAS is not managing DHCP for the nodes, how do they get an address?
If the nodes come up OK (with DHCP allocated addresses), but containers do not, then it's most likely an issue on the juju side, not maas.
Can you provide and attack here the logs from /var/log/juju/*?
What's your environments.yaml contents (scrubbed of secrets, if any please)?
| james beedy (jamesbeedy) wrote : | #6 |
Dimiter,
Thanks for asking. DHCP is being served on 10.16.100.0/24 alongside other networks by a set of pfsense servers. DNS is served internally by a bind9 cluster with an authoritative master and two slaves. When my dhcp server hands out an ip address to a client it also updates my authoritative DNS server with the appropriate forward and reverse records for the zone, following which, the authoritative master transfers the zone to each of the slaves. I obtain successful 'host' queries by hostname, fqdn, and ip address immediately after a node is issued its ip address by the dhcp sever in enlistment, commissioning, and deploy phases.
I can deploy a test momentarily to reproduce and grab some more insight.... My environments.yaml is as follows:
default: maas
environments:
maas:
type: maas
maas-server: 'http://
maas-oauth: ***
admin-secret: "***"
default-series: trusty
authorized-
apt-http-proxy: 'http://
lxc-clone: true
bootstrap-
no-proxy: localhost,
| james beedy (jamesbeedy) wrote : | #7 |
As a test, using the above environments.yaml from an un-bootstraped env, I ran a "juju bootstrap -e maas", then "juju add-machine ceph-mon-
At this point "juju status" shows: http://
Next I run "juju deploy mysql --to lxc:1", after which my juju status indicates: http://
bootstrap node:
all-
machine-0.log: http://
machine 1:
machine-0.log: http://
maas-server:
maas.log: http://
| james beedy (jamesbeedy) wrote : | #8 |
I feel like this is at the heart of the issue:
ERROR: certificate common name ''*'' doesn''t match requested host name ''10.16.100.58''.; To connect to 10.16.100.58 insecurely, use `--no-check-
| james beedy (jamesbeedy) wrote : | #9 |
/var/log/
| james beedy (jamesbeedy) wrote : | #10 |
per ^ it looks like tftp is only being served on the default virsh net of 192.168.
| Dimiter Naydenov (dimitern) wrote : | #11 |
That indeed seems the core of the problem - not being able to download the cloud image for the lxc container. Additionally, looking at the logs it seems odd thet API hostports contain the same address 4 times - ceph-osd-
The error about the certificate leads me to think the certupdater worker hasn't done its job to recreate the certificate to include the bootstrap node's addresses (rather than the default '*').
Why do you need to specify the MAAS cluster controller as APT proxy? MAAS already does that transparently (when it works - I had to change the squid proxy config on my MAAS to make it work on port 8000). You might try dropping the apt-http-proxy and no-proxy lines from your environments.yaml and retry.
To simplify the lxc deployment process, also try using lxc-clone: false to see if that makes a difference.
Logs from both machines show errors trying to set instance status, which is disturbing, but I doubt that's the last thing in the log.
In any case, try the suggestions with logging-config: <root>=TRACE and pass --debug to bootstrap to get more context.
Please note with debug and trace logging there *will be* secrets/keys in the logs you'll want to scrub before attaching.
| tags: | added: network |
| tags: | added: bug-squad |
| Changed in juju-core: | |
| milestone: | 1.25-alpha1 → 1.25-beta1 |
| Alexis Bruemmer (alexis-bruemmer) wrote : | #12 |
Please provide the logs requested in comment #11
| Changed in juju-core: | |
| status: | Triaged → Incomplete |
| Changed in juju-core: | |
| milestone: | 1.25-beta1 → 1.25-beta2 |
| Changed in juju-core: | |
| milestone: | 1.25-beta2 → 1.25.1 |
| Changed in juju-core: | |
| milestone: | 1.25.1 → 1.26.0 |
| james beedy (jamesbeedy) wrote : | #13 |
My apologies for leaving this bug unattended. I haven't reproduced the environment (including the way dhcp,dns, and maas were configured together) since I abandoned the configuration detailed above. That being said, I would like to help get this functionality working to some degree...... I intend on setting up a virtual stack dedicated to following up on this. I'll report back soon! Thanks!
| Cheryl Jennings (cherylj) wrote : | #14 |
I wonder if the lxc wget cert issues are related to bug #1512782
| Changed in juju-core: | |
| milestone: | 1.26.0 → 2.0-beta5 |
| Changed in juju-core: | |
| milestone: | 2.0-beta5 → 2.0-beta4 |
| Changed in juju-core: | |
| milestone: | 2.0-beta4 → none |
| Launchpad Janitor (janitor) wrote : | #15 |
[Expired for juju-core because there has been no activity for 60 days.]
| Changed in juju-core: | |
| status: | Incomplete → Expired |


James, there should be some log files for the containers on the MAAS provisioned machines here:
/var/ lib/juju/ containers/ <container- name>/. ..
Can we get those added to this bug?