MAAS should allow active DHCP probing to be disabled

Bug #1637256 reported by Stefan Bader
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
MAAS
Invalid
Medium
Unassigned
maas (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

Release: Xenial/16.04
MaaS version: 2.0.0+bzr5189-0ubuntu1~16.04.1

Local setup:

Internet -> 192.168.2.1/24 (GW) -> 192.168.2.19/24 (MaaS em0)
                                   192.168.123.1 (MaaS em0.10)

In the end all interfaces on the MaaS host were static but before that the untagged interface had been DHCP. For that reason there was a host section in the DHCP server on the GW host that assigned a define IP address to it.

With MaaS 2.0 the rack service will issue DHCP discover requests on all detected interfaces every 10 minutes. The GW hosts tries to reply with an address but doing so causes connectivity between GW and MaaS to be disrupted. SSH connections to MaaS managed nodes in the 192.168.123.0/24 subnet do even time out (that could be because the DHCP server for that subnet, which is managed by MaaS, is configured with a fix IP reply the same way as the one on the GW host is).

For now I was able to work around this by adding "ignore bootp; ignore booting;" statements to the DHCP host section on the GW host. That way I loose the protection against misconfiguration but at least got a stable connection between GW and MaaS.

Revision history for this message
Mike Pontillo (mpontillo) wrote :

It's strange that connectivity is interrupted when the DHCP discovery is attempted. It's not clear to me why connectivity should be interrupted at all. (For example, if an ARP entry is being cleared out, it should be added back relatively quickly.) Right now I don't know what could be changed in MAAS to make this better; I think to make progress on this issue, we need to determine exactly why connectivity is disrupted in this case.

It would be helpful to get a packet capture during the probe (and subsequent connectivity loss) to get a better idea about what's going on.

Note that the behavior changed drastically in MAAS 2.1, so it might be different if you re-test.

Changed in maas:
status: New → Incomplete
Changed in maas (Ubuntu):
status: New → Incomplete
Revision history for this message
Mike Pontillo (mpontillo) wrote :

For reference, here's the bug (and related merge) that triggered a lot of changes to this code in MAAS 2.1.

https://bugs.launchpad.net/maas/+bug/1628645

Revision history for this message
Mike Pontillo (mpontillo) wrote :

While the root cause of this issue is still incomplete, I think it would be good to allow users to disable active DHCP probing. We have other active network scanning methods in MAAS 2.1, but those can all be disabled with a global setting (or on an individual subnet).

summary: - MaaS might disrupt network connectivity by probing DHCP services
+ MAAS should allow active DHCP probing to be disabled
Changed in maas:
status: Incomplete → Triaged
importance: Undecided → Medium
Revision history for this message
Stefan Bader (smb) wrote :

I re-enabled dhcp replies on my (external to MaaS) DHCP server after upgrading to MaaS 2.1. This has been running for a few hours now without experiencing problems. So the initial issue has gone. This report could be closed or used for tracking the implementation of the probing switch.

Changed in maas (Ubuntu):
status: Incomplete → Confirmed
Revision history for this message
Adam Collard (adam-collard) wrote :

This bug has not seen any activity in the last 6 months, so it is being automatically closed.

If you are still experiencing this issue, please feel free to re-open.

MAAS Team

Changed in maas:
status: Triaged → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.