neutron_dhcp_server is running on different controller than shown in crm status, metadata server will not run
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Invalid
|
High
|
Fuel Library (Deprecated) |
Bug Description
fuel 6.0
3 controller setup running into the following:
Issue:
If the node running my private VLAN network's dhcp/metadata server is restarted the dhcp server will move to another node in the cluster, However if the dhcp server reported as master is not that node, the metadata server will not be listening. Additionally, rebooting the new master will place the master on previous node, but it will not work. rebooting both simultaneously will resolve.
Symptom:
Instantiating a system in horizon will fail to inject ssh keypair and repetitively display in logs:
2015-06-29 20:35:14,099 - url_helper.
Steps to reproduce:
1. Find controller with dhcp/metadata server listening
2. Reboot
3. try to instantiate host and fail.
snippets:
Online: [ node-1 node-2 node-3 ]
p_neutron-
root@node-1:~# ip netns show
haproxy
root@node-2:~# ip netns show
qdhcp-54d9418f-
haproxy
root@node-2:~# ip netns exec qdhcp-54d9418f-
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address Foreign Address State PID/Program name
tcp 0 0 0.0.0.0:80 0.0.0.0:* LISTEN 43435/python
tcp 0 0 10.3.17.26:53 0.0.0.0:* LISTEN 43386/dnsmasq
tcp 0 0 169.254.169.254:53 0.0.0.0:* LISTEN 43386/dnsmasq
tcp6 0 0 fe80::f816:
root@node-3:~# ip netns show
haproxy
Instances create successfully.
---reboot node-2---
Online: [ node-1 node-2 node-3 ]
p_neutron-
root@node-1:~# ip netns show
haproxy
root@node-2:~# ip netns show
qdhcp-652b83ac-
haproxy
root@node-2:~# ip netns exec qdhcp-652b83ac-
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address Foreign Address State PID/Program name
tcp 0 0 10.3.19.21:53 0.0.0.0:* LISTEN 27238/dnsmasq
tcp 0 0 169.254.169.254:53 0.0.0.0:* LISTEN 27238/dnsmasq
tcp6 0 0 fe80::f816:
root@node-3:~# ip netns show
qdhcp-652b83ac-
haproxy
root@node-3:~# ip netns exec qdhcp-652b83ac-
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address Foreign Address State PID/Program name
tcp 0 0 0.0.0.0:80 0.0.0.0:* LISTEN 31665/python
tcp 0 0 10.3.19.22:53 0.0.0.0:* LISTEN 31603/dnsmasq
tcp 0 0 169.254.169.254:53 0.0.0.0:* LISTEN 31603/dnsmasq
tcp6 0 0 fe80::f816:
Instances fail to create correctly (dhcp was provided by node-2 which has no listening metadata-server; host found in cat /var/lib/
---reboot node-2---
cluster says node-3 is dhcp
node-2 retains dhcp server but no metadata server
node-3 does not have dhcp servers running.
Instances fail to create correctly (as expected)
--reboot node-3--
no change. 3 still listed as master. 2 retains dhcp server but no metadata server
--reboot both nodes--
node-2 now listed as dhcp master.
node-2 has metadata server listening
can instantiate instances
* node-1 is master in this scenario, however this was iterated in a similar fashion with differnent master, dhcp master node combos with no change in this strange behavior.
The only manual edit was to set "enable_
Is this a bug or am i just completely missing something.
I'm new to openstack, and believe i searched thoroughly , but likely have not
Tim
Changed in fuel: | |
status: | Confirmed → Incomplete |
milestone: | 7.0 → 6.0.2 |
Changed in fuel: | |
assignee: | Fuel Library Team (fuel-library) → Aleksandr Didenko (adidenko) |
Changed in fuel: | |
milestone: | 6.0.2 → 6.0.1 |
Hi, please attach diagnostic snapshot.