Keepalived's VRRP child process is constantly dying and respawning on the controller
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo | Invalid | Undecided | Unassigned | | |
Bug Description
The new IPv6 gate jobs time out during the overcloud deployment.
In /var/log/messages on the controller, the following error repeats several times per second:
Mar 17 08:52:53 localhost Keepalived[6947]: VRRP child process(8100) died: Respawning
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived_
Mar 17 08:52:53 localhost Keepalived[6947]: VRRP child process(8101) died: Respawning
[same repeats with different pids over and over]
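To gauge how fast the child is respawning, the parent-process messages can be counted per timestamp. The following is a minimal sketch, assuming the syslog line format shown in the excerpt above; the function name `count_respawns` and the regular expression are illustrative and not part of keepalived or TripleO.

```python
import re
from collections import Counter

# Matches the parent-process message seen in the log excerpt, e.g.
# "Mar 17 08:52:53 localhost Keepalived[6947]: VRRP child process(8100) died: Respawning"
RESPAWN_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+\S+\s+Keepalived\[\d+\]: "
    r"VRRP child process\((?P<pid>\d+)\) died: Respawning"
)


def count_respawns(lines):
    """Return a Counter mapping syslog timestamp -> number of respawn messages."""
    counts = Counter()
    for line in lines:
        match = RESPAWN_RE.match(line)
        if match:
            counts[match.group("ts")] += 1
    return counts


# Two lines copied from the excerpt above.
sample = [
    "Mar 17 08:52:53 localhost Keepalived[6947]: VRRP child process(8100) died: Respawning",
    "Mar 17 08:52:53 localhost Keepalived[6947]: VRRP child process(8101) died: Respawning",
]
respawns = count_respawns(sample)
```

On a real controller, passing an open file object for /var/log/messages to `count_respawns` should work the same way, since the function only iterates over lines.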
These were the deployment arguments:
OVERCLOUD_
Full logs available here: http://
It happened during a gate job for: https:/
I can reproduce the issue locally. In keepalived.conf, some vrrp_instance blocks appear to have an invalid configuration, e.g.:
vrrp_instance 53 { fd00:fd00: 2000::11 dev
virtual_router_id 53
# Advert interval
advert_int 1
# for electing MASTER, highest priority wins.
priority 101
state MASTER
interface
virtual_ipaddress {
fd00:
}
track_script {
haproxy
}
}
Note that the "interface" directive has no value.
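A quick way to find every affected block is to scan keepalived.conf for vrrp_instance sections whose "interface" line is empty or absent. This is a rough sketch rather than a full keepalived parser: it assumes the opening brace sits on the same line as `vrrp_instance`, and the helper name `instances_missing_interface`, the address `fd00::1`, and the interface `eth0` in the sample are hypothetical.

```python
import re


def instances_missing_interface(conf_text):
    """Return names of vrrp_instance blocks with no usable `interface` value."""
    missing = []
    current = None          # name of the vrrp_instance being scanned, if any
    depth = 0               # brace nesting depth inside the current instance
    has_interface = False
    for raw in conf_text.splitlines():
        # keepalived treats both '#' and '!' as comment markers.
        line = re.split(r"[#!]", raw, maxsplit=1)[0].strip()
        if not line:
            continue
        if current is None:
            m = re.match(r"vrrp_instance\s+(\S+)\s*\{", line)
            if m:
                current, depth, has_interface = m.group(1), 1, False
            continue
        # Only an `interface <name>` line at the top level of the block counts.
        if depth == 1:
            tokens = line.split()
            if tokens[0] == "interface" and len(tokens) >= 2:
                has_interface = True
        depth += line.count("{") - line.count("}")
        if depth <= 0:
            if not has_interface:
                missing.append(current)
            current = None
    return missing


# Instance 53 mirrors the broken block above; instance 54 is a made-up valid one.
sample = """
vrrp_instance 53 {
  virtual_router_id 53
  advert_int 1
  priority 101
  state MASTER
  interface
  virtual_ipaddress {
    fd00::1
  }
  track_script {
    haproxy
  }
}
vrrp_instance 54 {
  interface eth0
  virtual_router_id 54
}
"""
broken = instances_missing_interface(sample)
```

Running the same function over the contents of the real keepalived.conf would list the instance names that need an interface assigned.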