Comment 2 for bug 1960587

Dwane Pottratz (dpcsar) wrote: Re: comment out 127.0.1.1 hostname for galera

Hi Dmitriy,

The bug is not in galera. It is in rabbitmq. Not sure if you want me to open a new bug or modify this one.

I am working with the stable/xena branch.

openstack_user_config.yml contains 'no_containers: true'.
The bug does not reproduce in containers.

infra1 /etc/hosts file:
127.0.0.1 localhost
127.0.1.1 infra1

# The following lines are desirable for IPv6 capable hosts
::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
### BEGIN OPENSTACK-ANSIBLE MANAGED BLOCK ###
192.168.2.21 infra1.home infra1
192.168.2.22 infra2.home infra2
192.168.2.23 infra3.home infra3
192.168.2.20 haproxy0.home haproxy0
### END OPENSTACK-ANSIBLE MANAGED BLOCK ###
192.168.2.21 infra1 infra1
192.168.2.22 infra2 infra2
192.168.2.23 infra3 infra3

Here is the output from running the playbook rabbitmq-install.yml:
TASK [rabbitmq_server : Ensure RabbitMQ node [0] is started] *******************
fatal: [infra1]: FAILED! => {"changed": false, "msg": "Unable to start service rabbitmq-server: Job for rabbitmq-server.service failed because the control process exited with error code.\nSee \"systemctl status rabbitmq-server.service\" and \"journalctl -xe\" for details.\n"}

Here is the log file on the server:
-- The job identifier is 17034 and the job result is done.
Feb 11 07:42:43 infra1 systemd[1]: Starting RabbitMQ broker...
-- Subject: A start job for unit rabbitmq-server.service has begun execution
-- Defined-By: systemd
-- Support: http://www.ubuntu.com/support
--
-- A start job for unit rabbitmq-server.service has begun execution.
--
-- The job identifier is 17034.
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:43.688676-08:00 [erro] <0.130.0>
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:43.688676-08:00 [erro] <0.130.0> BOOT FAILED
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:43.688676-08:00 [erro] <0.130.0> ===========
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:43.688676-08:00 [erro] <0.130.0> ERROR: epmd error for host infra1: address (cannot connect to host/port)
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:43.688676-08:00 [erro] <0.130.0>
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: BOOT FAILED
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: ===========
Feb 11 07:42:43 infra1 rabbitmq-server[63163]: ERROR: epmd error for host infra1: address (cannot connect to host/port)
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> supervisor: {local,rabbit_prelaunch_sup}
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> errorContext: start_error
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> reason: {epmd_error,"infra1",address}
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> offender: [{pid,undefined},
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> {id,prelaunch},
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> {mfargs,{rabbit_prelaunch,run_prelaunch_first_phase,[]}},
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> {restart_type,transient},
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> {significant,false},
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> {shutdown,5000},
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0> {child_type,worker}]
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693325-08:00 [erro] <0.130.0>
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> crasher:
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> initial call: application_master:init/4
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> pid: <0.128.0>
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> registered_name: []
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> exception exit: {{shutdown,
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> {failed_to_start_child,prelaunch,
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> {epmd_error,"infra1",address}}},
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> {rabbit_prelaunch_app,start,[normal,[]]}}
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> in function application_master:init/4 (application_master.erl, line 142)
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> ancestors: [<0.127.0>]
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> message_queue_len: 1
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> messages: [{'EXIT',<0.129.0>,normal}]
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> links: [<0.127.0>,<0.44.0>]
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> dictionary: []
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> trap_exit: true
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> status: running
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> heap_size: 376
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> stack_size: 29
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> reductions: 188
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0> neighbours:
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.693679-08:00 [erro] <0.128.0>
Feb 11 07:42:44 infra1 rabbitmq-server[63163]: 2022-02-11 07:42:44.697587-08:00 [noti] <0.44.0> Application rabbitmq_prelaunch exited with reason: {{shutdown,{failed_to_start_child,prelaunch,{epmd_error,"infra1",address}}},{rabbit_p>
Feb 11 07:42:46 infra1 rabbitmq-server[63163]: {"Kernel pid terminated",application_controller,"{application_start_failure,rabbitmq_prelaunch,{{shutdown,{failed_to_start_child,prelaunch,{epmd_error,\"infra1\",address}}},{rabbit_prel>
Feb 11 07:42:46 infra1 rabbitmq-server[63163]: Kernel pid terminated (application_controller) ({application_start_failure,rabbitmq_prelaunch,{{shutdown,{failed_to_start_child,prelaunch,{epmd_error,"infra1",address}}},{rabbit_prelaun>
Feb 11 07:42:46 infra1 rabbitmq-server[63163]:
Feb 11 07:42:46 infra1 rabbitmq-server[63163]: Crash dump is being written to: erl_crash.dump...done
Feb 11 07:42:46 infra1 systemd[1]: rabbitmq-server.service: Main process exited, code=exited, status=1/FAILURE
-- Subject: Unit process exited
-- Defined-By: systemd
-- Support: http://www.ubuntu.com/support

**** Sorry, the text copied from the console doesn't capture whole lines; a trailing '>' marks where a line was cut off. ****

If I remove the line '127.0.1.1 infra1' from /etc/hosts (here and on the other hosts) and then run rabbitmq-install.yml again, it passes.
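
Since the workaround is removing a loopback mapping for the node's own hostname, a quick check for such entries before running the playbook may help. This is only a minimal sketch: the function name `loopback_entries` and the sample text are made up for illustration and are not part of OSA, and it assumes the standard /etc/hosts layout of an address followed by names.

```python
import ipaddress


def loopback_entries(hosts_text, hostname):
    """Return (line_number, address) pairs where `hostname` maps to a loopback address."""
    hits = []
    for lineno, raw in enumerate(hosts_text.splitlines(), start=1):
        # Drop trailing comments and skip blank or comment-only lines.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        fields = line.split()
        address, names = fields[0], fields[1:]
        try:
            ip = ipaddress.ip_address(address)
        except ValueError:
            # Not a parseable IP address; ignore the line.
            continue
        if ip.is_loopback and hostname in names:
            hits.append((lineno, address))
    return hits


# Hypothetical sample, shortened from the hosts file quoted above.
SAMPLE = """\
127.0.0.1 localhost
127.0.1.1 infra1
192.168.2.21 infra1.home infra1
"""

if __name__ == "__main__":
    print(loopback_entries(SAMPLE, "infra1"))
```

On a real host, the text could be read from /etc/hosts and the hostname taken from `socket.gethostname()`.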

Another thing that I noticed is the added lines in /etc/hosts:
192.168.2.21 infra1 infra1
192.168.2.22 infra2 infra2
192.168.2.23 infra3 infra3

I did not add those. OSA did, but they don't look like they are managed by Ansible, because there are no marker lines around them.
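
To make entries like these easier to spot, here is a rough sketch that lists lines outside the managed block that reuse a name defined inside it. The marker strings are copied from the hosts file above; the function name `unmanaged_duplicates` and the sample are hypothetical, and this is not how OSA itself tracks the file.

```python
BEGIN = "### BEGIN OPENSTACK-ANSIBLE MANAGED BLOCK ###"
END = "### END OPENSTACK-ANSIBLE MANAGED BLOCK ###"


def unmanaged_duplicates(hosts_text):
    """Return lines outside the managed block that reuse a name defined inside it."""
    managed_names = set()
    outside = []
    inside = False
    for raw in hosts_text.splitlines():
        stripped = raw.strip()
        if stripped == BEGIN:
            inside = True
            continue
        if stripped == END:
            inside = False
            continue
        # Drop comments; a usable entry needs an address and at least one name.
        entry = stripped.split("#", 1)[0].split()
        if len(entry) < 2:
            continue
        if inside:
            managed_names.update(entry[1:])
        else:
            outside.append((stripped, entry[1:]))
    return [line for line, names in outside if managed_names.intersection(names)]


# Hypothetical sample, shortened from the hosts file quoted above.
HOSTS_SAMPLE = """\
127.0.0.1 localhost
127.0.1.1 infra1
### BEGIN OPENSTACK-ANSIBLE MANAGED BLOCK ###
192.168.2.21 infra1.home infra1
192.168.2.22 infra2.home infra2
### END OPENSTACK-ANSIBLE MANAGED BLOCK ###
192.168.2.21 infra1 infra1
192.168.2.22 infra2 infra2
"""

if __name__ == "__main__":
    for line in unmanaged_duplicates(HOSTS_SAMPLE):
        print(line)
```

With the sample above, this flags both the '127.0.1.1 infra1' line and the unmarked lines after the END marker.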