As a result of failures, RabbitMQ created a huge log file (~3 GB). rsyslog processes this file very slowly, which causes a long delay between the moment a message is generated and the moment it is written to the files on the Fuel master node. Even a week after the issue, syslog is still processing old messages.
On average, rsyslog takes about 3.5 hours to process a 100 MB file:
[root@fuel ~]# ls -al rabbitmq.log.1
-rw-------. 1 root root 108237878 May 10 13:25 rabbitmq.log.1
[root@fuel ~]# head rabbitmq.log.1
2016-05-10T07:30:35.388293+00:00 notice: =ERROR REPORT==== 4-May-2016::00:18:48 ===
2016-05-10T07:30:35.388293+00:00 notice: Channel error on connection <0.9727.15> (192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), channel 1:
2016-05-10T07:30:35.388293+00:00 notice: operation basic.consume caused a channel exception not_found: "no queue 'reply_611b2b8a718049219bf62ca37ef9d906' in vhost '/'"
2016-05-10T07:30:35.388293+00:00 notice: =ERROR REPORT==== 4-May-2016::00:18:48 ===
2016-05-10T07:30:35.388293+00:00 notice: Channel error on connection <0.9727.15> (192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), channel 1:
2016-05-10T07:30:35.388293+00:00 notice: operation basic.consume caused a channel exception not_found: "no queue 'reply_611b2b8a718049219bf62ca37ef9d906' in vhost '/'"
2016-05-10T07:30:35.388293+00:00 notice: =ERROR REPORT==== 4-May-2016::00:18:48 ===
2016-05-10T07:30:35.388293+00:00 notice: Channel error on connection <0.9727.15> (192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), channel 1:
2016-05-10T07:30:35.388293+00:00 notice: operation basic.consume caused a channel exception not_found: "no queue 'reply_611b2b8a718049219bf62ca37ef9d906' in vhost '/'"
2016-05-10T07:30:35.388293+00:00 notice: =ERROR REPORT==== 4-May-2016::00:18:48 ===
[root@fuel ~]# tail rabbitmq.log.1
2016-05-10T11:00:38.001870+00:00 notice: Channel error on connection <0.9727.15> (192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), channel 1:
2016-05-10T11:00:38.001870+00:00 notice: operation basic.consume caused a channel exception not_found: "no queue 'reply_611b2b8a718049219bf62ca37ef9d906' in vhost '/'"
2016-05-10T11:00:38.001876+00:00 notice: =ERROR REPORT==== 4-May-2016::00:23:56 ===
2016-05-10T11:00:38.001876+00:00 notice: Channel error on connection <0.9727.15> (192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), channel 1:
2016-05-10T11:00:38.001882+00:00 notice: operation basic.consume caused a channel exception not_found: "no queue 'reply_611b2b8a718049219bf62ca37ef9d906' in vhost '/'"
2016-05-10T11:00:38.001882+00:00 notice: =ERROR REPORT==== 4-May-2016::00:23:56 ===
2016-05-10T11:00:38.001889+00:00 notice: Channel error on connection <0.9727.15> (192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), channel 1:
2016-05-10T11:00:38.001889+00:00 notice: operation basic.consume caused a channel exception not_found: "no queue 'reply_611b2b8a718049219bf62ca37ef9d906' in vhost '/'"
2016-05-10T11:00:38.001894+00:00 notice: =ERROR REPORT==== 4-May-2016::00:23:56 ===
2016-05-10T11:00:38.001894+00:00 notice: Channel error on connection <0.9727.15> (192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), channel 1:
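The figures above can be cross-checked from the excerpt itself. The sketch below is only a rough estimate and rests on two assumptions that the report does not state explicitly: that the leading ISO 8601 timestamp on each line is the time rsyslog wrote the line on the Fuel master node, and that the time inside "=ERROR REPORT====" is the time RabbitMQ generated the message, in UTC. The helper names written_at and generated_at are made up for this illustration.

```python
from datetime import datetime, timezone

FILE_SIZE_BYTES = 108237878  # size of rabbitmq.log.1 taken from the `ls -al` output

# First line of `head` and last line of `tail`, copied from the excerpt.
# The last line is shortened here; only its leading timestamp is used.
FIRST_LINE = ("2016-05-10T07:30:35.388293+00:00 notice: "
              "=ERROR REPORT==== 4-May-2016::00:18:48 ===")
LAST_LINE = ("2016-05-10T11:00:38.001894+00:00 notice: "
             "Channel error on connection <0.9727.15>")


def written_at(line):
    """Leading timestamp of a line (assumed: when rsyslog wrote it on the master)."""
    return datetime.fromisoformat(line.split(" ", 1)[0])


def generated_at(line):
    """Time embedded in an ERROR REPORT line (assumed: RabbitMQ time, in UTC)."""
    stamp = line.split("==== ", 1)[1].split(" ===", 1)[0]
    parsed = datetime.strptime(stamp, "%d-%b-%Y::%H:%M:%S")
    return parsed.replace(tzinfo=timezone.utc)


# Wall-clock time rsyslog needed to write the whole file.
elapsed_seconds = (written_at(LAST_LINE) - written_at(FIRST_LINE)).total_seconds()

# Average throughput over the file.
bytes_per_second = FILE_SIZE_BYTES / elapsed_seconds

# Delay between message generation and arrival for the first line.
lag = written_at(FIRST_LINE) - generated_at(FIRST_LINE)

print(f"elapsed:    {elapsed_seconds / 3600:.2f} h")
print(f"throughput: {bytes_per_second / 1024:.2f} KiB/s")
print(f"lag:        {lag}")
```

Under these assumptions the throughput comes out at roughly 8.4 KiB/s, and the first message in the file reached the master a little over six days after it was generated.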
Steps to reproduce:
1. Emulate a large RabbitMQ log file by creating a file similar to the one that RabbitMQ writes to "/var/log/rabbitmq/rabbit@messaging-node-XXX". The file should be hundreds of MB in size.
2. Check how syslog processes the messages and how quickly they are copied to the Fuel master node.
Note: This bug impacts supportability because the logs are not available on the Fuel master node and are therefore missing from the snapshot.
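One possible way to produce the file for step 1 is sketched below. It simply repeats the three-line error report seen in the excerpt, with the syslog prefix removed, until the requested size is reached. The function name generate_rabbitmq_like_log and its parameters are hypothetical, and the real RabbitMQ log may differ in details such as blank lines between reports, so treat this as an approximation rather than an exact reproduction of the original file.

```python
# The three-line error report that repeats in the excerpt, without the
# "<timestamp> notice: " prefix that syslog adds on the master node.
REPORT_LINES = (
    "=ERROR REPORT==== 4-May-2016::00:18:48 ===\n"
    "Channel error on connection <0.9727.15> "
    "(192.168.0.49:48984 -> 192.168.0.88:5673, vhost: '/', user: 'nova'), "
    "channel 1:\n"
    "operation basic.consume caused a channel exception not_found: "
    "\"no queue 'reply_611b2b8a718049219bf62ca37ef9d906' in vhost '/'\"\n"
)


def generate_rabbitmq_like_log(path, target_bytes, reports_per_write=1000):
    """Write repeated error reports to `path` until at least `target_bytes`.

    Returns the number of bytes actually written. The file is always a whole
    number of reports, so it may be slightly larger than `target_bytes`.
    """
    chunk = (REPORT_LINES * reports_per_write).encode("utf-8")
    written = 0
    with open(path, "wb") as log_file:
        while written < target_bytes:
            log_file.write(chunk)
            written += len(chunk)
    return written
```

For example, generate_rabbitmq_like_log("/tmp/rabbit-test.log", 300 * 1024 * 1024) writes about 300 MB to a scratch path chosen only for illustration. The file then has to be placed where the node's syslog configuration picks it up, which depends on the deployment.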
[root@fuel ~]# fuel2 fuel-version
api: '1'
auth_required: true
feature_groups: []
openstack_version: mitaka-9.0
release: '9.0'