Activity log for bug #1609766

Date Who What changed Old value New value Message
2016-08-04 12:05:30 Kirill Bespalov bug added bug
2016-08-04 12:05:30 Kirill Bespalov attachment added nova-compute.log https://bugs.launchpad.net/bugs/1609766/+attachment/4713977/+files/nova-compute.log
2016-08-04 12:05:56 Kirill Bespalov attachment added rabbitmq.log.3.gz https://bugs.launchpad.net/oslo.messaging/+bug/1609766/+attachment/4713978/+files/rabbitmq.log.3.gz
2016-08-04 12:06:31 Kirill Bespalov oslo.messaging: assignee Kirill Bespalov (k-besplv)
2016-08-04 12:56:22 OpenStack Infra oslo.messaging: status New In Progress
2016-08-04 16:19:09 Dmitry Mescheryakov bug added subscriber Dmitry Mescheryakov
2016-08-11 13:09:14 Dmitry Mescheryakov description Steps to reproduce: 1. Create a load on OpenStack, trigger restart of one of RabbitMQ nodes (exact reason unknown). 2. Observe a lot of operation basic.publish caused a channel exception not_found: "no exchange 'reply_d8786e66456a4660bebb362668a027e4' in vhost '/'" entries in RabbitMQ log for various reply queues. Looking earlier in the RabbitMQ log one can found 2016-08-03T13:33:12.945437+00:00 notice: operation queue.declare caused a channel exception not_found: "failed to perform operation on queue 'reply_d8786e66456a4660bebb362668a027e4' in vhost '/' due to timeout" 2016-08-03T13:34:43.006600+00:00 notice: operation queue.bind caused a channel exception not_found: "no exchange 'reply_d8786e66456a4660bebb362668a027e4' in vhost '/'" The following stacktrace corresponds to the later message in nova-compute.log: http://paste.openstack.org/show/548803/ It seems that during RabbitMQ failover we might end up with declared queue, which is not bound to an exchange (there an exception listed in paste above is thrown). Later oslo.messaging successfully starts consuming from that queue, since the queue exists, but it is useless because it is not bound to an exchange. You may find the whole logs containing snippets from above attached - nova-compute.log and rabbitmq.log.3.gz Steps to reproduce: 1. Create a load on OpenStack, trigger restart of one of RabbitMQ nodes (exact reason unknown). 2. Observe a lot of operation basic.publish caused a channel exception not_found: "no exchange 'reply_d8786e66456a4660bebb362668a027e4' in vhost '/'"    entries in RabbitMQ log for various reply queues. Looking earlier in the RabbitMQ log one can found 2016-08-03T13:33:12.945437+00:00 notice: operation queue.declare caused a channel exception not_found: "failed to perform operation on queue 'reply_d8786e66456a4660bebb362668a027e4' in vhost '/' due to timeout" 2016-08-03T13:34:43.006600+00:00 notice: operation queue.bind caused a channel exception not_found: "no exchange 'reply_d8786e66456a4660bebb362668a027e4' in vhost '/'" The following stacktrace corresponds to the later message in nova-compute.log: http://paste.openstack.org/show/548803/ It seems that during RabbitMQ failover we might end up with declared queue, which is not bound to an exchange (there an exception listed in paste above is thrown). Later oslo.messaging successfully starts consuming from that queue, since the queue exists, but it is useless because it is not bound to an exchange. You may find the whole logs containing snippets from above attached - nova-compute.log and rabbitmq.log.3.gz Versions: oslo.messaging - stable/mitaka RabbitMQ - 3.6.1 kombu 3.0.32 pyamqp 1.4.8
2016-09-06 09:02:53 OpenStack Infra oslo.messaging: assignee Kirill Bespalov (k-besplv) Mehdi Abaakouk (sileht)
2016-09-12 10:27:59 OpenStack Infra oslo.messaging: assignee Mehdi Abaakouk (sileht) Kirill Bespalov (k-besplv)
2016-09-12 15:46:05 Gustavo Randich bug added subscriber Gustavo Randich
2016-09-12 17:01:38 OpenStack Infra oslo.messaging: status In Progress Fix Released
2016-10-20 14:38:15 OpenStack Infra tags in-stable-newton
2016-10-27 10:40:30 OpenStack Infra tags in-stable-newton in-stable-mitaka in-stable-newton