Ceilometer

Bug #1434322
Activity log

Activity log for bug #1434322

Date	Who	What changed	Old value	New value	Message
2015-03-20 00:22:11	Rohit Jaiswal	bug			added bug
2015-03-20 00:22:16	Rohit Jaiswal	ceilometer: assignee		Rohit Jaiswal (rohit-jaiswal-3)
2015-03-20 00:25:01	Rohit Jaiswal	summary	Collector keeps on requeueing a message in case of a persistent error from dispatcher when requeueing is enabled	Collector keeps on requeueing and retrying a message in case of a persistent error from dispatcher when requeueing is enabled
2015-03-20 00:25:34	Rohit Jaiswal	description	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In such scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), since the message and potentially more samples keep getting requeued and not cleared from the queue. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error.	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In this scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), since the message and potentially more samples keep getting requeued and not cleared from the queue. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error.
2015-03-20 00:27:37	Rohit Jaiswal	description	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In this scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), since the message and potentially more samples keep getting requeued and not cleared from the queue. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error.	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In this scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), since the message and potentially more samples keep getting requeued and not cleared from the queue. It does not make sense to keep retrying continuously in case of a persistent error condition. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error.
2015-03-20 00:29:21	Rohit Jaiswal	description	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In this scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), since the message and potentially more samples keep getting requeued and not cleared from the queue. It does not make sense to keep retrying continuously in case of a persistent error condition. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error.	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In this scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), since the message and potentially more samples keep getting requeued and not cleared from the queue. It does not make sense to keep retrying continuously in case of a persistent error condition. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error. eg. requeue_sample_on_dispatcher_error_max_retries
2015-03-20 02:29:29	Rohit Jaiswal	description	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In this scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), since the message and potentially more samples keep getting requeued and not cleared from the queue. It does not make sense to keep retrying continuously in case of a persistent error condition. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error. eg. requeue_sample_on_dispatcher_error_max_retries	When requeue_sample_on_dispatcher_error is enabled in ceilometer.conf, the collector will try to requeue the sample if it gets an error from the dispatcher. When the sample gets requeued, it will get picked up again and dispatched to the storage layer. If the same underlying error condition prevails, the error will be raised back to the collector and the message will be requeued again. This cyclical process will continue until the error is gone(in which case the sample is not requeued again) or the collector/ RMQ are restarted. In this scenario, when there is an persistent error condition, frequent retrying puts extra load on storage db and the messaging layer (rabbit), wasting collector CPU cycles since the message and potentially more samples keep getting requeued and not cleared from the queue. It does not make sense to keep retrying continuously in case of a persistent error condition. There should be a configurable upper limit to cap the number of requeues of samples/events by Collector in case of dispatcher error. eg. requeue_sample_on_dispatcher_error_max_retries
2015-03-20 04:52:42	Rohit Jaiswal	summary	Collector keeps on requeueing and retrying a message in case of a persistent error from dispatcher when requeueing is enabled	Collector continuously re-queues sample when dispatcher reports persistent error when requeue is enabled
2015-03-23 15:12:56	gordon chung	ceilometer: status	New	Triaged
2016-04-23 02:47:58	ZhiQiang Fan	ceilometer: status	Triaged	New
2016-04-23 02:48:00	ZhiQiang Fan	ceilometer: assignee	Rohit Jaiswal (rohit-jaiswal-3)
2016-09-01 18:22:39	Milan Potdar	bug			added subscriber Milan Potdar
2016-11-02 14:10:15	gordon chung	ceilometer: status	New	Triaged
2016-11-02 14:10:19	gordon chung	ceilometer: importance	Undecided	Medium
2017-06-15 12:32:58	gordon chung	ceilometer: status	Triaged	Won't Fix