Heka decoder and filter metrics aren't collected

Bug #1550229 reported by Simon Pasquier
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
StackLight
Confirmed
Medium
LMA-Toolchain Fuel Plugins

Bug Description

This is a follow up of https://bugs.launchpad.net/lma-toolchain/+bug/1545743.

It can happen that the message injected by the heka_monitoring filter exceeds the maximum message size limit. A typical scenario is when the logstreamers match hundreds of files because Heka will create 1 decoder per file.
In that situation, no Heka metric are emitted and the related Grafana panels show no data.

There are (at least) 2 options to fix this issue:
- Increase the max message size limit.
- Strip down the number of collected metrics.

Revision history for this message
Swann Croiset (swann-w) wrote :
Revision history for this message
Simon Pasquier (simon-pasquier) wrote :

In that case, we would be limited by the max_process_inject parameter.

Anyway, it would make more sense to aggregate the metrics by classes of decoder because right now the data is barely usable (too many decoder items). My take is that we should rethink which metrics we collect (for instance it would be useful to collect InChanCapacity/InChanLength) and how we aggregate them for efficiency and usability.

Revision history for this message
Swann Croiset (swann-w) wrote :

yes we must collect InChanCapacity/InChanLenth metrics.

the max_process_inject is set to 10 and I'm confident this is enough to split in 10 pieces the "bulk metric". If it's not the case we can introduce the same mechanisum we have in GSE filter:
https://github.com/openstack/fuel-plugin-lma-collector/blob/master/deployment_scripts/puppet/modules/lma_collector/files/plugins/filters/gse_cluster_filter.lua#L113-L129

Changed in lma-toolchain:
milestone: 1.0.0 → 0.10.0
Changed in lma-toolchain:
milestone: 0.10.0 → none
Swann Croiset (swann-w)
Changed in lma-toolchain:
milestone: none → 0.10.1
Swann Croiset (swann-w)
Changed in lma-toolchain:
milestone: 0.10.1 → 0.11.0
Changed in lma-toolchain:
milestone: 0.11.0 → 1.0.0
milestone: 1.0.0 → none
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.