Graph Flow Control using Galera 3 wsrep_flow_control_paused_ns

Bug #1325836 reported by Kenny Gryp
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Percona Monitoring Plugins
New
Undecided
Unassigned

Bug Description

The current Flow Control graphs are broken in some scenarios.

Clustercheck does a show global status, which resets the wsrep_flow_control_paused status variable. I have found a case where flow control would almost always be 0 on a cacti graph, but the system has massive flow control issues.

Reason for that is that clustercheck was performed on the database 8 times per second. While that's not optimal, doing it once per minute (which is very very low frequency) can also easily distort the flow control graphs.

Galera 3 has a counter wsrep_flow_control_paused_ns which can be used.

Tags: cacti galera
Revision history for this message
Kenny Gryp (gryp) wrote :

This is not documented yet, see https://github.com/codership/galera/issues/48

Revision history for this message
Roman Vynar (roman-vynar) wrote :

To consider:
> wsrep_flow_control_sent, wsrep_flow_control_recv DERIVE in galera 3, GAUGE in galera 2
> wsrep_flow_control_paused is always GAUGE
> wsrep_flow_control_paused_ns is always DERIVE but only exists in galera 3

tags: added: cacti galera
Revision history for this message
Roman Vynar (roman-vynar) wrote :

wsrep_flow_control_paused in v3 is how much % of the time flow control was active since 'flush status' or total server uptime (in general, useless for v3) . wsrep_flow_control_paused_ns is fine.

Revision history for this message
Kenny Gryp (gryp) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.