warning: log line attempted over max size - leadership related
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
juju-core |
Fix Released
|
High
|
Ian Booth | ||
1.23 |
Fix Released
|
Critical
|
Ian Booth | ||
1.24 |
Fix Released
|
Critical
|
Ian Booth |
Bug Description
This environment is nothing too special. Juju 1.23.0, been running for over a month. Now I'm getting lots of errors in my syslog, meanwhile jujud and mongo are both spinning on cpu taking up in the 60% range all the time, while using up in the 1G range of memory.
Restarting the services is not enough to clear things up.
I attached a full machine-0.log (it's over 9G uncompressed) and a mongo database dump.
Getting messages repeating every minute or so like this:
May 21 18:44:44 juju-lcy01-
...
"555e26bc91a7b1
tags: | added: cloud-installer |
description: | updated |
tags: | added: mongodb |
Changed in juju-core: | |
milestone: | none → 1.25.0 |
status: | New → Triaged |
importance: | Undecided → High |
Changed in juju-core: | |
assignee: | nobody → Ian Booth (wallyworld) |
importance: | High → Critical |
status: | Triaged → In Progress |
Changed in juju-core: | |
status: | In Progress → Fix Committed |
Changed in juju-core: | |
importance: | Critical → High |
status: | Fix Committed → Fix Released |
tags: | added: kanban-cross-team |
I've had a look at the DB and the lease collection looks like it has problems. All the lease documents have huge numbers of txn-queue entries, which all refer to completed transactions. Typically mgo/txn will leave only the last completed transaction to touch a document in the txn-queue field so this is quite unusual.
Here's the txn-queue sizes of each of the lease docs in the dump:
landscape- client- precise- leadership: 20746 client- trusty- leadership: 21381 client- utopic- leadership: 20685 client- vivid-leadershi p: 5942 client- leadership: 18785 server- leadership: 18777 client- leadership: 18769 server- leadership: 18771 client- leadership: 18747 server- leadership: 18751 leadership: 18714 leadership: 5918
landscape-
landscape-
landscape-
precise-
precise-
swap-leadership: 18690
trusty-
trusty-
ubuntu-leadership: 18782
utopic-
utopic-
vivid-client-
vivid-server-
I'm not sure if this is related to bug 1454891 but it certainly could be.