load spike on HA 2.2.6 controller following remove-application

Bug #1733708 reported by Paul Collins on 2017-11-21
This bug affects 7 people
Affects    Importance  Assigned to
juju       High        Unassigned
juju 2.2   High        Andrew Wilkins
juju 2.3   High        Andrew Wilkins

Bug Description

One of our common deployment patterns involves deploying a new Juju application, making its units active in haproxy, removing the previous application, and waiting for it to be removed.

Today during such a deployment, our 3-node 2.2.6 controller started registering increasing load, ever more mongodb operations, and increasingly poor response times, e.g. multiple minutes to request "juju status" of a single application (see attached Grafana screenshot).

The load spike lines up with one of the deployment scripts having issued juju remove-application after the new application was deployed and made active.

Paul Collins (pjdc) on 2017-11-21
description: updated
Paul Collins (pjdc) wrote :

some mongotop: https://pastebin.canonical.com/203792/ (Canonical-only link, sorry!)

Ian Booth (wallyworld) wrote :

Can we get mongo syslog and juju controller logs as well?

Paul Collins (pjdc) wrote :

machine-*.log from 20:50 to 20:59: https://pastebin.canonical.com/203796/

mongod syslog on primary (it became primary at 07:53:50) from 20:50 to 20:59: https://private-fileshare.canonical.com/~pjdc/juju2controllers-mongod-syslog.xz

Paul Collins (pjdc) wrote :

Sorry, those mongodb logs are from 23:50 to 23:59. Here's 20:50 to 20:59:

https://private-fileshare.canonical.com/~pjdc/juju2controllers-mongod-syslog-2.xz

The "wrong" logs might still be of interest as they probably reflect the system's current state.

Ian Booth (wallyworld) wrote :

https://pastebin.canonical.com/203796/ looks like a typo - it contains some mongo commands rather than machine log entries

Paul Collins (pjdc) wrote :

Yep, sorry. The machine-*.log entries are https://pastebin.canonical.com/203794/ (try https://pastebin.canonical.com/203794/plain/ if the former takes too long to load)

Tim Penhey (thumper) on 2017-11-22
Changed in juju:
status: New → Triaged
importance: Undecided → High
milestone: none → 2.3.1
Junien Fridrick (axino) wrote :

FWIW, this is not the first time this has happened; it also happened to me last week (when we were still on 2.2.4). And I'm pretty sure I linked at least one load spike to an application removal earlier than that.

Tim Penhey (thumper) wrote :

I watched the removal of an application today with wgrant and t0mb0.

Before we started we checked out the database. The txns collection had just been cleaned up from the previous night and was at a reasonable level. There were no pending cleanups for the model, and mongotop looked sane.

The command was just to remove a single application that had two units. Each of these principal units would have had four subordinates.

Just after the removal, the juju.txns.log collection had reads go through the roof. A lot of write load into juju.txns. The read load on juju.txns.log was high for a number of minutes causing 'juju status' on other models to spike to 20s from 0.5s.

There was nothing untoward going on as far as I could tell, but the statetracker report on each of the controllers showed that every statepool had 184 models (and a few extra just because). Across the three controllers this would have resulted in 552 model workers each polling txns.log.

I think what we are hitting here is a situation where the cascading changes due to a deletion cause load in the txns collection. This causes a spike in the txns.log reads due to the number of tailers. This introduces significant i/o load on mongo, which in turn can cause other commands to fail, which causes retries, which triggers additional transactions, which just adds to the overall load. This can degrade into a death spiral that is only recoverable by restarting the application servers.

To solve this, I think we really need to investigate a way to provide a central txns.log tailer for each controller.

Junien Fridrick (axino) wrote :

I found out today that in another model (the staging version of the model you took a look at yesterday), an application is removed multiple times per day, but this doesn't generate any load increase. This staging model has about half the number of statuseshistory records, and a third of the logs records, of the production model (in case this is relevant).

This probably invalidates the theory you posted above :|

Tim Penhey (thumper) wrote :
Changed in juju:
status: Triaged → In Progress
assignee: nobody → Tim Penhey (thumper)
Tim Penhey (thumper) wrote :

Junien, is the staging model using the same controllers?

I think the load occurs when there are enough changes. Sometimes the load is enough to trigger bad behaviour and sometimes not.

Are there subordinates in this staging model?

tags: added: scalability
Junien Fridrick (axino) wrote :

Yes, it's using the same controller, and yes, the staging model has the same number of subordinates (6) as production.

However, production has 4 units where staging only has 2.

Changed in juju:
milestone: 2.3.1 → none
Tim Penhey (thumper) on 2017-12-10
Changed in juju:
milestone: none → 2.3.2
Junien Fridrick (axino) wrote :

This is still impacting a 2.2.8 controller when removing applications from a 2.2.6 model. The mongodb "commands" counter (see https://docs.mongodb.com/manual/reference/command/serverStatus/ , section opcounters.command) increases sharply, from ~600 per second to ~3500 per second.

I captured 100M worth of operations (using db.setProfilingLevel(2, 100)) during high load, and after a restart, when load was back to normal.

I'm unable to find anything relevant in this. And as a matter of fact, the 100MB were reached faster when under normal load than when under high load.

"perf top" showed this line at the top on the mongodb primary :
24.89% jujud [.] crypto/sha1.blockAMD64

And it went up to 60% on the secondaries (it's lower on the primary because mongodb takes a fair share of the CPU, I guess). And SHA1 is used during the SASL handshake. So it could be that all the controllers are trying to SASL to mongodb as fast as possible.

In fact, even with mongodb profiling (logging any command taking more than 100ms), I can see mongodb logging multiple thousands of saslStart per minute during the high load period.

When load was back to normal, I configured profiling to log _everything_, and got fewer than 10 saslStart calls in 30 seconds.

I'll keep on investigating when this re-occurs (we can trigger it easily).

Note that 2.2.8 made "high load" lower than before.

I believe the issue is that something internal to Juju causes us to not
fully consume cursors, which ultimately leads us to think the
connection pool is full, so we open one more connection to mongo. If we
had fully consumed and closed the cursor, then that connection would be
available in the pool to re-use for the next query, which would avoid
the need to handshake a new connection. (And also deals with things
like too-many-open-file-handles.)

I think having some sort of logging that gets triggered whenever a new
connection is created, with a stack trace of what we're doing to cause
the connection to be opened, could be useful. It's tricky to do in code,
because it requires hacking the mgo driver to generate the full stack
trace. Do we have any way to reproduce this sort of behavior outside of
production?


Junien Fridrick (axino) wrote :

> Do we have any way to reproduce this sort of behavior that isn't production ?

Not that I'm aware of, unfortunately.

Junien Fridrick (axino) wrote :

Is there a way to kill juju in such a way that it would generate a core dump and/or display all stack traces or something? (the same as what happens when you SIGQUIT a telegraf process)

Junien Fridrick (axino) wrote :

I forgot about the pprof facilities, I'll take a look at them next time !

John A Meinel (jameinel) wrote :

SIGQUIT does exactly that for Juju, with the caveat that it outputs to
whatever the original log file was when we started up. (it is captured via
systemd, and when you rotate log files, that doesn't cause stderr to rotate
as well.)

We also have pprof and some of the other utilities as well.


Tim Penhey (thumper) wrote :

After more investigation, I came across this:

https://github.com/juju/juju/blob/2.2/state/status.go#L371

Statuses history is being removed all at once, and isn't batched at all. I believe that this will be causing i/o timeouts and reconnections.

Tim Penhey (thumper) wrote :

There is a function in state/prune.go for pruning collections. It seems to have some hard-coded references to status history, but it could and should be reused in the places where we are removing all the status history documents.

Andrew Wilkins (axwalk) wrote :

https://github.com/juju/juju/pull/8254 batch-deletes status history.

William Grant (wgrant) wrote :

After a couple of deploys yesterday reinforced the statuseshistory-pruning hypothesis, we preserved some candidate machines and executed controlled demolitions today, which basically proved that the statuseshistory cleanup is the problem.

For context, the two models (on the same 3-way HA OpenStack controller running Juju 2.2.8) that we can reproduce this on have frequently created and deleted applications with two (staging) and four (production) units each with four and six subordinates respectively. Staging is usually deployed to a couple of times a day, and production 1-3 times a week. Relevant model settings are update-status-hook-interval=5m, max-status-history-age=336h, max-status-history-size=5G. The principal and a couple of the subordinates set workload status on every update-status.

Deployments consist of deploying a new application, verifying it, adding relations, then remove-application and remove-machine --force on the old one (the remove-machine --force is mostly a leftover from the 2.1 subordinate bugs). In almost every production deployment, the remove phase tickles this bug. In staging deployments it is very rare, but has happened on at least two occasions. The main differences between the models are that production has twice as many machines and 50% more subordinates, and its units tend to live for several days rather than no more than one or two. It was on this basis that we speculated in November that statuseshistory might be to blame.

During the week of December 18, we were attempting to tickle the bug in controlled, monitored circumstances, so performed production deployments more closely together than is usual. The second production deployment in 24 hours did not tickle the bug, the first time in months that a deployment had proceeded without incident, and so we were unable to gather particularly useful data.

Both production and staging went without deployments for 12 days over the EOY break. The first staging deployment of 2018 tickled the bug -- unusual for staging. Controller logs showed "cannot delete history for unit" errors with IO timeouts for some of the units, some repeating over the next 20 minutes. The saslStart storm mentioned in comment #15 was again evident. A controller restart resolved things.

I preserved remaining production and staging machines from before the break for more detailed experiments. We manually purged the statuseshistories for the units on one machine of the other 12-day-old staging application, and a remove-machine --force worked fine; the machine vanished in about 5s and no added "juju status" latency was observed. Without removing statuseshistories, remove-machine --force of the other machine also worked okay, but it took more than 30s, load on one controller spiked above 90, and status was delayed by several seconds during the removal. Each machine's units had about 50000 statuseshistories in total (on both production and staging there were ~7700 statuseshistory documents for each unit agent, and the same for the charms that update their status often).

Since that seemed like a reasonably successful test, we tried it on a 13-day-old production application inside a normal deployment ...


William Grant (wgrant) wrote :

We also this morning saw a saslStart storm during a destroy-model from an unrelated team on the same controller, but it was too late to gather any data.

Tim Penhey (thumper) wrote :

The deletion of the statuses history values is done using a cleanup process. These are triggered as the units transition from dying to dead and are executed asynchronously, but serially for a particular model.

What we would see would be a unit hanging around for a while although dead as the cleanup jobs run. The cleanup could still delete the statuses history in small batches to effectively yield time back to the database for dealing with other requests.

Tim Penhey (thumper) on 2018-01-03
Changed in juju:
milestone: 2.3.2 → 2.4-beta1
status: In Progress → Triaged
assignee: Tim Penhey (thumper) → nobody
Junien Fridrick (axino) wrote :

Hi,

Disclaimer: much reading but not much progress here :(

So this is still happening, even with 2.2.9. wgrant has a controller that is under high load / high sha1.blockAMD64 activity. On this controller, I tried to get a perf flamegraph but it wasn't really useful: https://private-fileshare.canonical.com/~axino/lp1733708/perf.svg (you can click on boxes to zoom in on them).

I also got a graph of the pprof profile data, which yielded https://private-fileshare.canonical.com/~axino/lp1733708/calls.svg - this isn't as useful as it looks. However, taking a look with "peek" gives hints at what's going on. We're going to start at sha1.blockAMD64 and go up the call chain:

(pprof) peek crypto/sha1.blockAMD64
24.79s of 26.04s total (95.20%)
Dropped 381 nodes (cum <= 0.13s)
----------------------------------------------------------+-------------
      flat flat% sum% cum cum% calls calls% + context
----------------------------------------------------------+-------------
                                            16.77s 100% | crypto/sha1.block
    16.77s 64.40% 64.40% 16.77s 64.40% | crypto/sha1.blockAMD64
----------------------------------------------------------+-------------

^ means that all calls to crypto/sha1.blockAMD64 are made from crypto/sha1.block. Let's continue up:

(pprof) peek crypto/sha1.block$
24.79s of 26.04s total (95.20%)
Dropped 381 nodes (cum <= 0.13s)
----------------------------------------------------------+-------------
      flat flat% sum% cum cum% calls calls% + context
----------------------------------------------------------+-------------
                                               17s 100% | crypto/sha1.(*digest).Write
     0.22s 0.84% 0.84% 17.01s 65.32% | crypto/sha1.block
                                            16.77s 100% | crypto/sha1.blockAMD64
----------------------------------------------------------+-------------

(pprof) peek crypto/sha1.\(\*digest\).Write
24.79s of 26.04s total (95.20%)
Dropped 381 nodes (cum <= 0.13s)
----------------------------------------------------------+-------------
      flat flat% sum% cum cum% calls calls% + context
----------------------------------------------------------+-------------
                                            10.28s 53.15% | crypto/sha1.(*digest).checkSum
                                             4.57s 23.63% | crypto/hmac.(*hmac).Sum
                                             4.21s 21.77% | crypto/hmac.(*hmac).Reset
                                             0.28s 1.45% | crypto/hmac.(*hmac).Write
     1.83s 7.03% 7.03% 19.39s 74.46% | crypto/sha1.(*digest).Write
                                               17s 96.81% | crypto/sha1.block
                                             0.56s 3.19% | runtime.memmove
----------------------------------------------------------+-------------

There's a split here, with 3 different callers > 20%, but they actually all converge back just at the boundary of the "crypto" package. Let's keep looking:

(pprof) peek \.checkSum$
24.79s of 26.04s total (95.2...

John A Meinel (jameinel) wrote :

Given that this overhead is from SASL login calls, and this bug is filed
against mgo, I wonder if it is the culprit:
https://github.com/go-mgo/mgo/issues/254

(it seems to fit the evidence, at least).

John
=:->


Anastasia (anastasia-macmood) wrote :

@John A Meinel,

I am not too sure where this stands for other series... It appears that a lot of code was committed to improve performance, but we have not had recent confirmation that the load issues are resolved.

I'll mark it as "Incomplete" for now. Also, we will not be fixing this in 2.2 as there are no further point releases planned in this series.

Changed in juju:
status: Triaged → Fix Committed
Anastasia (anastasia-macmood) wrote :

Actually, marking as "Fix Committed" for "juju" since I am sure all related code has been forward-ported to "develop" (heading into 2.4+) as part of a larger merge.

Changed in juju:
status: Fix Committed → Fix Released