communication failed - user timeout caused connection failure
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Launchpad itself |
Fix Released
|
High
|
Julian Edwards |
Bug Description
A number of times over the past few days, people have been complaining that builds are being bounced from one builder to another. maxb reported this morning that issues with a build of bzr. Checking the logs shows the builder dispatching fine, and then:
2010-09-24 07:00:49+0000 [-] <americium:http://
Checking the history for this builder shows that, although it's still taking on new builds, the last build to finish was over 3 hrs ago.
Looking for similar errors in the logs results in quite a few buildds, shown below. I checked a few of these buildd's histories, and they all have a large gap of 3-4 hours where they didn't finish anything - as if none of them could communicate the results back.
2010-09-24 07:31:13+0000 [-] <actinium:http://
2010-09-24 07:31:13+0000 [-] <actinium:http://
2010-09-24 07:31:13+0000 [-] <einsteinium:http://
2010-09-24 07:31:13+0000 [-] <einsteinium:http://
2010-09-24 07:31:53+0000 [-] <cushaw:http://
2010-09-24 07:31:53+0000 [-] <cushaw:http://
2010-09-24 07:31:53+0000 [-] <hawthorn:http://
2010-09-24 07:31:53+0000 [-] <hawthorn:http://
2010-09-24 07:31:54+0000 [-] <allspice:http://
2010-09-24 07:31:54+0000 [-] <allspice:http://
2010-09-24 07:31:54+0000 [-] <adare:http://
2010-09-24 07:31:54+0000 [-] <adare:http://
2010-09-24 07:31:54+0000 [-] <gourd:http://
2010-09-24 07:31:54+0000 [-] <gourd:http://
2010-09-24 07:31:57+0000 [-] <palmer:http://
2010-09-24 07:31:57+0000 [-] <palmer:http://
2010-09-24 07:31:57+0000 [-] <genip:http://
2010-09-24 07:31:57+0000 [-] <genip:http://
2010-09-24 07:31:57+0000 [-] <crested:http://
2010-09-24 07:31:57+0000 [-] <crested:http://
2010-09-24 07:31:57+0000 [-] <plutonium:http://
2010-09-24 07:31:57+0000 [-] <plutonium:http://
2010-09-24 07:31:57+0000 [-] <nannyberry:http://
2010-09-24 07:31:57+0000 [-] <nannyberry:http://
2010-09-24 07:31:57+0000 [-] <mercury:http://
2010-09-24 07:31:58+0000 [-] <mercury:http://
2010-09-24 07:31:58+0000 [-] <lakoocha:http://
2010-09-24 07:31:58+0000 [-] <lakoocha:http://
etc.
Related branches
- Jonathan Lange (community): Approve
-
Diff: 7192 lines (+2211/-3509)24 files modifiedlib/lp/buildmaster/doc/builder.txt (+2/-118)
lib/lp/buildmaster/interfaces/builder.py (+83/-62)
lib/lp/buildmaster/manager.py (+205/-469)
lib/lp/buildmaster/model/builder.py (+240/-224)
lib/lp/buildmaster/model/buildfarmjobbehavior.py (+60/-52)
lib/lp/buildmaster/model/packagebuild.py (+6/-0)
lib/lp/buildmaster/tests/mock_slaves.py (+157/-32)
lib/lp/buildmaster/tests/test_builder.py (+582/-154)
lib/lp/buildmaster/tests/test_manager.py (+248/-782)
lib/lp/buildmaster/tests/test_packagebuild.py (+12/-0)
lib/lp/code/model/recipebuilder.py (+32/-28)
lib/lp/soyuz/browser/tests/test_builder_views.py (+1/-1)
lib/lp/soyuz/doc/buildd-dispatching.txt (+0/-371)
lib/lp/soyuz/doc/buildd-slavescanner.txt (+0/-876)
lib/lp/soyuz/model/binarypackagebuildbehavior.py (+59/-41)
lib/lp/soyuz/tests/test_binarypackagebuildbehavior.py (+290/-8)
lib/lp/soyuz/tests/test_doc.py (+0/-6)
lib/lp/testing/factory.py (+8/-2)
lib/lp/translations/doc/translationtemplatesbuildbehavior.txt (+0/-114)
lib/lp/translations/model/translationtemplatesbuildbehavior.py (+20/-14)
lib/lp/translations/stories/buildfarm/xx-build-summary.txt (+1/-1)
lib/lp/translations/tests/test_translationtemplatesbuildbehavior.py (+202/-153)
lib/lp_sitecustomize.py (+3/-0)
utilities/migrater/file-ownership.txt (+0/-1)
description: | updated |
Changed in soyuz: | |
status: | Triaged → Fix Released |
milestone: | none → 10.11 |
maxb witnessed the issue again:
10:32 < maxb> noodles775: shipova just ejected my build, it's now starting again on thorium
10:34 < maxb> noodles775: for the record, that build on shipova had most definitely started. It had been running for an hour, and was displaying build log output
Checking the log this time shows what seems to be a different issue - the builder being marked as not ok:
2010-09-24 08:30:53+0000 [-] shipova was made unavailable, resetting attached job