Repeated timeouts when trying to accept packages into intrepid-backports

Bug #332529 reported by Scott Kitterman
Affects: Launchpad itself
Status: Triaged
Importance: High
Assigned to: Unassigned
Milestone: (none listed)

Bug Description

See (Error ID: OOPS-1148G1692) for one example. It seems to work about one time in 20. This is blocking work and needs to be addressed immediately.

Scott Kitterman (kitterman) wrote :

Or maybe even fewer. (Error ID: OOPS-1148D1805)

Scott Kitterman (kitterman) wrote :

It seems to have been a transient issue, but I think it still ought to be investigated.

Scott Kitterman (kitterman) wrote :

Happening again:
(Error ID: OOPS-1150E542)
(Error ID: OOPS-1150E543)
(Error ID: OOPS-1150C597)
(Error ID: OOPS-1150F576)
(Error ID: OOPS-1150G579)

And then I gave up.

Changed in launchpad:
status: New → Confirmed
Diogo Matsubara (matsubara) wrote :

Celso, can you take a look at this?

Changed in launchpad:
assignee: nobody → cprov
Changed in soyuz:
importance: Undecided → High
milestone: none → pending
status: Confirmed → Triaged
Scott Kitterman (kitterman) wrote :

Happened again.

[16:47:22] <ScottK> It seems I've just lost my ability to accept packages via LP (Error ID: OOPS-1203A1853)
[16:47:23] <ubottu> https://devpad.canonical.com/~jamesh/oops.cgi/1203A1853
[16:47:44] <ScottK> This is somewhat bad timing as we're about a week from a release ....
[16:47:56] <ScottK> I'd appreciate it if someone would take a look.
[16:48:37] <cody-somerville> What page gives that oops?
[16:48:55] --> thumper (n=quassel@125-236-193-95.adsl.xtra.co.nz) has joined #launchpad
[16:49:26] <ScottK> https://launchpad.net/ubuntu/jaunty/+queue?queue_state=1
[16:49:35] <ScottK> Right after I mash the accept button.
[16:50:25] <-- gianmt (<email address hidden>) has quit ("Leaving")
[16:50:39] <cody-somerville> Its a time out
[16:50:44] <ScottK> (Error ID: OOPS-1203D1950) if multiple copies help.
[16:50:45] <ubottu> https://devpad.canonical.com/~jamesh/oops.cgi/1203D1950
[16:50:52] <ScottK> cody-somerville: I'm aware of this.
[16:50:59] <ScottK> It's an internal problem though.
[16:51:05] <ScottK> It's happened before.
[16:54:17] <cody-somerville> ScottK, is it happening for all packages you try to accept or just a specific one?
[16:55:21] <ScottK> Bug #332529 is similar
[16:55:22] <ubottu> Launchpad bug 332529 in soyuz "Repeated timeouts when trying to accept packages into intrepid-backports" [High,Triaged] https://launchpad.net/bugs/332529
[16:55:35] <ScottK> There's only one that needs accepting right now.
[16:55:40] <-- magcius (<email address hidden>) has quit (Connection timed out)
[16:56:22] <-- BjornT (<email address hidden>) has quit (Read error: 110 (Connection timed out))
[16:57:06] <-- dominiks (<email address hidden>) has quit ("...")
[16:57:15] --> luke-jr_ (<email address hidden>) has joined #launchpad
[16:58:51] <ScottK> It finally went through.

Julian Edwards (julian-edwards) wrote :

Sorry about this, it was caused by accepting a package that had closed a bug which had a lot of subscribers. A similar optimisation was done elsewhere to make that process quicker so we'll fix this page too.

Björn Tillenius (bjornt) wrote : Re: [Bug 332529] Re: Repeated timeouts when trying to accept packages into intrepid-backports

On Mon, Apr 20, 2009 at 10:57:51AM -0000, Julian Edwards wrote:
> Sorry about this, it was caused by accepting a package that had closed a
> bug which had a lot of subscribers. A similar optimisation was done
> elsewhere to make that process quicker so we'll fix this page too.

The fix was done in the single place that inserted bug notification
recipients, so things should be better on edge already. I'm not sure
that fix is enough to stop the page from timing out, though.
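
The thread does not say what the optimisation to the recipient insert actually was. Purely as an illustration of why inserting recipients for a bug with many subscribers can be costly, here is a minimal sketch that contrasts one statement per subscriber with a single batched call. Everything in it is an assumption: the `recipient` table, its columns, and both function names are hypothetical, and it uses an in-memory SQLite database for self-containment, not Launchpad's own database or code.

```python
import sqlite3


def add_recipients_one_by_one(conn, notification_id, subscribers):
    """Issue one INSERT statement per subscriber (hypothetical slow path)."""
    for person in subscribers:
        conn.execute(
            "INSERT INTO recipient (notification, person) VALUES (?, ?)",
            (notification_id, person),
        )


def add_recipients_batched(conn, notification_id, subscribers):
    """Hand all rows to the driver in a single executemany() call."""
    rows = [(notification_id, person) for person in subscribers]
    conn.executemany(
        "INSERT INTO recipient (notification, person) VALUES (?, ?)",
        rows,
    )


# Hypothetical schema and data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE recipient (notification INTEGER, person TEXT)")
subscribers = [f"user{i}" for i in range(1000)]

add_recipients_one_by_one(conn, 1, subscribers)
add_recipients_batched(conn, 2, subscribers)

count_slow = conn.execute(
    "SELECT COUNT(*) FROM recipient WHERE notification = 1"
).fetchone()[0]
count_batched = conn.execute(
    "SELECT COUNT(*) FROM recipient WHERE notification = 2"
).fetchone()[0]
```

Both paths store the same rows; the difference is only in how many separate statements the application issues, which matters more as the subscriber count grows.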

Celso Providelo (cprov) wrote :

Looking at https://devpad.canonical.com/~matsubara/oops.cgi/2009-04-17/A1853 you can see that the major problem is the SPR queries issued in SPR.createBuild() to get a better build ETA.

I don't know exactly why or how an SPR.select(SPN.name='foobar') takes ~500 ms, but that code is executed 8 times (once per architecture), which adds up to roughly 4 seconds and certainly pushes sql-time above the timeout threshold.
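
A rough sketch of the arithmetic, using only the ~500 ms per-query figure and the 8 architectures quoted above. The function names are made up for illustration, and the "hoisted" variant assumes the lookup result does not depend on the architecture, which this report does not confirm. This is not Launchpad code.

```python
QUERY_COST_MS = 500   # approximate cost of one lookup, as quoted above
ARCHITECTURES = 8     # number of architectures, as quoted above


def sql_time_repeated(query_cost_ms, architectures):
    """Total SQL time if the same lookup runs once per architecture."""
    return query_cost_ms * architectures


def sql_time_hoisted(query_cost_ms, architectures):
    """Total SQL time if the lookup runs once and the result is reused."""
    return query_cost_ms if architectures > 0 else 0


repeated_ms = sql_time_repeated(QUERY_COST_MS, ARCHITECTURES)
hoisted_ms = sql_time_hoisted(QUERY_COST_MS, ARCHITECTURES)
print(f"repeated: {repeated_ms} ms, hoisted: {hoisted_ms} ms")
```

Under these assumptions the repeated lookups alone account for about 4 seconds of SQL time, against about half a second if the result could be fetched once and shared across architectures.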

Scott Kitterman (kitterman) wrote :

Still happens on edge: (Error ID: OOPS-1213EC246)

Scott Kitterman (kitterman) wrote :

Also on edge (in case you need more than one):

 (Error ID: OOPS-1213EB1171)
 (Error ID: OOPS-1213EB1172)

tags: added: queue-page
Curtis Hovey (sinzui)
Changed in soyuz:
assignee: Celso Providelo (cprov) → nobody