If Jenkins is down at submission, the ticket gets stuck in queued forever

Bug #1289296 reported by Evan
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Ubuntu CI Engine
Fix Released
High
Andy Doan
Ubuntu CI Services
Fix Released
High
Andy Doan

Bug Description

I ran a deployment and Jenkins failed to start due to a race (as far as we can tell). I hadn't noticed this and tried to feed a ticket through, which is still sitting in the queued state an hour later, long after restarting Jenkins.

Nothing in the rabbit queues.
Nothing queued up in Jenkins.

Tags: airline

Related branches

Evan (ev)
Changed in ubuntu-ci-services-itself:
importance: Undecided → Critical
milestone: none → phase-0
assignee: nobody → Andy Doan (doanac)
Revision history for this message
Andy Doan (doanac) wrote :

i wish we had the lander's run_worker log for this, but i think this might be easy to re-create. I suspect this could be our issue:

http://bazaar.launchpad.net/~canonical-ci-engineering/ubuntu-ci-services-itself/trunk/view/head:/lander/lander/run_worker.py#L125

Looks like we try a fixed amount of times to contact jenkins before we give up. I'm thinking we probably want these loops to be "while True" type things.

It would be nice to have this contribute to our "engine health" page in the webui. This service doesn't have a REST api running though, so it might not be possible for phase0

Revision history for this message
Chris Johnston (cjohnston) wrote :

I think this *should* be fixed for now based on bug #1288947... It still needs to be figured out, but I don't think we ever ran into this issue before we started using the IS charm.

Revision history for this message
Andy Doan (doanac) wrote :

So the bug won't happen anymore. But I think we probably want to make the lander code more resilient still. I'm going to knock the importance down to "high" since we aren't dead in the water now.

Changed in ubuntu-ci-services-itself:
importance: Critical → High
Revision history for this message
Andy Doan (doanac) wrote :

I'd like to add a health status for this, but that might take a bit longer. this is the minimal for phase-0, and i'll see what i can do to get more done.

Changed in ubuntu-ci-services-itself:
status: New → In Progress
Changed in ubuntu-ci-services-itself:
status: In Progress → Fix Committed
Andy Doan (doanac)
Changed in ubuntu-ci-services-itself:
status: Fix Committed → Fix Released
tags: added: airline
Ursula Junque (ursinha)
Changed in uci-engine:
assignee: nobody → Andy Doan (doanac)
importance: Undecided → High
milestone: none → phase-0
status: New → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.