1.22 cannot upgrade to 1.26-alpha1: run.socket: no such file or directory

Bug #1493850 reported by Curtis Hovey
Affects: juju-core
Status: Fix Released
Importance: Critical
Assigned to: Casey Marshall
Milestone: (none given)

Bug Description

Upgrades from 1.22.6 to 1.26-alpha1 fail:
    http://reports.vapour.ws/releases/3039/job/aws-upgrade-22-trusty-amd64/attempt/123
    http://reports.vapour.ws/releases/3039/job/joyent-upgrade-22-trusty-amd64/attempt/127

1.22.6: dummy-source/0
2015-09-09 05:54:29 ERROR juju run on machine 1 returned 1: error: dial unix /var/lib/juju/agents/unit-dummy-source-0/run.socket: no such file or directory

Martin Packman (gz) wrote :

This also causes the OS-deployer job to fail:

<http://reports.vapour.ws/releases/3039/job/OS-deployer/attempt/247>

  swift-storage-z3:
    charm: local:trusty/swift-storage-92
    exposed: false
    service-status:
      current: error
      message: 'hook failed: "update-status"'

From the unit-swift-storage-z3-0.log:

2015-09-09 09:44:47 ERROR juju.juju.sockets sockets_nix.go:22 failed to listen on unix:@/var/lib/juju/agents/unit-swift-storage-z3-0/agent.socket: listen unix @/var/lib/juju/agents/unit-swift-storage-z3-0/agent.socket: bind: address already in use
2015-09-09 09:44:48 ERROR juju.worker.uniter.operation runhook.go:108 hook "update-status" failed: listen unix @/var/lib/juju/agents/unit-swift-storage-z3-0/agent.socket: bind: address already in use

Casey Marshall (cmars)
Changed in juju-core:
assignee: nobody → Casey Marshall (cmars)
Changed in juju-core:
status: Triaged → In Progress
Casey Marshall (cmars) wrote :

Update: http://reviews.vapour.ws/r/2620/

So it turns out that unit agent upgrades aren't actually getting applied, and possibly never have been. To confirm this, try running a 1.22 -> 1.26 upgrade (pre-maltese-falcon landing, if you like) with some logging added to updates/steps123, after the unit tag check. The logging never gets executed.

Landed a workaround that handles the potential for inconsistent unit local state, as a stop-gap until we can fix unit agent upgrades.
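
As a rough illustration of why steps that never run are easy to miss, here is a minimal Go sketch of a step runner that records which steps were skipped for a given agent kind. Target, Step, runSteps and appliesTo are hypothetical names invented for this example; this is not juju's upgrades API and not the workaround that landed.

```go
package main

import "fmt"

// Target is a hypothetical label for the kind of agent a step applies to.
type Target string

const (
	MachineAgent Target = "machine"
	UnitAgent    Target = "unit"
)

// Step is a hypothetical upgrade step; it is not juju's actual type.
type Step struct {
	Description string
	Targets     []Target
	Run         func() error
}

// appliesTo reports whether the step lists agent among its targets.
func appliesTo(s Step, agent Target) bool {
	for _, t := range s.Targets {
		if t == agent {
			return true
		}
	}
	return false
}

// runSteps runs every step that targets agent and returns the descriptions
// of the steps that ran and of those that were skipped, so that a step
// which silently never executes shows up in the skipped list.
func runSteps(agent Target, steps []Step) (ran, skipped []string, err error) {
	for _, s := range steps {
		if !appliesTo(s, agent) {
			skipped = append(skipped, s.Description)
			continue
		}
		if err := s.Run(); err != nil {
			return ran, skipped, fmt.Errorf("step %q: %w", s.Description, err)
		}
		ran = append(ran, s.Description)
	}
	return ran, skipped, nil
}

func main() {
	noop := func() error { return nil }
	steps := []Step{
		{Description: "migrate unit local state", Targets: []Target{UnitAgent}, Run: noop},
		{Description: "update machine config", Targets: []Target{MachineAgent}, Run: noop},
	}
	ran, skipped, err := runSteps(UnitAgent, steps)
	fmt.Println("ran:", ran)
	fmt.Println("skipped:", skipped)
	fmt.Println("error:", err)
}
```

Returning the skipped list alongside the list of steps that ran gives a test or a log line something concrete to assert on, which is the kind of check the logging experiment above performs by hand.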

Changed in juju-core:
status: In Progress → Fix Committed
Ian Booth (wallyworld) wrote :

New CI test run now passes.
http://juju-ci.vapour.ws:8080/job/aws-upgrade-22-trusty-amd64/127/

Marking as fix committed.

Changed in juju-core:
status: Fix Committed → Fix Released
Martin Packman (gz) wrote :

Have we got a follow-up bug for having upgrade steps that never run?

Tim Penhey (thumper)
tags: removed: blocker
Aaron Bentley (abentley)
Changed in juju-core:
status: Fix Released → Triaged
status: Triaged → Fix Released