Upgrade 1.24.7 -> 1.25.3 fails

Bug #1546100 reported by Stuart Bishop
This bug affects 1 person
Affects     Status         Importance   Assigned to   Milestone
juju-core   Fix Released   Critical     Anastasia
1.24        Won't Fix      High         Unassigned
1.25        Fix Released   Critical     Anastasia

Bug Description

An attempt to upgrade an OpenStack environment with manually provisioned machines failed, with Machine 0 in an error state and all other units 'lost'.

upgrade to 1.25.3 failed (giving up): tag machine instances: tagging instance manual:xxxxxx.internal for machine 4: setting server metadata: failed to set metadata map[juju-env-uuid:xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx] on server with id: manual:xxxxxx.internal
      caused by: Resource at http://10.99.99.99:8774/v2/xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx/servers/manual:xxxxxx.internal/metadata not found
      caused by: request (http://10.99.99.99:8774/v2/xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx/servers/manual:xxxxxx.internal/metadata) returned unexpected status: 404; error info: {"itemNotFound": {"message": "Server does not exist", "code": 404}}

I will need guidance on how to recover the environment.

Martin Packman (gz)
tags: added: manual-provider upgrade-juju
Cheryl Jennings (cherylj) wrote :

Relevant comment from IRC: The IP in the URLs is the address of the controller unit, and that is just openstack. I suspect the trigger is my units are all manually provisioned.

Andrew Wilkins (axwalk) wrote :

Yes, the issue is that they're manually provisioned. There's an oversight in upgrades/tags.go: we're passing all instances to the provider to add tags, but we should exclude any manual machines.

Changed in juju-core:
status: New → Triaged
importance: Undecided → Critical
milestone: none → 1.25.4
Changed in juju-core:
assignee: nobody → Anastasia (anastasia-macmood)
status: Triaged → In Progress
Changed in juju-core:
milestone: 1.25.4 → 2.0-beta2
milestone: 2.0-beta2 → 1.25.4
Ian Booth (wallyworld)
Changed in juju-core:
milestone: 1.25.4 → none
Changed in juju-core:
milestone: none → 2.0-beta4
no longer affects: juju-core
Anastasia (anastasia-macmood) wrote :

Whilst this bug is fixed, the ability to restart a broken upgrade needs to be addressed - bug # 1548564.

Anastasia (anastasia-macmood) wrote :

We are working on the upgrade path to recover the system in the original bug report.

Stuart Bishop (stub)
tags: added: canonical-is
Anastasia (anastasia-macmood) wrote :

At this stage, recovery of environments with a failed upgrade is manual. I'll describe the process in the bug that tracks recovery (bug # 1548564).

Cheryl Jennings (cherylj) wrote :

Users should upgrade directly to 1.25.4 to avoid hitting this issue.

Changed in juju-core:
assignee: nobody → Anastasia (anastasia-macmood)
importance: Undecided → Critical
status: New → Fix Released