Mitaka > Newton upgrade with Ceph fails with: "WARNING: Waiting for Ceph cluster status to go HEALTH_OK"

Bug #1709370 reported by Giulio Fidente
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Won't Fix
Undecided
Unassigned
Newton
Fix Committed
High
Giulio Fidente

Bug Description

$ heat deployment-show 34342ccf-8cfb-4098-ad9f-b5fe046015b3
WARNING (shell) "heat deployment-show" is deprecated, please use "openstack software deployment show" instead
{
  "status": "FAILED",
  "server_id": "c7e83ce4-945f-4342-82c5-8c4082dad060",
  "config_id": "c6f9e505-31c4-4c0c-a02c-136041b8917a",
  "output_values": {
    "deploy_stdout": "INFO: starting c6f9e505-31c4-4c0c-a02c-136041b8917a\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\nWARNING: Waiting for Ceph cluster status to go HEALTH_OK\n",
    "deploy_stderr": "",
    "deploy_status_code": 124
  },
  "creation_time": "2017-06-30T21:27:22Z",
  "updated_time": "2017-06-30T21:34:03Z",
  "input_values": {
    "update_identifier": "",
    "deploy_identifier": "1498857598"
  },
  "action": "CREATE",
  "status_reason": "deploy_status_code : Deployment exited with non-zero status code: 124",
  "id": "34342ccf-8cfb-4098-ad9f-b5fe046015b3"
}

Tags: upgrade
Changed in tripleo:
status: Confirmed → Won't Fix
importance: High → Undecided
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/newton)

Fix proposed to branch: stable/newton
Review: https://review.openstack.org/491847

Changed in tripleo:
milestone: none → pike-rc1
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/newton)

Reviewed: https://review.openstack.org/491847
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=3f1a33666a9a5a960c79cd3142dc77c73cd7e506
Submitter: Jenkins
Branch: stable/newton

commit 3f1a33666a9a5a960c79cd3142dc77c73cd7e506
Author: Giulio Fidente <email address hidden>
Date: Tue Aug 8 18:06:32 2017 +0200

    [NEWTON-ONLY] Ignore Ceph healt warning states by default

    The Ceph cluster health state can change from _OK to _WARN during upgrade
    because MONs are upgraded one by one and the cluster tunables can't
    satisfy both instances at the same time.

    This submission changes the default value of IgnoreCephUpgradeWarnings
    into true, which make the upgrade script to ignore non _ERR states, while
    still checking for the node to be back into quorum, behaving similarily to
    ceph-ansible.

    Closes-Bug: #1709370
    Change-Id: I96140e32d9436aaa85003037bcc789384378b364

Revision history for this message
Steven Hardy (shardy) wrote :

newton only fix so removing pike milestone

Changed in tripleo:
milestone: pike-rc1 → none
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 5.3.1

This issue was fixed in the openstack/tripleo-heat-templates 5.3.1 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.