nova-scheduler upgrade from newton to ocata doesn't work

Bug #1667728 reported by Emilien Macchi
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
Critical
Sofer Athlan-Guyot

Bug Description

When upgrading TripleO from Newton to Ocata, nova-scheduler fails to stop and start at step 2.

Looking at the logs:
http://logs.openstack.org/64/437364/2/check/gate-tripleo-ci-centos-7-multinode-upgrades-nv/1823021/logs/subnode-2/syslog.txt.gz#_Feb_24_11_24_56

It seems like nova-scheduler wasn't stopped by the Ansible task at step 2, though packages were updated.
Since RabbitMQ is stopped at step2, there is no way Nova services can work at the end of step 2, therefore, we need to make sure Nova is actually stopped, which doesn't seem the case.

We should upgrade Nova packages at step 3 like other services.

summary: - nova-schedule upgrade from newton to ocata doesn't work
+ nova-scheduler upgrade from newton to ocata doesn't work
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (master)

Fix proposed to branch: master
Review: https://review.openstack.org/437958

Changed in tripleo:
assignee: nobody → Sofer Athlan-Guyot (sofer-athlan-guyot)
status: Triaged → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-heat-templates (stable/ocata)

Fix proposed to branch: stable/ocata
Review: https://review.openstack.org/438058

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (master)

Reviewed: https://review.openstack.org/437958
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=5593877817b332bd7a650f051cff5f5988ab1e54
Submitter: Jenkins
Branch: master

commit 5593877817b332bd7a650f051cff5f5988ab1e54
Author: Sofer Athlan-Guyot <email address hidden>
Date: Fri Feb 24 17:01:35 2017 +0100

    Upgrade nova-api/scheduler/conductor packages at step3 not step2.

    The nova-api, nova-scheduler nova-conductor packages are updated during
    step2. The package upgrade trigger a restart of the service which fails
    and is constantly retried by systemd:

        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: Failed to start OpenStack Nova Scheduler Server.
        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: Unit openstack-nova-scheduler.service entered failed state.
        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: openstack-nova-scheduler.service failed.
        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: openstack-nova-scheduler.service holdoff time over, scheduling restart.

    We eventually reach timeout. We use
    https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/services/tripleo-packages.yaml#L44-L46
    to upgrade existing packages.

    Add a note to the README.rst to make people aware of the general upgrade
    done at step3 and limit its usage to new package for individual service.

    Change-Id: I13b51bcfe0c98034944613f7e1c3f0168cd4de76
    Closes-Bug: #1667728

Changed in tripleo:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-heat-templates (stable/ocata)

Change abandoned by Athlan-Guyot sofer (<email address hidden>) on branch: stable/ocata
Review: https://review.openstack.org/438058
Reason: redo the cherry pick after the merge.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-heat-templates (stable/ocata)

Reviewed: https://review.openstack.org/438058
Committed: https://git.openstack.org/cgit/openstack/tripleo-heat-templates/commit/?id=577ea9b01fb62c7f59f7e47b1f9959c52163ce08
Submitter: Jenkins
Branch: stable/ocata

commit 577ea9b01fb62c7f59f7e47b1f9959c52163ce08
Author: Sofer Athlan-Guyot <email address hidden>
Date: Fri Feb 24 17:01:35 2017 +0100

    Upgrade nova-api/scheduler/conductor packages at step3 not step2.

    The nova-api, nova-scheduler nova-conductor packages are updated during
    step2. The package upgrade trigger a restart of the service which fails
    and is constantly retried by systemd:

        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: Failed to start OpenStack Nova Scheduler Server.
        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: Unit openstack-nova-scheduler.service entered failed state.
        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: openstack-nova-scheduler.service failed.
        Feb 24 12:34:24 centos-7-2-node-rax-iad-7463943-440549 systemd[1]: openstack-nova-scheduler.service holdoff time over, scheduling restart.

    We eventually reach timeout. We use
    https://github.com/openstack/tripleo-heat-templates/blob/master/puppet/services/tripleo-packages.yaml#L44-L46
    to upgrade existing packages.

    Add a note to the README.rst to make people aware of the general upgrade
    done at step3 and limit its usage to new package for individual service.

    Change-Id: I13b51bcfe0c98034944613f7e1c3f0168cd4de76
    Closes-Bug: #1667728
    (cherry picked from commit 5593877817b332bd7a650f051cff5f5988ab1e54)

tags: added: in-stable-ocata
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 6.0.0.0rc2

This issue was fixed in the openstack/tripleo-heat-templates 6.0.0.0rc2 release candidate.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-heat-templates 7.0.0.0b1

This issue was fixed in the openstack/tripleo-heat-templates 7.0.0.0b1 development milestone.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.