Comment 8 for bug 1840176

OpenStack Infra (hudson-openstack) wrote : Fix merged to integ (master)

Reviewed: https://review.opendev.org/678474
Committed: https://git.openstack.org/cgit/starlingx/integ/commit/?id=9faad457030dc7ffd88f2e4106b01157c9b9a0ed
Submitter: Zuul
Branch: master

commit 9faad457030dc7ffd88f2e4106b01157c9b9a0ed
Author: Daniel Badea <email address hidden>
Date: Mon Aug 26 06:59:28 2019 +0000

    ceph-init-wrapper use flock instead of flag files

    When a swact occurs and ceph-init-wrapper is slow to respond
    to a status request, it gets killed by SM. This means the
    flag file that marks a status request as in progress is left
    behind.

    When the controller swacts back, ceph-init-wrapper sees the
    stale flag, assumes a status request is in progress, and waits
    for it to finish (with a timeout). Because it does not respond
    fast enough, SM tries to start ceph-init-wrapper again to get
    the ceph-mon service up and running.

    This happens a couple of times until the service is declared
    failed and the controller swacts back.

    To fix this, use flock instead of flag files: the OS
    automatically releases the locks when the process holding
    them is killed.

    Change-Id: If1912e8575258a4f79321d8435c8ae1b96b78b98
    Closes-bug: 1840176
    Signed-off-by: Daniel Badea <email address hidden>
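
The difference between the two mechanisms described in the commit message can be illustrated with a small sketch. This is not the code from the commit (ceph-init-wrapper is a shell script, and its actual change is not shown here); it is a Python approximation for Linux, and the function names and file paths are hypothetical. A child process creates a flag file and takes an exclusive flock, then is killed with SIGKILL, roughly as SM would kill a slow wrapper. The flag file survives the kill, while the lock does not.

```python
import fcntl
import os
import signal
import tempfile
import time


def try_lock(path):
    """Try to take an exclusive, non-blocking flock on path.

    Returns the open file descriptor on success, or None if another
    process currently holds the lock.
    """
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    try:
        fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except BlockingIOError:
        os.close(fd)
        return None
    return fd


def kill_lock_holder(lock_path, flag_path):
    """Hypothetical demo: fork a holder process, then SIGKILL it.

    Returns (held_while_alive, flag_left_behind, lock_free_after_kill).
    """
    ready_r, ready_w = os.pipe()
    pid = os.fork()
    if pid == 0:
        # Child: stands in for a status request that is slow to respond.
        try:
            os.close(ready_r)
            open(flag_path, "w").close()  # old approach: flag file
            try_lock(lock_path)           # new approach: flock
            os.write(ready_w, b"1")       # tell the parent we are ready
            time.sleep(60)
        finally:
            os._exit(0)

    # Parent: wait until the child has created the flag and taken the lock.
    os.close(ready_w)
    os.read(ready_r, 1)
    os.close(ready_r)
    held_while_alive = try_lock(lock_path) is None

    # Kill the holder without giving it a chance to clean up.
    os.kill(pid, signal.SIGKILL)
    os.waitpid(pid, 0)

    flag_left_behind = os.path.exists(flag_path)
    fd = try_lock(lock_path)
    lock_free_after_kill = fd is not None
    if fd is not None:
        os.close(fd)
    return held_while_alive, flag_left_behind, lock_free_after_kill


if __name__ == "__main__":
    workdir = tempfile.mkdtemp()
    print(kill_lock_holder(os.path.join(workdir, "status.lock"),
                           os.path.join(workdir, "status.flag")))
```

The stale flag file is what makes the next invocation wait for a status request that no longer exists. The lock is tied to the holder's open file descriptor, so the kernel drops it when the process exits for any reason.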