Ceph public/cluster networks should not be updated during overcloud deploy

Bug #1980992 reported by John Fulton
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
High
John Fulton

Bug Description

If you deploy ceph like this:

openstack overcloud ceph deploy \
-o /home/stack/templates/overcloud-ceph-deployed.yaml \
--container-image-prepare "/home/stack/containers-prepare-parameter.yaml" \
 --config /home/stack/initial-ceph.conf ...

Where initial-ceph.conf contains the following:

 [global]
 public_network = '172.120.3.0/24,172.117.3.0/24,172.118.3.0/24,172.119.3.0/24'
 cluster_network = '172.120.4.0/24,172.117.4.0/24,172.118.4.0/24,172.119.4.0/24'
 ms_bind_ipv4 = true
 ms_bind_ipv6 = false

Then the ceph networks are configured correctly. However, if you then deploy the overcloud it will update the Ceph public/cluster networks with data from Heat which might not be correct as the initial-ceph.conf is not excepted to be passed to the overcloud deployment.

This change is not sufficient because deployed_ceph should be true during the initial overcloud deployment:

https://review.opendev.org/c/openstack/tripleo-ansible/+/846950/1/tripleo_ansible/roles/tripleo_cephadm/tasks/cephadm_config_set.yaml#27

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-ansible (master)
Changed in tripleo:
status: Triaged → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-ansible (stable/wallaby)

Fix proposed to branch: stable/wallaby
Review: https://review.opendev.org/c/openstack/tripleo-ansible/+/849083

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-ansible (master)

Reviewed: https://review.opendev.org/c/openstack/tripleo-ansible/+/849048
Committed: https://opendev.org/openstack/tripleo-ansible/commit/70bbaf6b81074256cfa20c2698c2ae87d76d95cf
Submitter: "Zuul (22348)"
Branch: master

commit 70bbaf6b81074256cfa20c2698c2ae87d76d95cf
Author: John Fulton <email address hidden>
Date: Thu Jul 7 17:59:04 2022 -0400

    Only run Ceph network_config_set during initial deployment

    There is no reason to include the network_config_set tasks
    file in the cephadm_config_set tasks which can be run both
    before and during overcloud deployment. It is safer to only
    run it during the initial Ceph deployment. The danger is that
    the network_config_set tasks file from the tripleo_cephadm
    role can misconfigure Ceph during overcloud deployment if
    the Ceph deployment relied on an initial ceph.conf to
    configure the Ceph networks.

    Change-Id: I74e1197362517c3ad4bbdfc54adbf3ac934acf4b
    Closes-Bug: #1980992

Changed in tripleo:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-ansible (stable/wallaby)

Reviewed: https://review.opendev.org/c/openstack/tripleo-ansible/+/849083
Committed: https://opendev.org/openstack/tripleo-ansible/commit/69d373547df71c422c181eb2efb9a5ab445057c7
Submitter: "Zuul (22348)"
Branch: stable/wallaby

commit 69d373547df71c422c181eb2efb9a5ab445057c7
Author: John Fulton <email address hidden>
Date: Thu Jul 7 17:59:04 2022 -0400

    Only run Ceph network_config_set during initial deployment

    There is no reason to include the network_config_set tasks
    file in the cephadm_config_set tasks which can be run both
    before and during overcloud deployment. It is safer to only
    run it during the initial Ceph deployment. The danger is that
    the network_config_set tasks file from the tripleo_cephadm
    role can misconfigure Ceph during overcloud deployment if
    the Ceph deployment relied on an initial ceph.conf to
    configure the Ceph networks.

    Change-Id: I74e1197362517c3ad4bbdfc54adbf3ac934acf4b
    Closes-Bug: #1980992
    (cherry picked from commit 70bbaf6b81074256cfa20c2698c2ae87d76d95cf)

tags: added: in-stable-wallaby
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-ansible 5.0.0

This issue was fixed in the openstack/tripleo-ansible 5.0.0 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.