Obsolete filesystems should be removed from the calculation of the minimal PV size in AIO kickstart

Bug #1892554 reported by Angie Wang
14
This bug affects 1 person
Affects Status Importance Assigned to Milestone
StarlingX
Fix Released
High
Angie Wang

Bug Description

Brief Description
-----------------
In All-in-one system, Additional cgts-vg space created on installation for some filesystems like backup and obsolete filesystems like gnocchi and anchor. We should preserve the PV size that fits the minimum requirement to allow users to partition as they need.

Severity
--------
Minor as it just provides additional free space in cgts-vg

Expected Behavior
------------------
partitioned cgts-vg size fits the minimum requirement

Actual Behavior
----------------
additional free-space in partitioned cgts-vg

Reproducibility
---------------
100%

System Configuration
--------------------
AIO

Test Activity
-------------
Developer Testing

Angie Wang (angiewang)
Changed in starlingx:
assignee: nobody → Angie Wang (angiewang)
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to metal (master)

Fix proposed to branch: master
Review: https://review.opendev.org/747499

Changed in starlingx:
status: New → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to config (master)

Fix proposed to branch: master
Review: https://review.opendev.org/747500

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to ansible-playbooks (master)

Fix proposed to branch: master
Review: https://review.opendev.org/747501

Bill Zvonar (billzvonar)
tags: added: stx.5.0
Ghada Khalil (gkhalil)
tags: added: stx.config
Changed in starlingx:
importance: Undecided → Medium
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to metal (master)

Reviewed: https://review.opendev.org/747499
Committed: https://git.openstack.org/cgit/starlingx/metal/commit/?id=c7e18ca9e92cfe339d37763dccd776848dff5941
Submitter: Zuul
Branch: master

commit c7e18ca9e92cfe339d37763dccd776848dff5941
Author: Angie Wang <email address hidden>
Date: Fri Aug 21 18:39:42 2020 -0400

    Correct the preserved cgts-vg size in AIO kickstart

    Filesystems gnocchi and anchor were obsolete. Remove them
    from the calculation of the minimum cgts-vg size. Correct
    some other filesystem sizes like backup and pgsql.

    We should preserve the PV size that fits the minimum
    requirement to allow users to partition as they need.

    Tested on both hardware labs and vbox.

    Change-Id: I80d0765e3c6719e71a0647cab15630077ab42067
    Partial-Bug: 1892554
    Signed-off-by: Angie Wang <email address hidden>

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to config (master)

Reviewed: https://review.opendev.org/747500
Committed: https://git.openstack.org/cgit/starlingx/config/commit/?id=4188c5690ef6291f36ad372826746885691c7087
Submitter: Zuul
Branch: master

commit 4188c5690ef6291f36ad372826746885691c7087
Author: Angie Wang <email address hidden>
Date: Fri Aug 21 18:40:50 2020 -0400

    Correct the minimum small root disk size

    Some filesystems/partitions (ie.kubelet-lv, scratch-lv,
    /opt/platform-backup) are not being counted or not
    counted with the correct size in the minimum
    root disk size calculation.

    Change-Id: I41b4e2ddc9b6e7a1e3ac4a62da48630ce47ee76c
    Partial-Bug: 1892554
    Depends-On: https://review.opendev.org/#/c/747499/
    Signed-off-by: Angie Wang <email address hidden>

Changed in starlingx:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to ansible-playbooks (master)

Reviewed: https://review.opendev.org/747501
Committed: https://git.openstack.org/cgit/starlingx/ansible-playbooks/commit/?id=43b3aed93f309bb468c26a06809743702c72f076
Submitter: Zuul
Branch: master

commit 43b3aed93f309bb468c26a06809743702c72f076
Author: Angie Wang <email address hidden>
Date: Fri Aug 21 19:37:52 2020 -0400

    Update the minimum root disk size to be aligned with sysinv

    Change-Id: I2798c9b98cfbeede8af486209479f7a83946f500
    Closes-Bug: 1892554
    Depends-On: https://review.opendev.org/747500
    Signed-off-by: Angie Wang <email address hidden>

Angie Wang (angiewang)
Changed in starlingx:
status: Fix Released → In Progress
Revision history for this message
Tee Ngo (teewrs) wrote :

The priority of this LP should be increased to high. It is causing DC upgrade to fail. The controller-1 would fail to unlock after being upgraded from stx4.0 (20.06) to master (20.12) with the following error in a DC lab:

2020-11-04-22-01-07_controller/puppet.log:2020-11-04T22:02:51.024 Error: 2020-11-04 22:02:50 +0000 Execution of '/usr/sbin/lvcreate -n dockerdistribution-lv --size 16777216k cgts-vg' returned 5: Volume group "cgts-vg" has insufficient free space (453 extents): 512 required.
2020-11-04-22-01-07_controller/puppet.log:2020-11-04T22:02:51.108 Error: 2020-11-04 22:02:50 +0000 /Stage[main]/Platform::Drbd::Dockerdistribution/Platform::Drbd::Filesystem[drbd-dockerdistribution]/Logical_volume[dockerdistribution-lv]/ensure: change from absent to present failed: Execution of '/usr/sbin/lvcreate -n dockerdistribution-lv --size 16777216k cgts-vg' returned 5: Volume group "cgts-vg" has insufficient free space (453 extents): 512 required.
2020-11-04-22-01-07_controller/puppet.log:2020-11-04T22:03:19.527 echo "Error removing osd ${osd_id}, exit code: ${RET}"
2020-11-04-22-01-07_controller/puppet.log:2020-11-04T22:03:19.554 echo "Error creating osd ${osd_id}, exit code: ${RET}"
2020-11-04-22-01-07_controller/puppet.log:2020-11-04T22:03:19.634 echo "Error removing osd ${osd_id}, exit code: ${RET}"
2020-11-04-22-01-07_controller/puppet.log:2020-11-04T22:03:19.659 echo "Error creating osd ${osd_id}, exit code: ${RET}"
2020-11-04-22-07-19_controller/puppet.log:2020-11-04T22:08:47.627 Error: 2020-11-04 22:08:47 +0000 Execution of '/usr/sbin/lvcreate -n dockerdistribution-lv --size 16777216k cgts-vg' returned 5: Volume group "cgts-vg" has insufficient free space (69 extents): 512 required.
2020-11-04-22-07-19_controller/puppet.log:2020-11-04T22:08:47.708 Error: 2020-11-04 22:08:47 +0000 /Stage[main]/Platform::Drbd::Dockerdistribution/Platform::Drbd::Filesystem[drbd-dockerdistribution]/Logical_volume[dockerdistribution-lv]/ensure: change from absent to present failed: Execution of '/usr/sbin/lvcreate -n dockerdistribution-lv --size 16777216k cgts-vg' returned 5: Volume group "cgts-vg" has insufficient free space (69 extents): 512 required.

I'm going to try a new master image with the following commits reverted
https://review.opendev.org/#/c/747499/
https://review.opendev.org/#/c/747500/

Ghada Khalil (gkhalil)
Changed in starlingx:
importance: Medium → High
Revision history for this message
Angie Wang (angiewang) wrote :

The upgrade fix is in progress

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to config (master)

Fix proposed to branch: master
Review: https://review.opendev.org/762572

Revision history for this message
Angie Wang (angiewang) wrote :

The upgrade failure fixed by https://review.opendev.org/c/starlingx/config/+/762572/.
Merged on 2020-12-03.

Changed in starlingx:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote :

Fix proposed to branch: master
Review: https://review.opendev.org/c/starlingx/config/+/793077

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to config (master)

Reviewed: https://review.opendev.org/c/starlingx/config/+/793077
Committed: https://opendev.org/starlingx/config/commit/23505ba77d76114cf8a0bf833f9a5bcd05bc1dd1
Submitter: "Zuul (22348)"
Branch: master

commit 23505ba77d76114cf8a0bf833f9a5bcd05bc1dd1
Author: Angie Wang <email address hidden>
Date: Tue May 25 18:49:21 2021 -0400

    Fix issue in partition data migration script

    The created partition dictonary partition_map is not
    an ordered dict so we need to sort it by its key -
    device node when iterating it to adjust the device
    nodes/paths for user created extra partitions to ensure
    the number of device node/path for each extra partition
    is calculated correctly, otherwise the adjustments
    could be messy that causes the partition DB update to
    fail.

    Tested AIO-SX upgrade with three additional partitions.

    Change-Id: I1cb3bbfaf144a59d29633c1784b0fde80529cd71
    Closes-Bug: 1892554
    Signed-off-by: Angie Wang <email address hidden>

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to config (r/stx.5.0)

Fix proposed to branch: r/stx.5.0
Review: https://review.opendev.org/c/starlingx/config/+/793352

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Change abandoned on config (r/stx.5.0)

Change abandoned by "Angie Wang <email address hidden>" on branch: r/stx.5.0
Review: https://review.opendev.org/c/starlingx/config/+/793352
Reason: Abandon as it's a corner case

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to config (f/centos8)

Fix proposed to branch: f/centos8
Review: https://review.opendev.org/c/starlingx/config/+/793460

Revision history for this message
OpenStack Infra (hudson-openstack) wrote :

Fix proposed to branch: f/centos8
Review: https://review.opendev.org/c/starlingx/config/+/793696

Revision history for this message
OpenStack Infra (hudson-openstack) wrote :

Fix proposed to branch: f/centos8
Review: https://review.opendev.org/c/starlingx/config/+/794611

Revision history for this message
OpenStack Infra (hudson-openstack) wrote :

Fix proposed to branch: f/centos8
Review: https://review.opendev.org/c/starlingx/config/+/794906

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Change abandoned on config (f/centos8)

Change abandoned by "Chuck Short <email address hidden>" on branch: f/centos8
Review: https://review.opendev.org/c/starlingx/config/+/794611

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to config (f/centos8)
Download full text (147.3 KiB)

Reviewed: https://review.opendev.org/c/starlingx/config/+/794906
Committed: https://opendev.org/starlingx/config/commit/75758b37a5a23c8811355b67e2a430a1713cd85b
Submitter: "Zuul (22348)"
Branch: f/centos8

commit 9e420d9513e5fafb1df4d29567bc299a9e04d58d
Author: Bin Qian <email address hidden>
Date: Mon May 31 14:45:52 2021 -0400

    Add more logging to run docker login

    Add error log for running docker login. The new log could
    help identify docker login failure.

    Closes-Bug: 1930310
    Change-Id: I8a709fb6665de8301fbe3022563499a92b2a0211
    Signed-off-by: Bin Qian <email address hidden>

commit 31c77439d2cea590dfcca13cfa646522665f8686
Author: albailey <email address hidden>
Date: Fri May 28 13:42:42 2021 -0500

    Fix controller-0 downgrade failing to kill ceph

    kill_ceph_storage_monitor tried to manipulate a pmon
    file that does not exist in an AIO-DX environment.

    We no longer invoke kill_ceph_storage_monitor in an
    AIO SX or DX env.

    This allows: "system host-downgrade controller-0"
    to proceed in an AIO-DX environment where that second
    controller (controller-0) was upgraded.

    Partial-Bug: 1929884
    Signed-off-by: albailey <email address hidden>
    Change-Id: I633853f75317736084feae96b5b849c601204c13

commit 0dc99eee608336fe01b58821ea404286371f1408
Author: albailey <email address hidden>
Date: Fri May 28 11:05:43 2021 -0500

    Fix file permissions failure during duplex upgrade abort

    When issuing a downgrade for controller-0 in a duplex upgrade
    abort and rollback scenario, the downgrade command was failing
    because the sysinv API does not have root permissions to set
    a file flag.
    The fix is to use RPC so the conductor can create the flag
    and allow the downgrade for controller-0 to get further.

    Partial-Bug: 1929884
    Signed-off-by: albailey <email address hidden>
    Change-Id: I913bcad73309fe887a12cbb016a518da93327947

commit 7ef3724dad173754e40b45538b1cc726a458cc1c
Author: Chen, Haochuan Z <email address hidden>
Date: Tue May 25 16:16:29 2021 +0800

    Fix bug rook-ceph provision with multi osd on one host

    Test case:
    1, deploy simplex system
    2, apply rook-ceph with below override value
    value.yaml
    cluster:
      storage:
        nodes:
        - name: controller-0
          devices:
          - name: sdb
          - name: sdc
    3, reboot

    Without this fix, only osd pod could launch successfully after boot
    as vg start with ceph could not correctly add in sysinv-database

    Closes-bug: 1929511

    Change-Id: Ia5be599cd168d13d2aab7b5e5890376c3c8a0019
    Signed-off-by: Chen, Haochuan Z <email address hidden>

commit 23505ba77d76114cf8a0bf833f9a5bcd05bc1dd1
Author: Angie Wang <email address hidden>
Date: Tue May 25 18:49:21 2021 -0400

    Fix issue in partition data migration script

    The created partition dictonary partition_map is not
    an ordered dict so we need to sort it by its key -
    device node when iterating it to adjust the device
    nodes/paths for user created extra partitions to ensure
    the number of device node...

tags: added: in-f-centos8
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Change abandoned on config (f/centos8)

Change abandoned by "Chuck Short <email address hidden>" on branch: f/centos8
Review: https://review.opendev.org/c/starlingx/config/+/793696

Revision history for this message
OpenStack Infra (hudson-openstack) wrote :

Change abandoned by "Chuck Short <email address hidden>" on branch: f/centos8
Review: https://review.opendev.org/c/starlingx/config/+/793460

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.