image label on SystemController and subcloud4 are different

Bug #1890340 reported by Difu Hu
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
StarlingX | Fix Released | Medium | Al Bailey |

Bug Description

Brief Description
-----------------
subcloud4 has 2 device labels, but the image is applied on SystemController with only 1 label.
As a result, the image's applied labels on SystemController and on subcloud4 differ.

Severity
--------
Major

Steps to Reproduce
------------------
Precondition: subcloud4 has 2 device labels.
system --os-region-name SystemController device-image-apply 8a304eb6-11fd-4db1-8830-1d9ba249dfe0 subcloud4=8
dcmanager fw-update-strategy create --subcloud-apply-type serial
dcmanager fw-update-strategy apply

Expected Behavior
------------------
subcloud4 is updated with the image labeled subcloud4=8 only.

Actual Behavior
----------------
subcloud4 is updated with the image labeled both subcloud4=8 and subcloudgroup=2.

Reproducibility
---------------
not sure

System Configuration
--------------------
Lab-name: DC-3

Branch/Pull Time/Commit
-----------------------
2020-07-31_20-00-00

Last Pass
---------
N/A

Timestamp/Logs
--------------
[sysadmin@controller-0 ~(keystone_admin)]$ system --os-region-name SystemController device-image-show 8a304eb6-11fd-4db1-8830-1d9ba249dfe0
+----------------+--------------------------------------+
| Property | Value |
+----------------+--------------------------------------+
| uuid | 8a304eb6-11fd-4db1-8830-1d9ba249dfe0 |
| bitstream_type | functional |
| pci_vendor | 8086 |
| pci_device | 0b30 |
| bitstream_id | 1 |
| key_signature | None |
| revoke_key_id | None |
| name | None |
| description | None |
| image_version | None |
| applied | True |
| applied_labels | [{u'subcloud4': u'8'}] |
+----------------+--------------------------------------+
[sysadmin@controller-0 ~(keystone_admin)]$ system --os-auth-url https://[fd01:304::2]:5001/v3 --os-region-name subcloud4 device-image-show 8a304eb6-11fd-4db1-8830-1d9ba249dfe0
+----------------+--------------------------------------------------+
| Property | Value |
+----------------+--------------------------------------------------+
| uuid | 8a304eb6-11fd-4db1-8830-1d9ba249dfe0 |
| bitstream_type | functional |
| pci_vendor | 8086 |
| pci_device | 0b30 |
| bitstream_id | 1 |
| key_signature | None |
| revoke_key_id | None |
| name | None |
| description | None |
| image_version | None |
| applied | True |
| applied_labels | [{u'subcloudgroup': u'2'}, {u'subcloud4': u'8'}] |
+----------------+--------------------------------------------------+
[sysadmin@controller-0 ~(keystone_admin)]$ system --os-auth-url https://[fd01:304::2]:5001/v3 --os-region-name subcloud4 device-label-list
+--------------+------------------+---------------+-------------+
| hostname | PCI device name | label key | label value |
+--------------+------------------+---------------+-------------+
| controller-0 | pci_0000_b4_00_0 | subcloudgroup | 2 |
| controller-0 | pci_0000_b4_00_0 | subcloud4 | 8 |
+--------------+------------------+---------------+-------------+
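
The mismatch in the two device-image-show outputs above can be stated as a set difference. The following is only an illustrative sketch, not code from StarlingX: the label lists are copied from the applied_labels rows above, and the helper names (to_pairs, extra_labels) are hypothetical.

```python
# Label lists copied from the applied_labels rows of the two
# device-image-show outputs (SystemController and subcloud4).
system_controller_labels = [{"subcloud4": "8"}]
subcloud_labels = [{"subcloudgroup": "2"}, {"subcloud4": "8"}]


def to_pairs(applied_labels):
    """Flatten a list of single-entry dicts into a set of (key, value) pairs.

    None (as shown when no labels are applied) is treated as an empty list.
    """
    return {
        (key, value)
        for label in (applied_labels or [])
        for key, value in label.items()
    }


def extra_labels(reference, actual):
    """Return the label pairs present in `actual` but missing from `reference`."""
    return sorted(to_pairs(actual) - to_pairs(reference))


# Labels applied on subcloud4 that were never applied on SystemController.
print(extra_labels(system_controller_labels, subcloud_labels))
# → [('subcloudgroup', '2')]
```

An empty result would mean the subcloud carries no labels beyond those applied on SystemController, which is the expected behavior described in this report.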

After the update, both labels on subcloud4 were removed.
[sysadmin@controller-0 ~(keystone_admin)]$ system --os-auth-url https://[fd01:304::2]:5001/v3 --os-region-name subcloud4 device-label-list

[sysadmin@controller-0 ~(keystone_admin)]$ system --os-auth-url https://[fd01:304::2]:5001/v3 --os-region-name subcloud4 device-image-show 8a304eb6-11fd-4db1-8830-1d9ba249dfe0
+----------------+--------------------------------------+
| Property | Value |
+----------------+--------------------------------------+
| uuid | 8a304eb6-11fd-4db1-8830-1d9ba249dfe0 |
| bitstream_type | functional |
| pci_vendor | 8086 |
| pci_device | 0b30 |
| bitstream_id | 1 |
| key_signature | None |
| revoke_key_id | None |
| name | None |
| description | None |
| image_version | None |
| applied | False |
| applied_labels | None |
+----------------+--------------------------------------+

host-device-show displays "updated_at | None"
[sysadmin@controller-0 ~(keystone_admin)]$ system --os-auth-url https://[fd01:304::2]:5001/v3 --os-region-name subcloud4 host-device-show controller-0 0000:b4:00.0
+-----------------------+----------------------------------+
| Property | Value |
+-----------------------+----------------------------------+
| name | pci_0000_b4_00_0 |
| address | 0000:b4:00.0 |
| class id | 120000 |
| vendor id | 8086 |
| device id | 0b30 |
| class name | Processing accelerators |
| vendor name | Intel Corporation |
| device name | Device 0b30 |
| numa_node | 0 |
| enabled | True |
| sriov_totalvfs | 1 |
| sriov_numvfs | 0 |
| sriov_vfs_pci_address | |
| sriov_vf_pdevice_id | None |
| extra_info | |
| created_at | 2020-08-04T20:17:05.019664+00:00 |
| updated_at | None |
| root_key | |
| revoked_key_ids | |
| boot_page | user |
| bitstream_id | 0x2383a62a010504 |
| bmc_build_version | D.2.0.6 |
| bmc_fw_version | D.2.0.21 |
| driver | intel-fpga-pci |
| sriov_vf_driver | None |
+-----------------------+----------------------------------+

Test Activity
-------------
Functional Testing

Difu Hu (difuhu)
description: updated
Ghada Khalil (gkhalil)
tags: added: stx.fpga
Ghada Khalil (gkhalil) wrote :

As discussed with Al Bailey and Teresa Ho, this should be resolved with the following code changes:
o https://review.opendev.org/#/c/744906/
o https://review.opendev.org/#/c/744787/

Ghada Khalil (gkhalil) wrote :

Marking as stx.5.0 - issue related to DC FPGA Orchestration feature

Changed in starlingx:
status: New → Triaged
tags: added: stx.5.0
OpenStack Infra (hudson-openstack) wrote : Fix merged to config (master)

Reviewed: https://review.opendev.org/744906
Committed: https://git.openstack.org/cgit/starlingx/config/commit/?id=c8e814320f545191959015a5631b7526fc862ec6
Submitter: Zuul
Branch: master

commit c8e814320f545191959015a5631b7526fc862ec6
Author: Teresa Ho <email address hidden>
Date: Wed Aug 5 09:04:19 2020 -0400

    Fix label usage in device image commands

    When an image is deleted, the device labels that are used to apply
    the image and are not associated with any physical hardware are not removed.
    The commit removes the device label if it is the last image associated
    with the label and it is not associated with physical hardware.

    The device image apply with multiple labels of the same device
    for root-key image was incorrectly blocked. This update corrects
    the validation of this case.

    Closes-Bug: 1890299
    Closes-Bug: 1890324
    Closes-Bug: 1890340

    Change-Id: Iaa34b9fe57678d6283a4cce4a30d7df4673ec996
    Signed-off-by: Teresa Ho <email address hidden>
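
The label cleanup rule described in the commit message above (remove a device label only if the deleted image was the last one using it and no physical hardware carries it) can be sketched as follows. This is a hypothetical illustration of that rule, not the code merged in the config repository; the function name, the argument layout, and the use of (key, value) tuples are all assumptions.

```python
def removable_labels(deleted_image_labels, other_images_labels, hardware_labels):
    """Return labels that may be removed after an image is deleted.

    deleted_image_labels: (key, value) pairs applied to the deleted image.
    other_images_labels:  one iterable of (key, value) pairs per remaining image.
    hardware_labels:      (key, value) pairs assigned to physical devices.

    A label is removable only when no remaining image uses it and no
    physical device carries it.
    """
    still_used = set().union(*other_images_labels)
    on_hardware = set(hardware_labels)
    return sorted(
        label
        for label in set(deleted_image_labels)
        if label not in still_used and label not in on_hardware
    )


# Hypothetical scenario: another image still uses subcloudgroup=2 and no
# physical device carries either label.
deleted = [("subcloud4", "8"), ("subcloudgroup", "2")]
others = [[("subcloudgroup", "2")]]
print(removable_labels(deleted, others, hardware_labels=[]))
# → [('subcloud4', '8')]
```

If a physical device still carried subcloud4=8, the same call would return an empty list and the label would be kept.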

Changed in starlingx:
status: In Progress → Fix Released
Ghada Khalil (gkhalil) wrote :

Re-opening until the second commit merges

Changed in starlingx:
importance: Undecided → Medium
assignee: nobody → Al Bailey (albailey1974)
status: Fix Released → In Progress
OpenStack Infra (hudson-openstack) wrote : Fix merged to distcloud (master)

Reviewed: https://review.opendev.org/744787
Committed: https://git.openstack.org/cgit/starlingx/distcloud/commit/?id=663fcf0f8868c57540e682d47b93aab3d7a0c12c
Submitter: Zuul
Branch: master

commit 663fcf0f8868c57540e682d47b93aab3d7a0c12c
Author: albailey <email address hidden>
Date: Fri Jul 31 16:12:46 2020 -0500

    Fixes for DC firmware orchestration related to labels and upload

    The applied image labels on the system controller should be the labels
    used when applying the image on the subcloud.

    The stage that removes images now uses all the applied labels
    for the image when removing them.

    The code that converts image fields for upload was passing an integer
    for the revoke_key_id. It now converts to a string to prevent an
    encode exception.

    This commit also enables some unit tests for DC firmware
    orchestration that were being skipped.

    Closes-Bug: 1890355
    Closes-Bug: 1890340
    Closes-Bug: 1890324
    Change-Id: I1a730b01c0bbaeb5f3e1f6ecfb19f649911da453
    Signed-off-by: albailey <email address hidden>
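
Two of the changes described in the commit message above lend themselves to a small sketch: taking the labels for the subcloud apply from the system controller's applied labels, and converting upload fields such as revoke_key_id to strings. The code below is an assumption-laden illustration, not the distcloud implementation; the function names, the dict layout, and the revoke_key_id value of 3 are hypothetical.

```python
def labels_for_subcloud_apply(system_controller_image):
    """Return the labels to use when applying the image on a subcloud.

    Only the labels applied on the system controller are used; the
    subcloud's own device labels are deliberately not consulted.
    """
    applied = system_controller_image.get("applied_labels") or []
    return [dict(label) for label in applied]


def stringify_upload_fields(fields):
    """Convert every non-None field value to a string before upload.

    Integer values (for example revoke_key_id) become strings; fields
    whose value is None are left out.
    """
    return {key: str(value) for key, value in fields.items() if value is not None}


# Image as seen on SystemController (values taken from the logs in this
# report, except revoke_key_id, which is a made-up integer for illustration).
sc_image = {
    "uuid": "8a304eb6-11fd-4db1-8830-1d9ba249dfe0",
    "applied_labels": [{"subcloud4": "8"}],
}
print(labels_for_subcloud_apply(sc_image))
# → [{'subcloud4': '8'}]

print(stringify_upload_fields({"bitstream_type": "functional", "revoke_key_id": 3, "name": None}))
# → {'bitstream_type': 'functional', 'revoke_key_id': '3'}
```

With this selection, subcloud4 would receive only subcloud4=8, matching the expected behavior in the bug description.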

Changed in starlingx:
status: In Progress → Fix Released
Difu Hu (difuhu) wrote :

Verified on build 2020-08-06_20-00-00.

tags: removed: stx.retestneeded
Ghada Khalil (gkhalil)
tags: added: stx.distcloud
OpenStack Infra (hudson-openstack) wrote : Fix proposed to distcloud (f/centos8)

Fix proposed to branch: f/centos8
Review: https://review.opendev.org/c/starlingx/distcloud/+/792298

OpenStack Infra (hudson-openstack) wrote : Change abandoned on distcloud (f/centos8)

Change abandoned by "Chuck Short <email address hidden>" on branch: f/centos8
Review: https://review.opendev.org/c/starlingx/distcloud/+/792298
Reason: Updated merge soon

OpenStack Infra (hudson-openstack) wrote : Fix proposed to distcloud (f/centos8)

Fix proposed to branch: f/centos8
Review: https://review.opendev.org/c/starlingx/distcloud/+/793405

OpenStack Infra (hudson-openstack) wrote : Change abandoned on distcloud (f/centos8)

Change abandoned by "Chuck Short <email address hidden>" on branch: f/centos8
Review: https://review.opendev.org/c/starlingx/distcloud/+/793405

OpenStack Infra (hudson-openstack) wrote : Fix proposed to distcloud (f/centos8)

Fix proposed to branch: f/centos8
Review: https://review.opendev.org/c/starlingx/distcloud/+/796528

OpenStack Infra (hudson-openstack) wrote : Fix merged to distcloud (f/centos8)

Reviewed: https://review.opendev.org/c/starlingx/distcloud/+/796528
Committed: https://opendev.org/starlingx/distcloud/commit/4c5344f8765b372cb84d2b1181589c16db2ae6e4
Submitter: "Zuul (22348)"
Branch: f/centos8

commit cb979811017bd193fc1f06e53bb7830fd3184859
Author: Yuxing Jiang <email address hidden>
Date: Wed Jun 9 11:11:27 2021 -0400

    Format the IP addresses in payload before adding a subcloud

    The IPv6 addresses can be represented in multiple formats. As IP
    addresses are stored as text in database, ansible inventory and
    overrides, this commit converts the IP addresses in payload to
    standard text format of IPv6 address during adding a new subcloud.

    Tested with installing and bootstrapping a new subcloud(RVMC
    configured) with the correct IPv6 address values, but with
    unrecommended upper case letters and '0'. The addresses are
    converted to standard format in database, ansible inventory and
    overrides files.

    Partial-Bug: 1931459
    Signed-off-by: Yuxing Jiang <email address hidden>
    Change-Id: I6c26e749941f1ea2597f91886ad8f7da64521f0d

commit 2cf5d6d5cef0808c354f7575336aec34253993b3
Author: albailey <email address hidden>
Date: Thu May 20 14:19:24 2021 -0500

    Delete existing vim strategy from subcloud during patch orch

    When dcmanager creates a patch strategy, if a subcloud has an
    existing vim patch strategy, it will attempt to re-use
    that strategy during its patching phase, which may result in an
    error.

    This commit deletes the existing vim patch strategy in
    a subcloud, if it exists, so it can be re-created.
    If the strategy cannot be deleted, orchestration fails.

    Change-Id: Id35ef26ed3ddae6d71874fc6bac11df147f72323
    Closes-Bug: 1929221
    Signed-off-by: albailey <email address hidden>

commit 9e14c83f0162549a2a94cb8bc1e73dbc4f4d4887
Author: albailey <email address hidden>
Date: Tue Jun 1 14:37:14 2021 -0500

    Adding activation retry to upgrade orchestration

    When performing an activation, the keystone endpoints may not
    be accessible in the subcloud due to the asynchronous way that
    cert-mon can trigger a restart of keystone.

    This would have occasionally resulted in the upgrade activation
    failing to be initiated, and orchestration needing to be invoked
    again to resume.

    This 'hack' adds retries and sleeps to the initial
    activation action.

    Change-Id: Ic757521dec7bdc248a51a70b5463caafe7927360
    Partial-Bug: 1927550
    Signed-off-by: albailey <email address hidden>

commit bb604c0a9b872efd65fa45f1e2269995818c6262
Author: Tee Ngo <email address hidden>
Date: Thu May 27 22:17:16 2021 -0400

    Fix subcloud show --detail command related issues

    If the subcloud is offline, the command stalls and eventually returns
    the "ERROR (app)" output. If the subcloud is online, the oam_floating_ip
    info is excluded from the output when the subcloud id instead of subcloud
    name is specified.

    This commit fixes both of the above issues.

    Closes-Bug: 1929893
    Change-Id: I995591368564539b0e6af185b1adba2db73e0e46
    Sign...

tags: added: in-f-centos8