OpenStack override PCI pass-through failed with Ceph nil pointer
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Hugo Nicodemos Brito |
Bug Description
Brief Description
-----------------
After successful installed Starlingx V5.0.1 with Openstack, configure OpenStack Nova to allow for PCI pass-through PCI passthrough failed with Ceph nil pointer.
Severity
--------
Critical: Time Senstive Networking (TSN) not usable due to the error
Steps to Reproduce
------------------
1/ Follow Starlingx release 5 bare metal aio simplex installation guide and installed Openstack with Starlingx successfully.
https:/
2/ Enable TSN in Starlingx following the guide https:/
3/ Create nova-tsn-pt.yaml file to allow PCI pass-through for i210 adapter, with the following content:
conf:
nova:
pci:
alias:
type: multistring
values:
- '{"vendor_id": "8086", "product_id": "1533",
passthrough
type: multistring
values:
- '{"class_id": "8086", "product_
overrides:
nova_compute:
hosts:
- conf:
nova:
DEFAULT:
my_ip: 192.168.206.2
libvirt:
pci:
type: multistring
- '{"class_id": "8086", "product_id": "1533"}'
vnc:
name: controller-0
4/ Set PCI passthrough config
> system helm-override-
5/ Confirm that the user_override lists
> system helm-override-show stx-openstack nova openstack
6/ Apply the changes
> system application-apply stx-openstack
7/ After a few minutes, the apply failed with Ceph nil pointer error (see the attached log)
Expected Behavior
------------------
Openstack PCI passthrough override should succeed without error.
Actual Behavior
----------------
The apply failed with Ceph nil pointer error. Tested a few times with clean installation of Starlingx.
Reproducibility
---------------
100% reproducible.
System Configuration
-------
One node system, natively install Starlingx in Intel NUC HadesCanyon. OAM in 192.168.1.x subnet, data network in 192.168.2.x subnet, both interfaces use flat network.
Branch/Pull Time/Commit
-------
Use Starlingx V5.0.1 green build on 27 Oct 2021, downloaded from http://
Last Pass
---------
None
Timestamp/Logs
--------------
Attached the log. Snippet of logs as below, Unique identifier "nil pointer evaluating interface {}.ceph"
2021-11-10 01:52:00.286 74 ERROR armada.
status = StatusCode.UNKNOWN
details = "render error in "nova/job-
debug_error_string = "{"created"
>
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
2021-11-10 01:52:00.286 74 ERROR armada.
Test Activity
-------------
Developer Testing
Workaround
----------
None. Each time the apply failed, my Openstack is no longer work, I have to reinstall Starlingx. Please let me know if there is a way to revert the apply after failure.
tags: |
added: stx.5.0 stx.distro.openstack removed: v5.0.1 |
tags: | added: stx stx.6.0 |
Changed in starlingx: | |
assignee: | Douglas Lopes Pereira (douglaspereira) → Hugo Nicodemos Brito (hbrito) |
Changed in starlingx: | |
importance: | Undecided → Medium |
tags: | added: stx.docs |
@Douglas, can someone from the openstack team have a look at this and provide some guidance to the reporter.