Distributed Cloud DX subcloud standby controller keeps rebooting after unlock

Bug #1793200 reported by Peng Peng
This bug affects 1 person
Affects     Status    Importance  Assigned to   Milestone
StarlingX   Invalid   High        Matt Peters

Bug Description

Brief Description
-----------------
When the DX subcloud standby controller (controller-1) is unlocked, it keeps rebooting.

Severity
--------
Critical

Steps to Reproduce
------------------
1. Install the System Controller
2. Install the DX subcloud
3. During lab_setup, unlock controller-1

Expected Behavior
------------------
controller-1 boots up properly

Actual Behavior
----------------
controller-1 keeps rebooting

Reproducibility
---------------
Reproducible

System Configuration
--------------------
Distributed Cloud, two-node subcloud

Branch/Pull Time/Commit
-----------------------
master as of 2018-09-10_20-18-00

Timestamp/Logs
--------------
controller-1:/var/log/puppet/latest# vi puppet.log
controller-1:/var/log/puppet/latest# grep failed puppet.log
2018-09-14T21:21:39.390 ovs-vsctl: unix:/var/run/openvswitch/db.sock: database connection failed (No such file or directory)
2018-09-14T21:22:13.544 Notice: 2018-09-14 21:22:13 +0000 /Stage[main]/Platform::Vswitch::Ovs/Platform::Vswitch::Ovs::Device[0000:05:00.1]/Exec[ovs-bind-device: 0000:05:00.1]/returns: Error: bind failed for 0000:05:00.1 - Cannot bind to driver vfio-pci
2018-09-14T21:22:13.546 Notice: 2018-09-14 21:22:13 +0000 /Stage[main]/Platform::Vswitch::Ovs/Platform::Vswitch::Ovs::Device[0000:05:00.1]/Exec[ovs-bind-device: 0000:05:00.1]/returns: Error: unbind failed for 0000:05:00.1 - Cannot open /sys/bus/pci/drivers//unbind
2018-09-14T21:22:13.632 Error: 2018-09-14 21:22:13 +0000 /Stage[main]/Platform::Vswitch::Ovs/Platform::Vswitch::Ovs::Device[0000:05:00.1]/Exec[ovs-bind-device: 0000:05:00.1]/returns: change from notrun to 0 failed: dpdk-devbind.py --bind=vfio-pci 0000:05:00.1 returned 1 instead of one of [0]
2018-09-14T21:22:13.638 Warning: 2018-09-14 21:22:13 +0000 /Stage[main]/Vswitch::Dpdk/Service[openvswitch]: Skipping because of failed dependencies
2018-09-14T21:22:13.852 Error: 2018-09-14 21:22:13 +0000 Failed to apply catalog: Execution of '/usr/bin/ovs-vsctl list Open_vSwitch .' returned 1: ovs-vsctl: unix:/var/run/openvswitch/db.sock: database connection failed (No such file or directory)
controller-1:/var/log/puppet/latest#
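
The bind failures in the log above can also be pulled out programmatically. The following is a minimal sketch, not part of the original report: the function name `find_bind_failures` and the regular expression are illustrative assumptions, based only on the "Error: bind failed for <PCI address> - <reason>" message format visible in the grep output.

```python
import re

# Assumed message format, taken from the puppet.log lines shown above:
#   "... Error: bind failed for 0000:05:00.1 - Cannot bind to driver vfio-pci"
BIND_FAILURE = re.compile(
    r"Error: bind failed for "
    r"(?P<pci>[0-9a-fA-F]{4}:[0-9a-fA-F]{2}:[0-9a-fA-F]{2}\.[0-7])"
    r" - (?P<reason>.+)$"
)


def find_bind_failures(lines):
    """Return (pci_address, reason) tuples for every bind failure found."""
    failures = []
    for line in lines:
        match = BIND_FAILURE.search(line)
        if match:
            failures.append((match.group("pci"), match.group("reason").strip()))
    return failures
```

Passing an open puppet.log file object (or any iterable of lines) to `find_bind_failures` yields the affected PCI addresses, which narrows down which NIC ports could not be bound.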

Ghada Khalil (gkhalil)
summary: - STX: Distributed Cloud DX subcloud standby controller keeps rebooting
- after unlock
+ Distributed Cloud DX subcloud standby controller keeps rebooting after
+ unlock
Changed in starlingx:
importance: Undecided → High
tags: added: stx.distcloud
Ghada Khalil (gkhalil) wrote :

This is a lab configuration issue. Intel VT-d (Virtualization Technology for Directed I/O) does not appear to be enabled on controller-1, so it cannot bind the NIC devices to the vfio-pci driver.

Marking this bug as invalid
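
A quick way to confirm this kind of diagnosis on a Linux host is to check whether the kernel has an active IOMMU. The sketch below is illustrative and not taken from the StarlingX tooling: the helper names are hypothetical, and it relies on two general Linux conventions, namely that /sys/kernel/iommu_groups is populated when an IOMMU is active and that `intel_iommu=on` is the kernel parameter commonly used to enable VT-d support.

```python
from pathlib import Path


def iommu_groups_present(sysfs_root="/sys"):
    """True if <sysfs_root>/kernel/iommu_groups exists and has at least one group."""
    groups = Path(sysfs_root) / "kernel" / "iommu_groups"
    return groups.is_dir() and any(groups.iterdir())


def cmdline_requests_intel_iommu(cmdline):
    """True if the given kernel command line contains intel_iommu=on."""
    return "intel_iommu=on" in cmdline.split()
```

On the affected node, `iommu_groups_present()` returning False while `cmdline_requests_intel_iommu(Path("/proc/cmdline").read_text())` returns True would suggest that the kernel asked for the IOMMU but the firmware did not expose it, which is consistent with VT-d being disabled in the BIOS.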

Changed in starlingx:
assignee: nobody → Matt Peters (mpeters-wrs)
tags: added: stx.networking
Changed in starlingx:
status: New → Invalid