SRIOV vf's are not created using "system host-if-modify" command
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Incomplete
|
Low
|
Sriram Dharwadkar |
Bug Description
Brief Description
-----------------
SRIOV vfs are not created using "system host-if-modify" command
system host-if-modify controller-0 -n sriov0 -c pci-sriov -N 8 --vf-driver=
Severity
--------
Provide the severity of the defect.
Critical
Steps to Reproduce
------------------
1) while installing distributed starlingX 4.0, as part of subcloud bring up, SRIOV vf's are to be created by using below commands.
system host-label-assign controller-0 sriovdp=enabled
system host-memory-modify controller-0 0 -1G 30
system host-memory-modify controller-0 1 -1G 30
DATA0IF=enp59s0f0
export NODE=controller-0
PHYSNET0='physnet0'
SPL=/tmp/
SPIL=/tmp/
system host-port-list ${NODE} --nowrap > ${SPL}
system host-if-list -a ${NODE} --nowrap > ${SPIL}
DATA0PCIADDR=$(cat $SPL | grep $DATA0IF |awk '{print $8}')
DATA0PORTUUID=$(cat $SPL | grep ${DATA0PCIADDR} | awk '{print $2}')
DATA0PORTNAME=$(cat $SPL | grep ${DATA0PCIADDR} | awk '{print $4}')
DATA0IFUUID=$(cat $SPIL | awk -v DATA0PORTNAME=
system datanetwork-add ${PHYSNET0} vlan
system host-if-modify $NODE -n sriov0 -c pci-sriov -N 8 --vf-driver=
system interface-
system host-unlock controller-0
Expected Behavior
------------------
After the running the list of commands as part of subcloud installation, it is expected that
8 vf's are created. "ip link show" should show those 8 vfs. Configmap(
Actual Behavior
----------------
vf's are not created. ip link show doesnt show any vfs.
But config.json is updated
controller-
{
"resourceList": [
{
"
"selectors": {
"vendors": [
"14e4"
],
"drivers": [
"bnxt_en"
],
"devices": [
"16dc"
],
"pfNames": [
]
}
}
]
}
and controller-0 has gone to "failed" state
[sysadmin@
+----+-
| id | hostname | personality | administrative | operational | availability |
+----+-
| 1 | controller-0 | controller | unlocked | disabled | failed |
Since it is in failed state, controller-0 node is not able to run sriov-dp pod.
Reproducibility
---------------
Always reproducible
System Configuration
-------
All in one controller - duplex in edge OR
Standard controller with storage + 2 compute nodes in edge
Branch/Pull Time/Commit
-------
rel-20.06 - in Starlingx 4.0 (Distributed StarlingX)
Last Pass
---------
Timestamp/Logs
--------------
Test Activity
-------------
Workaround
----------
Describe workaround if available
tags: | added: stx.networking |
Changed in starlingx: | |
status: | New → Incomplete |
assignee: | nobody → Sriram Dharwadkar (sriramec) |
Notes:
Check the logs in /var/log/puppet/*. The controller-0 is likely failing as the puppet manifest has errors when trying to enable / bind the SR-IOV devices.
- Ensure you have VT-d enabled in the BIOS.
- Ensure this NIC supports SR-IOV
It may be useful to try to enable SR-IOV manually on the NIC. You can check the puppet logs for the appropriate commands