X710 (10G) NIC VFs report a speed of 100M when used with DPDK

Bug #1885020 reported by Steven Webster
Affects: StarlingX
Status: Fix Released
Importance: High
Assigned to: Steven Webster

Bug Description

Brief Description
-----------------
When SR-IOV interfaces with device type "Ethernet Controller X710 for 10GbE SFP+ [1572]" are used in a DPDK-enabled application container, the interfaces report a speed of 100Mb rather than 10Gb.

The i40e (PF) and iavf (VF) drivers need to be upgraded.

Severity
--------
Major: System/Feature is usable but degraded

Steps to Reproduce
------------------
system host-lock controller-0
system host-label-assign controller-0 sriovdp=enabled
system host-label-assign controller-0 kube-cpu-mgr-policy=static
system host-label-assign controller-0 kube-topology-mgr-policy=restricted
system host-if-modify controller-0 enp177s0f0 -n sriov0 -c pci-sriov --vf-driver=vfio -N 2
system interface-datanetwork-assign controller-0 sriov0 group0-data1
system host-unlock controller-0
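
Once the host is unlocked, the VF count requested with -N 2 can be checked from the host. This is a minimal sketch, assuming the PF netdev is still named enp177s0f0 on the host and that the standard sysfs attribute sriov_numvfs is exposed for it. The helper name read_numvfs and its sysfs_root parameter are hypothetical and not part of StarlingX.

```python
from pathlib import Path


def read_numvfs(pf_name: str, sysfs_root: str = "/sys") -> int:
    """Return the number of VFs currently enabled on a PF netdev."""
    attr = Path(sysfs_root) / "class" / "net" / pf_name / "device" / "sriov_numvfs"
    return int(attr.read_text().strip())


if __name__ == "__main__":
    pf = "enp177s0f0"  # PF name taken from the host-if-modify step (assumed unchanged)
    try:
        print(f"{pf}: {read_numvfs(pf)} VFs enabled")
    except (OSError, ValueError):
        # No such interface, no SR-IOV attribute, or unreadable content.
        print(f"{pf}: sriov_numvfs not available")
```

With the configuration above, the expected value would be 2.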

network attachment definition:

apiVersion: "k8s.cni.cncf.io/v1"
kind: NetworkAttachmentDefinition
metadata:
  name: sriov0
  annotations:
    k8s.v1.cni.cncf.io/resourceName: intel.com/pci_sriov_net_group0_data1
spec:
  config: '{
    "type": "sriov",
    "vlan": 3460,
    "trust": "on",
    "spoofchk": "off"
  }'

pod spec:

apiVersion: v1
kind: Pod
metadata:
  name: dpdkapp
  annotations:
    k8s.v1.cni.cncf.io/networks: '[
            { "name": "sriov0" },
            { "name": "sriov0" }
    ]'
spec:
  restartPolicy: Never
  containers:
  - name: dpdkapp
    image: "dpdkapp"
    securityContext:
      privileged: true
      capabilities:
        add: ["IPC_LOCK", "NET_ADMIN", "NET_RAW"]
    volumeMounts:
    - mountPath: /mnt/huge-1048576kB
      name: hugepage
    stdin: true
    tty: true
    resources:
      requests:
        memory: 2Gi
        cpu: 3
        intel.com/pci_sriov_net_group0_data1: '2'
      limits:
        hugepages-1Gi: 2Gi
        memory: 2Gi
        cpu: 3
        intel.com/pci_sriov_net_group0_data1: '2'
  volumes:
  - name: hugepage
    emptyDir:
      medium: HugePages

Expected Behavior
------------------

The DPDK application should report a speed of 10G for the VF interfaces.

Actual Behavior
----------------

The reported port speed is 100M.
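
DPDK expresses link speed in Mbps, so the expected value for these VFs is 10000 and the observed value is 100. The sketch below shows one way a test script might flag the mismatch. The function names and the idea of comparing against a fixed expected value are assumptions for illustration, not part of the DPDK application used in this report.

```python
EXPECTED_MBPS = 10_000  # 10G, the speed an X710 VF is expected to report


def describe_speed(mbps: int) -> str:
    """Format a speed in Mbps the way it is written in this report (100M, 10G)."""
    if mbps >= 1000 and mbps % 1000 == 0:
        return f"{mbps // 1000}G"
    return f"{mbps}M"


def is_degraded(reported_mbps: int, expected_mbps: int = EXPECTED_MBPS) -> bool:
    """True when the reported speed is below the expected speed."""
    return reported_mbps < expected_mbps


if __name__ == "__main__":
    for reported in (100, 10_000):
        state = "degraded" if is_degraded(reported) else "ok"
        print(f"reported {describe_speed(reported)}: {state}")
```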

Reproducibility
---------------
100%

System Configuration
--------------------
AIO-SX

Branch/Pull Time/Commit
-----------------------
master, 2020-06-24

Last Pass
---------

This is likely caused by the most recent uprev of the drivers.

Timestamp/Logs
--------------

N/A

Test Activity
-------------
Regression Testing

Workaround
----------

Upgrade the i40e (2.11.29) and iavf (3.9.5) drivers.
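
To see which driver versions a host has installed, the module version can be read with modinfo and compared against the versions named above. This is a minimal sketch: treating 2.11.29 and 3.9.5 as minimum versions is an assumption (the report only states that these versions were used), and the helper names are hypothetical.

```python
import re
import subprocess

# Versions named in the workaround; treating them as minimums is an assumption.
WORKAROUND_VERSIONS = {"i40e": "2.11.29", "iavf": "3.9.5"}


def parse_version(text: str) -> tuple:
    """Turn '2.11.29' (or '2.8.20-k') into a tuple of its leading numeric fields."""
    match = re.match(r"\d+(?:\.\d+)*", text.strip())
    if match is None:
        raise ValueError(f"no numeric version in {text!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def at_least(installed: str, required: str) -> bool:
    """True when the installed version is equal to or newer than the required one."""
    return parse_version(installed) >= parse_version(required)


def module_version(module: str) -> str:
    """Ask modinfo for the version field of a kernel module."""
    result = subprocess.run(
        ["modinfo", "-F", "version", module],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


if __name__ == "__main__":
    for module, required in WORKAROUND_VERSIONS.items():
        try:
            installed = module_version(module)
            ok = at_least(installed, required)
        except (OSError, subprocess.CalledProcessError, ValueError):
            print(f"{module}: version not available")
            continue
        verdict = "ok" if ok else f"older than {required}"
        print(f"{module}: {installed} ({verdict})")
```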

Changed in starlingx:
assignee: nobody → Steven Webster (swebster-wr)
Ghada Khalil (gkhalil)
tags: added: stx.distro.other stx.networking
Ghada Khalil (gkhalil) wrote :

stx.4.0 / high priority - this results in serious throughput degradation in this config

Changed in starlingx:
importance: Undecided → High
status: New → Triaged
tags: added: stx.4.0
OpenStack Infra (hudson-openstack) wrote : Fix proposed to kernel (master)

Fix proposed to branch: master
Review: https://review.opendev.org/738106

Changed in starlingx:
status: Triaged → In Progress
OpenStack Infra (hudson-openstack) wrote : Fix merged to kernel (master)

Reviewed: https://review.opendev.org/738106
Committed: https://git.openstack.org/cgit/starlingx/kernel/commit/?id=a8c66d3860d2d82f64e401a0f6fab6cd5bb78190
Submitter: Zuul
Branch: master

commit a8c66d3860d2d82f64e401a0f6fab6cd5bb78190
Author: Steven Webster <email address hidden>
Date: Thu Jun 25 16:33:24 2020 -0400

    SR-IOV DPDK traffic loss

    It has been found that the current version of the i40e driver
    causes SR-IOV VF interfaces to report a speed of 100Mbps when
    used by DPDK applications.

    Uprevving the i40e and iavf drivers to the latest release
    will resolve the issue.

    Change-Id: I503001f979252f319cb849f9f3fd148b050062a8
    Closes-Bug: #1885020
    Signed-off-by: Steven Webster <email address hidden>

Changed in starlingx:
status: In Progress → Fix Released
Ghada Khalil (gkhalil) wrote :

@Steve, please cherry-pick to the r/stx.4.0 branch.

OpenStack Infra (hudson-openstack) wrote : Fix proposed to kernel (r/stx.4.0)

Fix proposed to branch: r/stx.4.0
Review: https://review.opendev.org/742270

OpenStack Infra (hudson-openstack) wrote : Fix merged to kernel (r/stx.4.0)

Reviewed: https://review.opendev.org/742270
Committed: https://git.openstack.org/cgit/starlingx/kernel/commit/?id=155b89c1ecaf5704d92b01edbc73ad08d785074b
Submitter: Zuul
Branch: r/stx.4.0

commit 155b89c1ecaf5704d92b01edbc73ad08d785074b
Author: Steven Webster <email address hidden>
Date: Thu Jun 25 16:33:24 2020 -0400

    SR-IOV DPDK traffic loss

    It has been found that the current version of the i40e driver
    causes SR-IOV VF interfaces to report a speed of 100Mbps when
    used by DPDK applications.

    Uprevving the i40e and iavf drivers to the latest release
    will resolve the issue.

    Change-Id: I503001f979252f319cb849f9f3fd148b050062a8
    Closes-Bug: #1885020
    Signed-off-by: Steven Webster <email address hidden>
    (cherry picked from commit a8c66d3860d2d82f64e401a0f6fab6cd5bb78190)

Ghada Khalil (gkhalil)
tags: added: in-r-stx40