VF LAG not enabled due to service startup order for sriov-netplan-shim and mlnx-switchdev-mode

Bug #1907454 reported by Frode Nordahl
This bug affects 1 person
Affects                               Status        Importance  Assigned to    Milestone
OpenStack Neutron Open vSwitch Charm  Fix Released  High        Unassigned
charm-ovn-chassis                     Fix Released  High        Frode Nordahl

Bug Description

Note that this is a bug in the mlnx-switchdev-mode and sriov-netplan-shim packages.

At boot you will see:
[ 275.085444] mlx5_core 0000:81:00.0: mlx5_cmd_check:769:(pid 5815): CREATE_LAG(0x840) op_mod(0x0) failed, status bad parameter(0x3), syndrome (0x7d49cb)
[ 275.100008] mlx5_core 0000:81:00.0: mlx5_create_lag:199:(pid 5815): Failed to create LAG (-22)
[ 275.109120] mlx5_core 0000:81:00.0: mlx5_activate_lag:219:(pid 5815): Failed to activate VF LAG
               Make sure all VFs are unbound prior to VF LAG activation or deactivation

We have a partial fix committed here:
https://github.com/openstack-charmers/mlnx-switchdev-mode/pull/2
https://github.com/openstack-charmers/sriov-netplan-shim/pull/2

What still needs to be done is to break out the "rebind" step into a separate sub-command and run it _AFTER_ network.target has been reached, so that VFs are bound back to the mlx5_core driver only after any bonds have been created.

The fix works as-is but nova-compute will complain.
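
For illustration only, the split into separate "unbind" and "rebind" sub-commands described above could look roughly like the sketch below. This is not the code from either pull request. It assumes the standard sysfs layout (virtfnN symlinks under the PF's device directory, and the bind/unbind files of the mlx5_core PCI driver); the function names, the --sysfs-root option and the ens1f0 interface name are hypothetical.

```python
import argparse
import os
from pathlib import Path

DRIVER = "mlx5_core"  # driver named in this report


def vf_addresses(pf_netdev, sysfs_root="/sys"):
    """Return the PCI addresses of the VFs of a PF network device."""
    device_dir = Path(sysfs_root) / "class" / "net" / pf_netdev / "device"
    # Each virtfnN entry is a symlink whose target name is the VF's PCI address.
    names = [n for n in os.listdir(device_dir) if n.startswith("virtfn")]
    names.sort(key=lambda n: int(n[len("virtfn"):]))
    return [os.path.basename(os.readlink(device_dir / n)) for n in names]


def write_sysfs(path, value):
    """Write one value to a sysfs attribute file."""
    with open(path, "w") as handle:
        handle.write(value)


def _apply(action, pf_netdev, sysfs_root, write):
    # action is "bind" or "unbind": the attribute file of the PCI driver.
    target = Path(sysfs_root) / "bus" / "pci" / "drivers" / DRIVER / action
    addresses = vf_addresses(pf_netdev, sysfs_root)
    for address in addresses:
        write(target, address)
    return addresses


def unbind_vfs(pf_netdev, sysfs_root="/sys", write=write_sysfs):
    """Detach all VFs from the driver (intended to run before bonds exist)."""
    return _apply("unbind", pf_netdev, sysfs_root, write)


def rebind_vfs(pf_netdev, sysfs_root="/sys", write=write_sysfs):
    """Attach all VFs to the driver again (intended to run after network.target)."""
    return _apply("bind", pf_netdev, sysfs_root, write)


def main(argv=None):
    parser = argparse.ArgumentParser(description="Hypothetical VF bind helper")
    sub = parser.add_subparsers(dest="action", required=True)
    for name in ("unbind", "rebind"):
        cmd = sub.add_parser(name)
        cmd.add_argument("pf_netdev")
        cmd.add_argument("--sysfs-root", default="/sys")
    args = parser.parse_args(argv)
    handler = unbind_vfs if args.action == "unbind" else rebind_vfs
    return handler(args.pf_netdev, args.sysfs_root)
```

With this split, one service could call main(["unbind", "ens1f0"]) early in boot, and a second service ordered after network.target could call main(["rebind", "ens1f0"]) once the bonds are in place.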

Tags: ps5
Frode Nordahl (fnordahl)
tags: added: ps5
James Page (james-page) wrote :

Testing packages that include the referenced pull requests will appear here:

https://launchpad.net/~openstack-charmers/+archive/ubuntu/networking-tools-testing

James Page (james-page) wrote :

Proposed updates are testing fine in our deployment.

A side effect is that nova-compute complains about missing device entries; however, the devices do work just fine.

Frode Nordahl (fnordahl)
Changed in charm-neutron-openvswitch:
status: New → Fix Committed
status: Fix Committed → Triaged
importance: Undecided → High
Changed in charm-ovn-chassis:
status: New → Fix Committed
importance: Undecided → High
assignee: nobody → Frode Nordahl (fnordahl)
milestone: none → 21.01
David Ames (thedac)
Changed in charm-ovn-chassis:
status: Fix Committed → Fix Released
Frode Nordahl (fnordahl)
Changed in charm-neutron-openvswitch:
status: Triaged → Fix Committed
milestone: none → 21.04
Changed in charm-neutron-openvswitch:
status: Fix Committed → Fix Released