interface with a warning is not configured properly
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS |
Fix Released
|
High
|
Björn Tillenius | ||
3.0 |
Fix Released
|
Critical
|
Björn Tillenius |
Bug Description
We have one machine which shows with yellow triangle and "Link connected to a slow interface" message on one NIC.
When machine is deployed, this interface fails to be configured resulting in missing VLANs.
Works fine in MAAS 2.9
2.9
ubuntu@pangoro:~$ ip -c -br a
lo UNKNOWN 127.0.0.1/8 ::1/128
eth0 UP 10.246.64.202/21 fe80::eeb1:
eth1 UP fe80::eeb1:
eth2 UP
eth3 UP
bond0 UP fe80::eeb1:
bond0.2735@bond0 UP 192.168.35.70/26 fe80::eeb1:
bond0.2736@bond0 UP 192.168.36.70/26 fe80::eeb1:
eth1.2696@eth1 UP 192.168.33.164/25 fe80::eeb1:
eth1.2678@eth1 UP 10.244.8.133/24 fe80::eeb1:
3.0rc2
lo UNKNOWN 127.0.0.1/8 ::1/128
eth0 UP 10.246.64.208/21 fe80::eeb1:
eno2 DOWN
eth2 UP
eth3 UP
bond0 UP fe80::eeb1:
bond0.2735@bond0 UP 192.168.35.70/26 fe80::eeb1:
bond0.2736@bond0 UP 192.168.36.70/26 fe80::eeb1:
Machine named pangoro in https:/
Related branches
- Björn Tillenius: Approve
-
Diff: 102 lines (+15/-9)2 files modifiedsrc/metadataserver/builtin_scripts/network.py (+8/-8)
src/metadataserver/builtin_scripts/tests/test_network.py (+7/-1)
- Alberto Donato (community): Approve
- MAAS Lander: Pending (unittests) requested
-
Diff: 102 lines (+15/-9)2 files modifiedsrc/metadataserver/builtin_scripts/network.py (+8/-8)
src/metadataserver/builtin_scripts/tests/test_network.py (+7/-1)
summary: |
- [3.0 rc2 testing] interface with a warning is not configured properly + interface with a warning is not configured properly |
Changed in maas: | |
milestone: | none → 3.0.0-rc2 |
importance: | Undecided → Critical |
Changed in maas: | |
milestone: | 3.0.0-rc2 → none |
Changed in maas: | |
importance: | Critical → High |
Changed in maas: | |
assignee: | nobody → Björn Tillenius (bjornt) |
status: | New → In Progress |
Changed in maas: | |
status: | Incomplete → In Progress |
Changed in maas: | |
milestone: | none → next |
status: | In Progress → Fix Committed |
Changed in maas: | |
status: | Fix Committed → Fix Released |
milestone: | next → none |
I did a bit of digging here. One of the issues is that eno2 on pangoro is considered disconnected. If you don't configure the interface and log in to pangoro, you can see with 'ip link' that eno2 doesn't have a CARRIER.
If you manually configure the interface, it does come up after a while. I tried to ping a known host, but the first 9 pings were lost. Only after that, the interface started receiving traffic.
That's most likely a cabling or switch configuration issue. I would suggest that you look into that, since I would think that would cause problems in the Openstack deployment itself.
The reason it works in 2.9, is that we don't properly detect that there's no link. I tried commissioning a machine with a NIC that didn't have a cable attached, and in 2.9 it was still reported as "link_connected" and "enabled". While in 3.0, it's properly reported as not connected, and not enable.
I don't think it makes sense to change 3.0 to work like in 2.9 in that regard, since that's clearly a bug that got fixed.
We might consider to mark the interface as enabled if you explicitly set the vlan of the interface, like you do. On the other hand, I think you should be aware of the fact that you're configuring a device that isn't connected. So we might also want to error out, and require the user to explicitly enable the interface first.
SolQA, what do you think?