Subcloud with admin network doesn't go online due to missing rule in L3 firewall
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
High
|
Andre Kantek |
Bug Description
Brief Description
When installing a subcloud using the admin network, the subcloud install completes, routes are added as expected, but the subcloud and subcloud is online. But after sometime (20min or so), the subcloud goes offline.
[sysadmin@
+----+-
| id | name | management | availability | deploy status | sync | backup status | backup datetime |
+----+-
| 2 | subcloud5 | managed | offline | complete | unknown | None | None |
| 3 | subcloud6 | managed | offline | complete | unknown | None | None |
Initial investigation points that there is a missing rule/config related to L3 firewall. Even though root cause is related L3 firewall, this issue is also impacting TCPG-1093 as it prevents installation of subclouds with admin network.
Severity
<Major: System/Feature is usable but degraded>
Steps to Reproduce
1 - Install subcloud using admin network (comment out mgmt network gateway in the subcloud bootstrap file before running "dcmanager subcloud add")
2 - Wait subcloud to complete installation
Expected Behavior
Subcloud is completed and online
Actual Behavior
Subcloud install is completed but status is offline
Reproducibility
Reproducible
System Configuration
WRCP-DC3-1 SC1 (AIO-SX subcloud)
system controller: 2620:10a:
Official lab files used.
Load info (eg: 2022-03-
BUILD_ID=
Last Pass
Tried and worked on master load from may 25th
Alarms
NA
Test Activity
Workaround
The workaround is to add an extra global network policy to allow the admin network and the link-local network for ICMPv6
apiVersion: crd.projectcali
kind: GlobalNetworkPolicy
metadata:
name: workaround-
spec:
applyOnForward: true
ingress:
- action: Allow
ipVersion: 6
metadata:
annotations:
name: stx-ingr-
protocol: ICMPv6
source:
nets:
- fd00:8:24::/64 # <=== adjust to the subcloud admin network address
- fe80::/64
order: 100
selector: has(nodetype) && nodetype == 'controller' && has(iftype) && iftype contains 'admin'
types:
- Ingress
Changed in starlingx: | |
assignee: | nobody → Andre Kantek (akantek) |
Related to recent code changes for: https:/ /storyboard. openstack. org/#!/ story/2010591