[1.19] services on baremetal sometimes fail start with hacluster on focal
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Kubernetes Control Plane Charm |
New
|
Undecided
|
Unassigned | ||
OpenStack HA Cluster Charm |
New
|
Undecided
|
Unassigned |
Bug Description
Currently our deploys on baremetal deploys the kubernetes master leader units regularly fail to have services start; sometimes they come up and then go down later.
Some notes about our baremetal deployments:
we deploy with 3 k8s master units using hacluster
we limit traffic for kube-api-endpoint, kube-control, and loadbalancer endpoints traffic to their own space. we call this the internal-space
we have all the network spaces used set in juju-no-proxy
The only consistent thing in the logs is that the k8s master regularly fails to connect to itself over the internal space usually with a connection refused.
a run where kube-controller
kube-controller
a run where kube-apiserver failed
2020-09-22 11:43:59 INFO juju-log Executing ['kubectl', '--kubeconfig=
2020-09-22 11:43:59 DEBUG update-status The connection to the server 192.168.33.31:6443 was refused - did you specify the right host or port?
Runs affected by this bug can be found at: /solutions. qa.canonical. com/bugs/ bugs/bug/ 1896639
https:/