EKS 1.30: new nodes sometimes get "tls: internal error"
Bug #2069854 reported by
Sergei Jeldosev
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
cloud-images |
New
|
Undecided
|
Unassigned |
Bug Description
After upgrading to 1.30 and EKS AMI ami-053f81caf65
That causes "tls: internal error" errors for ~10 minutes. If kubelet service is restarted manually, everything starts working immediately. We use Karpenter for autoscaling and it happens for about every 5th node.
Looks like there's a timing issue somewhere. We use userdata but don't restart kubelet or containerd.
AWS VPC CNI: 1.18.2
kube-proxy v1.30.0-
To post a comment you must log in.
Hi Sergei,
thanks for filling the bug report. Can you please provide log files (journal, kubelet-eks service, relevant pods, ...) . And possibly more details about your setup and the detailed steps to reproduce this?