commit dba41755233b2c5e9f8db6ee275f69873ab95612
Author: Jim Gauld <email address hidden>
Date: Thu Jun 27 11:47:12 2019 -0400
AIO reaffine tasks and k8s-infra during startup
This update reimplements the affine-tasks init script and service to
dynamically reaffine tasks and k8s-infra cgroup cpuset on AIO nodes.
This accomodates CPU intensive phases of work. Tasks are initially
allowed to float across all cores. Once system is at steady-state,
this will ensure that K8S pods are constrained to platform cores and
do not run on cores with VMs/containers.
This will speedup the first stx-application apply, as well as pod
recovery after lock/unlock, reboot, and controller swact.
This script waits forever for sufficient platform readiness criteria
(e.g., system critical pods are recovered, critical openstack pods
are running, nova-compute pod is running) before reaffining back
to platform cores.
This corrects the pod affinity problem seen on AIO introduced by fix
for bug: 1826592, commit e513baad44181f667085886007632d0ebf79eeb0,
i.e., fix allowed the AIO to not timeout, but left pods floating.
Change-Id: Ic257378eac451904a200a0f2e79f7bc4f8373009
Partial-Bug: 1832781
Signed-off-by: Jim Gauld <email address hidden>
Reviewed: https:/ /review. opendev. org/667972 /git.openstack. org/cgit/ starlingx/ config/ commit/ ?id=dba41755233 b2c5e9f8db6ee27 5f69873ab95612
Committed: https:/
Submitter: Zuul
Branch: master
commit dba41755233b2c5 e9f8db6ee275f69 873ab95612
Author: Jim Gauld <email address hidden>
Date: Thu Jun 27 11:47:12 2019 -0400
AIO reaffine tasks and k8s-infra during startup
This update reimplements the affine-tasks init script and service to
dynamically reaffine tasks and k8s-infra cgroup cpuset on AIO nodes.
This accomodates CPU intensive phases of work. Tasks are initially
allowed to float across all cores. Once system is at steady-state,
this will ensure that K8S pods are constrained to platform cores and
do not run on cores with VMs/containers.
This will speedup the first stx-application apply, as well as pod
recovery after lock/unlock, reboot, and controller swact.
This script waits forever for sufficient platform readiness criteria
(e.g., system critical pods are recovered, critical openstack pods
are running, nova-compute pod is running) before reaffining back
to platform cores.
This corrects the pod affinity problem seen on AIO introduced by fix 67085886007632d 0ebf79eeb0,
for bug: 1826592, commit e513baad44181f6
i.e., fix allowed the AIO to not timeout, but left pods floating.
Change-Id: Ic257378eac4519 04a200a0f2e79f7 bc4f8373009
Partial-Bug: 1832781
Signed-off-by: Jim Gauld <email address hidden>