Reviewed: https://review.opendev.org/c/starlingx/ansible-playbooks/+/909609
Committed: https://opendev.org/starlingx/ansible-playbooks/commit/6335d7d49183d716d7f4032c81f1d4138e756755
Submitter: "Zuul (22348)"
Branch: master
commit 6335d7d49183d716d7f4032c81f1d4138e756755
Author: Rei Oliveira <email address hidden>
Date: Tue Feb 20 15:57:12 2024 -0300
Rehome: Increase timeout for certs to be installed
This commit addresses two related bugs. Both involve Ansible
tasks that wait for certificates to be installed.
Task 'Check admin-ep-cert.pem updated' depends on cert-mon to
install the certificate. Cert-mon may fail and reattempt after
10 minutes. This change increases the timeout of this task
to more than 10 minutes. It also halves the delay between
checks, since the check is a quick stat operation, so that the
update is detected quickly in the common case where the first
cert-mon attempt works.
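As a rough sketch of the reasoning above, the polling budget of a
retries/delay loop can be approximated as retries times delay, and it
has to exceed the 10-minute cert-mon reattempt interval. The retry and
delay values below are hypothetical, not the ones in the playbook, and
the function names are made up for illustration.

```python
# cert-mon reattempts after 10 minutes (from the commit message).
CERT_MON_REATTEMPT_SECONDS = 10 * 60


def polling_budget_seconds(retries: int, delay_seconds: int) -> int:
    """Approximate worst-case wait of a retries/delay polling loop.

    Ignores the run time of each check, which is small for a stat.
    """
    return retries * delay_seconds


def covers_reattempt(retries: int, delay_seconds: int) -> bool:
    """True if the loop keeps polling past one cert-mon reattempt."""
    return polling_budget_seconds(retries, delay_seconds) > CERT_MON_REATTEMPT_SECONDS


# Hypothetical "before" and "after" settings (assumptions, not playbook values).
before = {"retries": 30, "delay_seconds": 10}
after = {"retries": 130, "delay_seconds": 5}  # delay halved, budget raised

print(covers_reattempt(**before))  # 300 s budget, shorter than the reattempt
print(covers_reattempt(**after))   # 650 s budget, longer than the reattempt
```

Note that halving the delay also halves the budget unless the retry
count is more than doubled, which is why both values change together
in this sketch.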
Task 'Verify if there are 250.001 (config out-of-date) alarms'
depends on puppet to apply a config change to install
the certificate, and on sysinv to clear the alarm. When I
reproduced this issue, it took 1 minute longer for the alarm
to clear. This change increases the timeout of the task by
about 50%.
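A minimal sketch of how a 50% larger retry count changes the outcome
of a polling loop, assuming the loop checks a condition up to a fixed
number of times with a fixed delay. The helper names, the retry count
of 6, and the point at which the alarm clears are all hypothetical.

```python
import math
from typing import Callable


def wait_until(check: Callable[[], bool], retries: int, delay_seconds: float,
               sleep: Callable[[float], None]) -> bool:
    """Poll check() up to `retries` times, sleeping between failed attempts."""
    for attempt in range(retries):
        if check():
            return True
        if attempt < retries - 1:
            sleep(delay_seconds)
    return False


def scale_retries(retries: int, factor: float = 1.5) -> int:
    """Raise the retry count by `factor`, rounding up."""
    return math.ceil(retries * factor)


def make_alarm_check(clears_on_poll: int) -> Callable[[], bool]:
    """Simulated alarm that reports cleared from the given poll onward."""
    state = {"polls": 0}

    def alarm_cleared() -> bool:
        state["polls"] += 1
        return state["polls"] >= clears_on_poll

    return alarm_cleared


def no_sleep(_seconds: float) -> None:
    """Stand-in for time.sleep so the example runs instantly."""


# The simulated alarm clears on the 7th poll: too late for 6 retries,
# in time for the scaled-up retry count.
print(wait_until(make_alarm_check(7), 6, 10, no_sleep))
print(wait_until(make_alarm_check(7), scale_retries(6), 10, no_sleep))
```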
Test Plan:
PASS: Rehome a subcloud with 1200 ms latency injected and
50% cap on CPU capacity.
Closes-Bug: 2054462
Closes-Bug: 2054463
Change-Id: I017fab0ccb13629c63a7cd855470f0a777f06e22
Signed-off-by: Rei Oliveira <email address hidden>