ovb-1ctlr_1comp-featureset020-master job timing out
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
tripleo | Triaged | Critical | Arx Cruz |
Bug Description
ovb fs20 master job is frequently timing out. No errors reported.
I could not find the root cause or any evidence that explains these timeouts. However, there is one task that is skipped and then followed by a huge interval (2-3 hours) before the next log line:
2018-03-26 02:52:44.587 | skipping: [undercloud]
2018-03-26 05:43:20.432 |
This rings a bell...
I checked the latest logs and they all have this same behavior:
2018-03-26 02:52:44.566 | TASK [validate-tempest : Verifying bugs in bugzilla and launchpad and generating skip file] ***
2018-03-26 02:52:44.566 | Monday 26 March 2018 02:52:44 +0000 (0:00:00.055) 0:00:21.644 **********
2018-03-26 02:52:44.587 | skipping: [undercloud]
2018-03-26 05:43:20.432 |
2018-03-26 05:43:20.432 | TASK [validate-tempest : Execute tempest] *******
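A minimal sketch of how such gaps could be located automatically in a console log, assuming every relevant line starts with a timestamp in the format shown above. The helper names (`parse_ts`, `find_gaps`) and the 5-minute threshold are illustrative choices, not part of any CI tooling.

```python
from datetime import datetime, timedelta

# Timestamp layout used at the start of each console log line.
TS_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def parse_ts(line):
    """Return the leading timestamp of a log line, or None if it has none."""
    try:
        return datetime.strptime(line[:23], TS_FORMAT)
    except ValueError:
        return None


def find_gaps(lines, threshold=timedelta(minutes=5)):
    """Return (gap, previous_line, next_line) for consecutive timestamped
    lines that are further apart than the threshold."""
    gaps = []
    prev_ts, prev_line = None, None
    for line in lines:
        ts = parse_ts(line)
        if ts is None:
            continue  # ignore lines without a timestamp prefix
        if prev_ts is not None and ts - prev_ts > threshold:
            gaps.append((ts - prev_ts, prev_line, line))
        prev_ts, prev_line = ts, line
    return gaps


# Lines copied from the fs020 excerpt in this report.
fs020 = [
    "2018-03-26 02:52:44.566 | TASK [validate-tempest : Verifying bugs in bugzilla and launchpad and generating skip file] ***",
    "2018-03-26 02:52:44.587 | skipping: [undercloud]",
    "2018-03-26 05:43:20.432 |",
    "2018-03-26 05:43:20.432 | TASK [validate-tempest : Execute tempest] *******",
]

for gap, before, after in find_gaps(fs020):
    print(gap, "after:", before)
```

Run against a full console log (for example, the lines read from a downloaded log file), this would list every silent period longer than the threshold together with the line that preceded it.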
The job continues until being killed by timeout. The total runtime for the job was ~5h:30m:
2018-03-26 01:06:29.844 | Started by user anonymous
2018-03-26 06:33:55.424 | Warning: Permanently added the ECDSA host key for IP address '38.145.32.13' to the list of known hosts.
Note: For the jobs that succeed, there is also a huge interval for the validate-tempest task. However, these jobs were able to complete within the timeout limit:
https:/
The successful job completed in less than 5h:
2018-03-24 20:11:21.726 | Started by user anonymous
2018-03-25 00:50:10.081 | % Total % Received % Xferd Average Speed Time Time Time Current
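The two runtimes quoted above can be checked directly from the first and last timestamps of each log. This is only a sketch of the arithmetic; the `elapsed` helper is a hypothetical name, and the timestamps are the ones copied from the excerpts in this report.

```python
from datetime import datetime

# Timestamp layout used at the start of each console log line.
TS_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def elapsed(first_ts, last_ts):
    """Return the time between two console log timestamps as a timedelta."""
    start = datetime.strptime(first_ts, TS_FORMAT)
    end = datetime.strptime(last_ts, TS_FORMAT)
    return end - start


# First and last timestamps of the job that hit the timeout.
failed = elapsed("2018-03-26 01:06:29.844", "2018-03-26 06:33:55.424")
# First and last timestamps of the job that completed.
passed = elapsed("2018-03-24 20:11:21.726", "2018-03-25 00:50:10.081")

print("timed-out job: ", failed)
print("successful job:", passed)
print("difference:    ", failed - passed)
```

The difference between the two runs is under an hour, so even the successful job finishes with limited headroom before the timeout.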
Note 2: By comparison, in other jobs (fs001) the gap after the same skipped task is only about 7 minutes:
2018-03-26 03:14:20.112 | TASK [validate-tempest : Verifying bugs in bugzilla and launchpad and generating skip file] ***
2018-03-26 03:14:20.112 | Monday 26 March 2018 03:14:20 +0000 (0:00:00.060) 0:00:24.711 **********
2018-03-26 03:14:20.132 | skipping: [undercloud]
2018-03-26 03:21:52.818 |
2018-03-26 03:21:52.839 | TASK [validate-tempest : Execute tempest] *******
https:/
tags: | removed: fs020 master ovb |
bump fs20 tempest to 3 workers https://review.openstack.org/556695 https://review.openstack.org/556697
bump ci overcloud flavor for faster job time https:/