centos-8 multinode and undercloud jobs are hanging on the undercloud install
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
tripleo | Fix Released | Critical | Unassigned |
Bug Description
The hang happens sporadically and is therefore hard to track down, but it has been reproduced on multinode and undercloud jobs in both RDO Cloud (periodic) and upstream Zuul (check).
The tripleo undercloud install task fails with:
2020-03-02 18:25:08.374990 | primary | TASK [tripleo_
2020-03-02 18:25:08.389970 | primary | Monday 02 March 2020 18:25:08 +0000 (0:00:00.086) 0:06:16.122 **********
2020-03-02 20:30:03.809836 | primary | fatal: [undercloud]: FAILED! => {
2020-03-02 20:30:03.809906 | primary | "changed": true
2020-03-02 20:30:03.809918 | primary | }
2020-03-02 20:30:03.809925 | primary |
2020-03-02 20:30:03.809933 | primary | MSG:
2020-03-02 20:30:03.809940 | primary |
2020-03-02 20:30:03.809947 | primary | async task did not complete within the requested time - 7200s
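As a quick sanity check on the excerpt above, the elapsed time between the task start line and the failure line can be compared with the 7200s async limit reported in the message. The following is a minimal sketch: the two timestamps and the timeout value are copied from the log, while the function and constant names are only illustrative.

```python
from datetime import datetime

# Timestamps copied from the console excerpt above.
TASK_START = "2020-03-02 18:25:08.374990"
TASK_FAILED = "2020-03-02 20:30:03.809836"

# Async limit reported in the failure message, in seconds.
ASYNC_TIMEOUT_S = 7200


def elapsed_seconds(start: str, end: str) -> float:
    """Return the number of seconds between two console timestamps."""
    fmt = "%Y-%m-%d %H:%M:%S.%f"
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return delta.total_seconds()


elapsed = elapsed_seconds(TASK_START, TASK_FAILED)
print(f"elapsed: {elapsed:.0f}s, over timeout by {elapsed - ASYNC_TIMEOUT_S:.0f}s")
```

The task was reported as failed roughly 7495s after it started, so the install ran through the whole async window rather than failing early, which matches a hang rather than a fast error.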
Example logs:
Note that it is not always the same task that times out:
https:/
Changed in tripleo:
status: Triaged → Fix Released
Example log from upstream Zuul check:
https://1faa1506db9db98d8787-99d0a6bba3a1a8c92ad3774145ae0bff.ssl.cf5.rackcdn.com/710179/6/check/tripleo-ci-centos-8-containers-multinode/a337701/logs/undercloud/home/zuul/undercloud_install.log