test runner fails to reach the test bed
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ubuntu CI Engine |
Fix Released
|
High
|
Vincent Ladeuil |
Bug Description
Reported by Urshina on IRC:
INFO:tstrun.
INFO:tstrun.
INFO:image_
INFO:image_test:The test bed is ready
INFO:image_
ssh: connect to host 10.0.0.109 port 22: No route to host
ssh: connect to host 10.0.0.109 port 22: No route to host
INFO:image_
ssh: connect to host 10.0.0.109 port 22: No route to host
INFO:image_
ssh: connect to host 10.0.0.109 port 22: No route to host
INFO:image_
INFO:image_
This appears to be caused by some glitch on hp cloud but may also be a race around the ssh server in the tesbed starting *after* cloud-init has completed.
In any case, the test runner should report a better error than empty summary, subunit and log files.
If the ssh server is slow to start, the tr worker should retry to connect before reporting an error.
Related branches
- Andy Doan (community): Approve
- PS Jenkins bot (community): Approve (continuous-integration)
-
Diff: 72 lines (+20/-6)1 file modifiedtest_runner/tstrun/testbed.py (+20/-6)
Changed in uci-engine: | |
assignee: | nobody → Vincent Ladeuil (vila) |
importance: | Undecided → High |
milestone: | none → phase-0.5 |
status: | New → Confirmed |
no longer affects: | ubuntu-ci-services-itself |
Changed in uci-engine: | |
status: | Confirmed → In Progress |
Changed in uci-engine: | |
status: | Fix Committed → Fix Released |
FTR, I'm ruling out the race between cloud-init and the ssh server. I was able to connect to a testbed long before cloud-init finished apt-get update/install. Given the other network related issues in the past with hp, I think it's safe to blame that instead.