autopkgtest-cloud running out of space is not handled well
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Auto Package Testing | New | Undecided | Unassigned |
Bug Description
Some autopkgtests were running on autopkgtest-
autopkgtest [16:44:34]: testing package golang-
autopkgtest [16:44:34]: ERROR: unexpected error:
OSError: [Errno 28] No space left on device
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/
process_
File "/home/
tests_tree = build_source(kind, arg, built_binaries)
File "/home/
f.write('%s %s\n' % (testpkg_name, testpkg_version))
OSError: [Errno 28] No space left on device
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/home/
main()
File "/home/
errorcode = print_exception
File "/home/
adtlog.
File "/home/
summary_
OSError: [Errno 28] No space left on device
And this is the journal from around that time frame:
Mar 22 16:44:30 lrg-root4 /home/ubuntu/
Mar 22 16:44:30 lrg-root4 sh[706687]: OSError: [Errno 28] No space left on device
Mar 22 16:44:30 lrg-root4 sh[706687]: During handling of the above exception, another exception occurred:
Mar 22 16:44:30 lrg-root4 sh[706687]: Traceback (most recent call last):
Mar 22 16:44:30 lrg-root4 sh[706687]: File "/home/
Mar 22 16:44:30 lrg-root4 sh[706687]: main()
Mar 22 16:44:30 lrg-root4 sh[706687]: File "/home/
Mar 22 16:44:30 lrg-root4 sh[706687]: queue.wait()
Mar 22 16:44:30 lrg-root4 sh[706687]: File "/usr/lib/
Mar 22 16:44:30 lrg-root4 sh[706687]: return self.dispatch_
Mar 22 16:44:30 lrg-root4 sh[706687]: File "/usr/lib/
h_method
Mar 22 16:44:30 lrg-root4 sh[706687]: return amqp_method(self, args, content)
Mar 22 16:44:30 lrg-root4 sh[706687]: File "/usr/lib/
Mar 22 16:44:30 lrg-root4 sh[706687]: func(msg)
Mar 22 16:44:30 lrg-root4 sh[706687]: File "/home/
Mar 22 16:44:30 lrg-root4 sh[706687]: f.write('%i\n' % code)
Mar 22 16:44:30 lrg-root4 sh[706687]: OSError: [Errno 28] No space left on device
Mar 22 16:44:30 lrg-root4 systemd[1]: <email address hidden>: Main process exited, code=exited, status=1/FAILURE
As I see it there are two problems here:
1) the log files are left in /tmp/autopkgtes
2) the autopkgtest process itself can still be left running, e.g.:
ubuntu 716051 0.0 0.0 45316 20736 ? S Mar22 0:00 /usr/bin/python3 -u /home/ubuntu/
t-dir /tmp/autopkgtes
ion/setup-
ols --apt-upgrade golang-
g-golang-
<email address hidden> --name adt-lunar-
--keyname testbed-lrg-root4 --net-id=
http://
security.
In case any other autopkgtest sysadmin looks at this bug (like I did!), the best way to clean this up is to kill the tee process that the runner is waiting on. First find the runner's PID, e.g.:
ps aux | grep "runner.*jool"
Confirm that $PID is waiting indefinitely (it will be waiting on tee). Then:
kill $(pstree -p $PID | grep tee | sed -E 's/.*\((.*)\)/\1/')
The runner process will then exit because an underlying process was killed, and the test will be restarted because there were no results.
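
For anyone who would rather not rely on the greedy sed expression, the PID extraction step can be sketched in Python. This is only an illustration, assuming `pstree -p` prints each process as `name(pid)`; the helper name `tee_pids` and the sample text are hypothetical and not part of autopkgtest-cloud.

```python
import re

# Matches "tee(<pid>)" entries in the `name(pid)` format printed by `pstree -p`.
_TEE_ENTRY = re.compile(r"\btee\((\d+)\)")


def tee_pids(pstree_output: str) -> list[int]:
    """Return the PID of every tee process named in `pstree -p` output."""
    return [int(pid) for pid in _TEE_ENTRY.findall(pstree_output)]


if __name__ == "__main__":
    # Made-up sample; real input would come from `pstree -p $PID`.
    sample = "python3(1000)---sh(1001)---tee(1002)"
    print(tee_pids(sample))
```

Each PID returned could then be passed to `os.kill(pid, signal.SIGTERM)`, which sends the same default signal as the `kill` command above.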