RTC? test time-out (linux 4.9.0-12.13 ADT test failure with linux 4.9.0-12.13)

Bug #1658635 reported by Andy Whitcroft
This bug affects 1 person
Affects: linux (Ubuntu)
Status: Fix Released
Importance: Medium
Assigned to: Colin Ian King
Milestone: (none)
Andy Whitcroft (apw)
tags: added: kernel-adt-failure
Brad Figg (brad-figg) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:

apport-collect 1658635

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Andy Whitcroft (apw) wrote : Re: linux 4.9.0-12.13 ADT test failure with linux 4.9.0-12.13

Looks like the RTC test is timing us out again (maybe):

11:59:18 DEBUG| [stdout] selftests: threadtest [PASS]
11:59:18 ERROR| [stderr]
11:59:18 ERROR| [stderr] RTC Driver Test Example.
11:59:18 ERROR| [stderr]
autopkgtest [15:57:37]: ERROR: timed out on command "su -s /bin/bash ubuntu -c [...]"

description: updated
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
summary: - linux 4.9.0-12.13 ADT test failure with linux 4.9.0-12.13
+ RTC? test time-out (linux 4.9.0-12.13 ADT test failure with linux
+ 4.9.0-12.13)
Andy Whitcroft (apw)
Changed in linux (Ubuntu):
assignee: nobody → Colin Ian King (colin-king)
Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → In Progress
Colin Ian King (colin-king) wrote :

I suspect this is because the test does a blocking read on /dev/rtc*, which blocks until the 1-second update wakeups are triggered. In this case, it seems that either the RTC_UIE_ON ioctl did not trigger the wakeup or the wakeup did not fire on the VM, so the ADT test timed out after 4 hrs.
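
For illustration only, here is a minimal Python sketch of the failure mode described above and one way a test could guard against it. It is not the kernel selftest's code. The helper names (wait_readable, count_update_interrupts), the default device path /dev/rtc0, and the 5-second timeout are assumptions chosen for the example. The ioctl numbers are the _IO('p', 0x03) and _IO('p', 0x04) values from <linux/rtc.h>. Instead of a bare blocking read, the sketch waits with select() and a timeout, so a lost update interrupt shows up as a short count rather than a multi-hour hang.

```python
import fcntl
import os
import select

# ioctl request numbers from <linux/rtc.h>
RTC_UIE_ON = 0x7003   # _IO('p', 0x03): enable update interrupts
RTC_UIE_OFF = 0x7004  # _IO('p', 0x04): disable update interrupts


def wait_readable(fd, timeout_s):
    """Return True if fd becomes readable within timeout_s seconds."""
    ready, _, _ = select.select([fd], [], [], timeout_s)
    return bool(ready)


def count_update_interrupts(path="/dev/rtc0", wanted=5, timeout_s=5.0):
    """Count 1-second update interrupts, giving up if one never arrives.

    Hypothetical helper: path, wanted and timeout_s are example defaults.
    """
    fd = os.open(path, os.O_RDONLY)
    try:
        fcntl.ioctl(fd, RTC_UIE_ON)
        seen = 0
        try:
            while seen < wanted:
                # A bare os.read() here would block forever if the
                # interrupt is lost; select() bounds the wait instead.
                if not wait_readable(fd, timeout_s):
                    break
                os.read(fd, 8)  # interrupt data word; value unused here
                seen += 1
        finally:
            fcntl.ioctl(fd, RTC_UIE_OFF)
        return seen
    finally:
        os.close(fd)
```

On a machine with a working RTC, and with permission to open the device, count_update_interrupts() should return the requested count after a few seconds. A smaller return value would suggest that update interrupts stopped arriving.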

Colin Ian King (colin-king) wrote :

@apw, how are you invoking these tests on prodstack? I can't reproduce this issue across the VMs I'm using.

Andy Whitcroft (apw) wrote :

@cking mostly using magic; I can trigger new ones etc. if that helps.

Colin Ian King (colin-king) wrote :

OK, can you run a few more tests so we can see how variable this is, e.g., whether it triggers every time or not?

Seth Forshee (sforshee) wrote :

@cking: I think this is the same problem I looked into before; see bug #1649718. However, I completely dropped the ball on SRU-ing the patches.

tl;dr, the problem is with the host kernel for the VM losing irq acks from the guest and then failing to trigger any subsequent interrupts. So it's a problem in xenial and not zesty.

I will get SRU patches sent to the list for the other bug. You'll probably just want to confirm that this one is the same problem then dup it to that bug.

Colin Ian King (colin-king) wrote :

@Seth I'll give it a spin. Thanks!

Colin Ian King (colin-king) wrote :

Let's see how the SRU fixes work on this one and re-test. I'm still having trouble reproducing this. My gut feeling is that we should get this SRU'd and see if it fixes the issue, rather than me wasting more cycles trying to verify the fix when it seems clear that the fix in bug #1649718 addresses the issue.

Colin Ian King (colin-king) wrote :

I don't see any issue now with 4.10.0-8 (i386 + amd64), so marking this as Fix Released.

Changed in linux (Ubuntu):
status: In Progress → Fix Released