Android boot test failed on TC2
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Linaro Android |
Fix Released
|
High
|
vishal |
Bug Description
I have submitted an job on TC2 with Android image. where i have notice boot_linaro_
issues seems to be kernel issue so i have reported on to "linaro-
LAVA JOB:
--------------
http://
http://
Error log:
--------------
The system is going down for reboot NOW!
udevd[1974]: '/sbin/blkid -o udev -p /dev/mmcblk0p6' [2013] terminated by signal 15 (Terminated)
* Asking all remaining processes to terminate... [80G [74G[ OK ]
* All processes ended within 1 seconds.... [80G [74G[ OK ]
* Deconfiguring network interfaces... [80G [74G[ OK ]
* Deactivating swap... [80G [74G[ OK ]
umount: /run/lock: not mounted
umount: /run/shm: not mounted
* Will now restart
[ 2642.612250]
[ 2642.616702] =======
[ 2642.635205] [ INFO: possible circular locking dependency detected ]
[ 2642.653973] 3.7.0-rc8+ #1 Not tainted
[ 2642.664923] -------
[ 2642.683688] reboot/2183 is trying to acquire lock:
[ 2642.698025] (s_active#
[ 2642.719970]
[ 2642.719970] but task is already holding lock:
[ 2642.737433] (&per_cpu(
[ 2642.766522]
[ 2642.766522] which lock already depends on the new lock.
[ 2642.766522]
[ 2642.791018]
[ 2642.791018] the existing dependency chain (in reverse order) is:
[ 2642.813430]
-> #1 (&per_cpu(
[ 2642.830923] [<c005cc3f>] lock_acquire+
[ 2642.846317] [<c03dea07>] down_write+
[ 2642.861191] [<c02e8bb3>] lock_policy_
[ 2642.879447] [<c02e8c47>] store+0x23/0x60
[ 2642.893015] [<c0100319>] sysfs_write_
[ 2642.909447] [<c00bb325>] vfs_write+0x69/0xcc
[ 2642.924059] [<c00bb4d3>] sys_write+0x2f/0x50
[ 2642.938667] [<c000cc81>] ret_fast_
[ 2642.954843]
-> #0 (s_active#
[ 2642.967137] [<c005c493>] __lock_
[ 2642.984091] [<c005cc3f>] lock_acquire+
[ 2642.999481] [<c0101399>] sysfs_addrm_
[ 2643.016696] [<c010174f>] sysfs_remove_
[ 2643.033131] [<c02531c3>] kobject_
[ 2643.048003] [<c025321f>] kobject_
[ 2643.064177] [<c02e83bb>] cpufreq_
[ 2643.080090] [<c02e94dd>] __cpufreq_
[ 2643.098085] [<c03d98bd>] cpufreq_
[ 2643.115560] [<c0037d21>] notifier_
[ 2643.132774] [<c001e4e3>] __cpu_notify+
[ 2643.148167] [<c03d2a99>] _cpu_down+
[ 2643.163038] [<c001e713>] disable_
[ 2643.180511] [<c002b2df>] kernel_
[ 2643.196425] [<c002b417>] sys_reboot+
[ 2643.211940] [<c000cc81>] ret_fast_
[ 2643.228162]
[ 2643.228162] other info that might help us debug this:
[ 2643.228162]
[ 2643.252136] Possible unsafe locking scenario:
[ 2643.252136]
[ 2643.269859] CPU0 CPU1
[ 2643.283414] ---- ----
[ 2643.296969] lock(&per_
[ 2643.311953] lock(s_active#27);
[ 2643.329210] lock(&per_
[ 2643.351755] lock(s_active#27);
[ 2643.361636]
[ 2643.361636] *** DEADLOCK ***
[ 2643.361636]
[ 2643.379529] 4 locks held by reboot/2183:
[ 2643.391261] #0: (reboot_
[ 2643.413458] #1: (cpu_add_
[ 2643.439821] #2: (cpu_hotplug.
[ 2643.464621] #3: (&per_cpu(
[ 2643.494890]
[ 2643.494890] stack backtrace:
[ 2643.507940] [<c0012101>] (unwind_
[ 2643.536089] [<c03da771>] (print_
[ 2643.565020] [<c005c493>] (__lock_
[ 2643.591990] [<c005cc3f>] (lock_acquire+
[ 2643.619099] [<c0101399>] (sysfs_
[ 2643.647249] [<c010174f>] (sysfs_
[ 2643.673055] [<c02531c3>] (kobject_
[ 2643.698603] [<c025321f>] (kobject_
[ 2643.725191] [<c02e83bb>] (cpufreq_
[ 2643.753600] [<c02e94dd>] (__cpufreq_
[ 2643.783572] [<c03d98bd>] (cpufreq_
[ 2643.812764] [<c0037d21>] (notifier_
[ 2643.839873] [<c001e4e3>] (__cpu_
[ 2643.864636] [<c03d2a99>] (_cpu_down+
[ 2643.891483] [<c001e713>] (disable_
[ 2643.919372] [<c002b2df>] (kernel_
[ 2643.945177] [<c002b417>] (sys_reboot+
[ 2643.972176] CPU0 packing on CPU-1
[ 2643.982117] CPU4 packing on CPU-1
[ 2643.992134] CPU0 packing on CPU-1
[ 2644.008833] CPU4: shutdown
[ 2644.017967] Restarting system.
Rebooting...
Disabling debug USB.
Switching off ATX PSU.
Board powered down, rebooting...
ARM V2M Firmware v3.1.1
Build Date: Aug 20 2012
Date: Fri 25 Jan 2013
Time: 04:04:07
information type: | Proprietary → Public |
description: | updated |
affects: | lava-android-test → linaro-android |
Changed in linaro-android: | |
milestone: | none → 13.02 |
status: | New → Fix Committed |
Changed in linaro-android: | |
status: | Fix Committed → Fix Released |
The attached log snippet in the original report shows kernel bug in the LAVA Master Image but this didn't prevent the board from rebooting and is not the cause of the test failure.
Looking at the full logs at: http:// validation. linaro. org/lava- server/ scheduler/ job/45998/ log_file# entry20
I see:
[3] LAVA Android Test Image 6FA3-4484- AF4F-0A000A8D3A 82)/HD( 3,MBR,0x0000000 0,0x200000, 0x20000) /uImage 6FA3-4484- AF4F-0A000A8D3A 82)/HD( 3,MBR,0x0000000 0,0x200000, 0x20000) /uInitrd ttyAMA0, 38400n8 rootwait ro init=/init androidboot. console= ttyAMA0 6FA3-4484- AF4F-0A000A8D3A 82)/HD( 3,MBR,0x0000000 0,0x200000, 0x20000) /v2p-ca15- tc2.dtb ------- ------- -- 37FF-4070- B471-BB4AF12A72 4A)/MemoryMappe d(0x0,0xE800000 ,0xE803000) ------- ------- --
- VenHw(09831032-
- Initrd: VenHw(09831032-
- Arguments: console=tty0 console=
- FDT: VenHw(09831032-
- LoaderType: Linux kernel with Local FDT
-------
Global FDT Config
- VenHw(1F15DA3C-
-------
[a] Boot Manager
[b] Shell
Start: 3
ERROR: Did not find Device Tree blob.
So the tests failed because of a missing DTB, And looging in the boot.tar.bz2 file for the image it was trying to test (https:/ /snapshots. linaro. org/android/ ~linaro- android- restricted/ vexpress- linaro- iks/86/ boot.tar. bz2) I see that the device-tree has a different name than expected, namely 'vexpress- v2p-ca15- tc2.dtb' .
I therefore suspect that the test image was not produce by the normal vexpress BoardConfig.mk which renames the device-tree.
I don't see that this is a bug with the ARM LT kernel or the standard vexpress Android config, so I'll mark this bug as invalid for the ARM LT.