MFG: Habanero: hxestorage exerciser logs task blocked messages in dmesg when running disks under PMC Sierra

Bug #1505178 reported by bugproxy
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Expired
Medium
Unassigned

Bug Description

== Comment: #0 ==

When running STX on Habanero systems with PMC Sierra, the following linux error messages are found when running "dmesg -T --level=alert,crit,err" after the run.

[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18049 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18177 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18181 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18185 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18189 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18194 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18200 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18205 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18213 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18221 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.

We are running the following code levels.

       ver 1.5.4.3 - OS, HTX, Firmware and Machine details

                           OS: GNU/Linux
                   OS Version: Ubuntu 14.04.3 LTS \n \l
               Kernel Version: 3.19.0-25-generic
                  HTX Version: htxubuntu-357
                    Host Name: rcx2c357
            Machine Serial No: 1035C5A
           Machine Type/Model: 8348-21C

We have a very limited number of PMC Sierra configs. I've seen this error on both EC3S and ECSY PMC adapter types. We've only run systems with 6TB drives or a mix of 6TB and 8TB disk drives so far.

== Comment: #5 ==
Call Trace:

dmesg -T

---------------
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18049 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] hxestorage D 00003fff78c69a20 0 18049 451 0x00040000
[Fri Oct 2 12:36:52 2015] Call Trace:
[Fri Oct 2 12:36:52 2015] [c00000791de17490] [c0000079111f8980] 0xc0000079111f8980 (unreliable)
[Fri Oct 2 12:36:52 2015] [c00000791de17660] [c000000000015934] __switch_to+0x204/0x350
[Fri Oct 2 12:36:52 2015] [c00000791de176c0] [c000000000a11948] __schedule+0x368/0x8d0
[Fri Oct 2 12:36:52 2015] [c00000791de178e0] [c000000000a124e0] schedule_preempt_disabled+0x20/0x30
[Fri Oct 2 12:36:52 2015] [c00000791de17900] [c000000000a1464c] __mutex_lock_slowpath+0xfc/0x1f0
[Fri Oct 2 12:36:52 2015] [c00000791de17980] [c000000000a147ac] mutex_lock+0x6c/0x70
[Fri Oct 2 12:36:52 2015] [c00000791de179b0] [c0000000003050a8] __blkdev_get+0xa8/0x4d0
[Fri Oct 2 12:36:52 2015] [c00000791de17a20] [c000000000305730] blkdev_get+0x260/0x4d0
[Fri Oct 2 12:36:52 2015] [c00000791de17ad0] [c0000000002b10c0] do_dentry_open+0x270/0x410
[Fri Oct 2 12:36:52 2015] [c00000791de17b30] [c0000000002c6368] do_last+0x1b8/0xf30
[Fri Oct 2 12:36:52 2015] [c00000791de17c00] [c0000000002c96fc] path_openat+0xdc/0x7c0
[Fri Oct 2 12:36:52 2015] [c00000791de17cd0] [c0000000002cb358] do_filp_open+0x58/0xd0
[Fri Oct 2 12:36:52 2015] [c00000791de17db0] [c0000000002b2e88] do_sys_open+0x1c8/0x390
[Fri Oct 2 12:36:52 2015] [c00000791de17e30] [c000000000009258] system_call+0x38/0xd0
[Fri Oct 2 12:36:52 2015] INFO: task hxestorage:18177 blocked for more than 120 seconds.
[Fri Oct 2 12:36:52 2015] Tainted: G OE 3.19.0-25-generic #26~14.04.1-Ubuntu
[Fri Oct 2 12:36:52 2015] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[Fri Oct 2 12:36:52 2015] hxestorage D 00003fff78c69a20 0 18177 451 0x00040000
[Fri Oct 2 12:36:52 2015] Call Trace:
[Fri Oct 2 12:36:52 2015] [c00000791beb7490] [00000000000025f7] 0x25f7 (unreliable)
[Fri Oct 2 12:36:52 2015] [c00000791beb7660] [c000000000015934] __switch_to+0x204/0x350
[Fri Oct 2 12:36:52 2015] [c00000791beb76c0] [c000000000a11948] __schedule+0x368/0x8d0
[Fri Oct 2 12:36:52 2015] [c00000791beb78e0] [c000000000a124e0] schedule_preempt_disabled+0x20/0x30
[Fri Oct 2 12:36:52 2015] [c00000791beb7900] [c000000000a1464c] __mutex_lock_slowpath+0xfc/0x1f0
[Fri Oct 2 12:36:52 2015] [c00000791beb7980] [c000000000a147ac] mutex_lock+0x6c/0x70
[Fri Oct 2 12:36:52 2015] [c00000791beb79b0] [c0000000003050a8] __blkdev_get+0xa8/0x4d0
[Fri Oct 2 12:36:52 2015] [c00000791beb7a20] [c000000000305730] blkdev_get+0x260/0x4d0
[Fri Oct 2 12:36:52 2015] [c00000791beb7ad0] [c0000000002b10c0] do_dentry_open+0x270/0x410
[Fri Oct 2 12:36:52 2015] [c00000791beb7b30] [c0000000002c6368] do_last+0x1b8/0xf30
[Fri Oct 2 12:36:52 2015] [c00000791beb7c00] [c0000000002c96fc] path_openat+0xdc/0x7c0
[Fri Oct 2 12:36:52 2015] [c00000791beb7cd0] [c0000000002cb358] do_filp_open+0x58/0xd0
[Fri Oct 2 12:36:52 2015] [c00000791beb7db0] [c0000000002b2e88] do_sys_open+0x1c8/0x390
[Fri Oct 2 12:36:52 2015] [c00000791beb7e30] [c000000000009258] system_call+0x38/0xd0

bugproxy (bugproxy)
tags: added: architecture-ppc64 bugnameltc-131395 severity-high targetmilestone-inin14043
Changed in ubuntu:
assignee: nobody → Taco Screen team (taco-screen-team)
Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1505178/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Luciano Chavez (lnx1138)
affects: ubuntu → linux (Ubuntu)
Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2015-10-13 21:05 EDT-------
==== State: Assigned by: tuandoan on 13 October 2015 15:56:40 ====

#=#=# 2015-10-13 15:56:27 (CDT) #=#=#
New Fix_Potential = [P810.00D]
#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#

penalvch (penalvch)
Changed in linux (Ubuntu):
importance: Undecided → Medium
status: New → Triaged
Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2015-11-05 21:41 EDT-------
==== State: Assigned by: tuandoan on 05 November 2015 15:16:40 ====

#=#=# 2015-11-05 15:16:38 (CST) #=#=#
New Fix_Potential = [P810.20W]
#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#=#

Revision history for this message
Michael Hohnbaum (hohnbaum) wrote :

Canonical does not have access to this hardware, will need IBM to propose a fix.

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2016-02-22 21:35 EDT-------
Back in October, it was suggested the reporter try a later kernel and also use a different I/O scheduler but no feedback was provided. Returning bug as insufficient data.

Changed in linux (Ubuntu):
assignee: Taco Screen team (taco-screen-team) → nobody
Revision history for this message
Andrew Cloke (andrew-cloke) wrote :

Marking bug as incomplete, as per comment #5, no response was received.

Changed in linux (Ubuntu):
status: Triaged → Incomplete
Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-02-06 12:36 EDT-------

Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.