Ubuntu 18.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.

Bug #1709889 reported by bugproxy
Affects                            Status        Importance  Assigned to            Milestone
Linux                              Unknown       Unknown
The Ubuntu-power-systems project   Fix Released  Medium      Canonical Kernel Team
linux (Ubuntu)                     Fix Released  Medium      Canonical Kernel Team
linux (Ubuntu Zesty)               Won't Fix     Medium      Unassigned
linux (Ubuntu Bionic)              Fix Released  Medium      Unassigned

Bug Description

---Problem Description---
When running a stress test, we sometimes see hung I/O reported in dmesg, or "Host adapter abort request" errors.

---Steps to Reproduce---
There are two ways to re-create the issue:
(1) Run HTX; an I/O timeout backtrace appears in dmesg within several hours.
(2) Run some I/O test, then reboot the system, and repeat these two steps; it takes a long time to re-create the issue this way.

---uname output---
4.10.0-11-generic (still reproducible up to the latest kernel in Bionic)

The bulk of the effort for this issue is currently being worked in MicroSemi's JIRA https://jira.pmcs.com/browse/ESDIBMOP-133.

Ran an interesting test: ran HTX until I started getting the "stall" messages on the console, then shut down HTX and examined the I/O counters for the tested disks in sysfs:

root@bostonp15:~# for i in /sys/devices/pci0003:00/0003:00:00.0/0003:01:00.0/host0/target0:2:[2345]/0:2:[2345]:0; do echo ${i##*/} $(<${i}/iorequest_cnt) $(<${i}/iodone_cnt); done
0:2:2:0 0x5eba3d 0x5eba3d
0:2:3:0 0x773cc9 0x773cc9
0:2:4:0 0x782c61 0x782c61
0:2:5:0 0x5ca134 0x5ca134
root@bostonp15:~#

So, none of the disks showed any evidence of having lost an I/O. I then restarted HTX and, aside from having to manually restart one of the disks, saw no problems with the testing. It appears that what was "hung" was purely in userland.

This does not absolve the kernel or aacraid driver from blame, but it shows that the OS "believes" that it completed the I/O and thus removed it from the queue. What we don't know is whether the OS truly notified HTX about the completion, or if HTX (or userland libraries) just failed to process the notification.

Tests are running again, will see what happens next.

Update from JIRA:

I have run some more experiments. Not sure what it tells us, but here's what I've seen.

First test, ran until I got kernel messages about stalled tasks, then shutdown HTX. After HTX was down, I checked the above mentioned counters and found that on each disk iorequest_cnt matched iodone_cnt. The disks were usable and I could restart HTX. This suggests that the problem is not in the PM8069 firmware, and makes the case for the aacraid driver having a bug somewhat weaker. However, this merely says that the driver "completed" the I/O as far as the kernel is concerned, not that a completion rippled back to the application.

I restarted HTX and ran until errors occurred. This time, I am leaving HTX running and observing. Two of the disks reached the HTX error threshold and the testers stopped (those 2 disks are now idle). Another disk saw errors, but they then stopped and it appears to be running fine now. The last disk has not seen any errors (yet). On the two idle (errored-out) disks I see iorequest_cnt matches iodone_cnt. I am able to "terminate and restart" the two idle disks, and HTX appears to be testing them again "normally". Note that no reboot was required, further supporting the evidence that, as far as the kernel is concerned, there is nothing wrong with the disks and their I/O paths.

So, I don't believe this completely eliminates aacraid from the picture, especially given we don't see this behavior on other systems/drivers. But it probably moves the focus of the investigation away from the adapter firmware.

Tried building an upstream 4.11 kernel on Ubuntu. This still gets the hangs. Both Ubuntu 4.10 and upstream 4.11 have aacraid driver 1.2.1[50792]-custom.

Good news/bad news... While doing an initial evaluation of the LSI-3008 SAS HBA on Boston and Ubuntu 17.04, I am hitting this same problem. So, it appears to have nothing specific to do with the PM8069 or aacraid driver.

Some notes on reproducing this. I have been using the github release of HTX, built using the following steps (a consolidated command sketch follows below):

1. apt install make gcc g++ git libncurses5-dev libcxl-dev libdapl-dev (others may be required)
2. git clone https://github.com/open-power/HTX
3. cd HTX
4. make
5. make deb

Then install the resulting "htxubuntu.deb" package.
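
Putting those steps together, a rough end-to-end sketch (the dpkg install step and the .deb filename/location are assumptions based on the notes above; adjust for your environment):

apt install make gcc g++ git libncurses5-dev libcxl-dev libdapl-dev   # others may be required
git clone https://github.com/open-power/HTX
cd HTX
make
make deb
dpkg -i htxubuntu.deb   # install the resulting package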

Note, HTX will not test disks that have a filesystem or OS installed, so at least two disks must be made available to HTX by clearing any previous data. A partition table is optional; in my testing I had none.
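
As an example of clearing a disk so that HTX will test it (the device name here is just a placeholder; double-check it first, since this destroys any data on the disk):

wipefs -a /dev/sdX          # remove any filesystem/partition-table signatures
# or, more bluntly, zero out the start of the disk:
dd if=/dev/zero of=/dev/sdX bs=1M count=16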

Also, it may be desirable to run HTX somewhere other than the console, leaving the console free to watch for messages.

To run:

A. su - htx (this may take some time)
B. htx
C. Select the test file "mdt.io"
D. Hit ENTER for default log file option
E. Once the menu is displayed, select item 2 (Enable/disable hardware to test)
    E1. Enter "h" to disable (halt) all devices testing
    E2. Select at least two disks for testing (enter their line numbers)
    E3. Enter "q" to return to main menu
F. Select item "4" (Continue On Error flags)
    F1. Enter line numbers for each disk previously selected to test.
    F2. Enter "q" to return to main menu.
G. Select item "1" to begin the test exercisers.
H. Optionally, select item "5" to display status of testing.

After about 10-12 hours, there should be a few "INFO: task hxestorage:XXXXX blocked for more than 120 seconds." messages with stack traces. The typical stack trace is:

 sysctl_sched_migration_cost+0x0/0x4 (unreliable)
 __switch_to+0x2c0/0x450
 __schedule+0x2f8/0x990
 schedule+0x48/0xc0
 schedule_timeout+0x274/0x470
 io_schedule_timeout+0xd0/0x160
 debug_schedule+0x318/0x3c0
 __blkdev_direct_IO_simple+0x258/0x440
 blkdev_direct_IO+0x4e0/0x520
 generic_file_read_iter+0x2c8/0xaa0
 blkdev_read_iter+0x50/0x80
 new_sync_read+0xec/0x140
 vfs_read+0xbc/0x1b0
 SyS_read+0x68/0x110
 system_call+0x38/0xe0

About 8 minutes after the "blocked" messages, you should start to see HTX reporting errors in "/tmp/htxerr" (HTX reports errors for I/Os that do not complete in 10 minutes, but continues to run).

With added debugging, it was seen that the I/Os do eventually complete, but in some cases it can take over an hour. It is also observed that I/O traffic continues through these periods of stalls, and so only a portion of the total I/O traffic actually gets stalled. The system does not hang, and if HTX is shutdown (stopped), any stalled I/Os will complete immediately.

Referencing LP1469829, it was requested that the "cfq" scheduler not be used by default, as it has this exact sort of bug, and that "deadline" be used instead. Somewhere, the default got reverted back to "cfq", which exposes this bug again. It appears that the bug in "cfq" was never fixed, either.

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1469829
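
For reference, the active scheduler for a disk can be checked and changed at runtime through sysfs (device name is a placeholder), or set globally with the elevator= kernel command-line parameter on kernels that still offer cfq:

cat /sys/block/sdX/queue/scheduler      # the active scheduler is shown in brackets, e.g. "noop deadline [cfq]"
echo deadline > /sys/block/sdX/queue/scheduler
# or, for all disks at boot, add "elevator=deadline" to the kernel command line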

A couple of upstream commits of interest, ordered by perceived relevance.

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=5be6b75610cefd1e21b98a218211922c2feb6e08

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=142bbdfccc8b3e9f7342f2ce8422e76a3b45beae

Revision history for this message
bugproxy (bugproxy) wrote : sorted output from htxerr

Default Comment by Bridge

tags: added: architecture-ppc64le bugnameltc-152603 severity-critical targetmilestone-inin1704
Changed in ubuntu:
assignee: nobody → Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage)
affects: ubuntu → linux (Ubuntu)
Changed in ubuntu-power-systems:
importance: Undecided → Critical
assignee: nobody → Canonical Kernel Team (canonical-kernel-team)
Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2017-08-11 07:44 EDT-------
Testing shows that this commit appears to fix the problem. After 20 hours, no evidence of stalled I/Os.

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=5be6b75610cefd1e21b98a218211922c2feb6e08

This fixes a problem introduced by having two modes of operation for cfq, each using a different timebase, without separate scheduling-delay (the time limit before forcing I/O submission) settings for each mode. What appears to be the default mode, "iops", ended up using a delay that allowed I/Os to be postponed for up to 200000000 jiffies (which is hundreds of hours).
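
As a rough sanity check of that figure (assuming HZ=250, a common distro setting; the exact value depends on the kernel config):

echo "$((200000000 / 250)) seconds, or about $((200000000 / 250 / 3600)) hours"   # prints "800000 seconds, or about 222 hours"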

Changed in linux (Ubuntu):
status: New → In Progress
importance: Undecided → Critical
assignee: Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage) → Joseph Salisbury (jsalisbury)
Changed in linux (Ubuntu Zesty):
status: New → In Progress
importance: Undecided → Critical
assignee: nobody → Joseph Salisbury (jsalisbury)
Frank Heimes (fheimes)
Changed in ubuntu-power-systems:
status: New → In Progress
Revision history for this message
Joseph Salisbury (jsalisbury) wrote : Re: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.

I built a 17.04(Zesty) test kernel with a pick of commit 5be6b75610ce. This kernel can be downloaded from:

http://kernel.ubuntu.com/~jsalisbury/lp1709889/

Can you test this kernel and see if it resolves this bug?

I also see that commit 5be6b75610ce has been cc'd to upstream stable, but it has not landed in upstream 4.10.y yet.

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2017-08-14 10:33 EDT-------
I am currently making a long test run to collect some data. It may be the case that cfq delays actually increase over time, even with this fix. That may be evidence that cfq is not a good default choice for I/O scheduler, but I want to collect more data.

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-15 09:26 EDT-------
Here is what I am observing, and what leads me to think that "cfq" may not (yet) be a good choice for the default io-sched.

The test exerciser, HTX (https://github.com/open-power/HTX - POWER arch only), causes stress on CFQ during certain cycles. I set the debug timeout threshold for completion of I/Os at 60 seconds (upon timeout, debugging is printed, then io_schedule() is called, after which more debugging is printed). The I/O delays seem to vary continuously throughout the range, but using a timeout lower than 60 seconds just produced too much output.

During certain cycles, where about 1 million I/Os per hour are being performed on each disk, we see timeouts being triggered. Essentially, the timeout happens because CFQ has not even submitted the I/O to SCSI yet. Earlier debugging showed that once the I/O actually gets submitted to SCSI, it completes promptly.

Without patch 5be6b75610ce, these I/Os could (sometimes) take an hour or more to get submitted to SCSI. With the patch, that delay seems to max out at around 110 seconds, which is a great improvement but still indicates a problem.

I typically see about 400-500 I/Os trip the 60-second timeout during a given ~2 hour cycle (estimated 4 million I/Os total), so it is not a huge percentage. However, the I/Os affected seem to be related, possibly by process or thread, and so this could be detrimental to an application. Note, the number of I/Os taking between 30 and 60 seconds is not known, but is expected to be much higher. Even 30 seconds may be an undesirable number. It's not clear just how CFQ chooses the delay value and what overrides it.

On a run with the scheduler set to "deadline", I never see any I/Os trip the 60-second timeout.

I think this shows undesirable behavior in CFQ, possibly a bug, and that it should not be the default scheduler - especially for servers. Is there some evidence that shows CFQ to be better than deadline in general?

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-15 09:32 EDT-------
I should also add that the data indicates there is an instability to the delay: the initial stalled I/Os max out at about 70 seconds, the next cycle (~10 hours later) and the two after it show max values in the 90-100 second range, and the following 5 cycles show numbers around 110 seconds (but tapering off). I think this unpredictability is further indication of a bug in CFQ. Clearly it does not live up to its name, at least for these particular I/Os.

Changed in linux (Ubuntu):
assignee: Joseph Salisbury (jsalisbury) → Colin Ian King (colin-king)
Changed in linux (Ubuntu Zesty):
assignee: Joseph Salisbury (jsalisbury) → Colin Ian King (colin-king)
Revision history for this message
Colin Ian King (colin-king) wrote : Re: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.

I've rolled three patches into a test kernel to test. I believe the best fix is probably "cfq: Disable writeback throttling by default" https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=142bbdfccc8b3e9f7342f2ce8422e76a3b45beae

The commits are:
4d608baac5f4e72b033a122b2d6d9499532c3afc "block: Initialize cfqq->ioprio_class in cfq_get_queue()"
142bbdfccc8b3e9f7342f2ce8422e76a3b45beae "cfq: Disable writeback throttling by default"
5be6b75610cefd1e21b98a218211922c2feb6e08 "cfq-iosched: fix the delay of cfq_group's vdisktime under iops mode"

You can download the packages from:
http://kernel.ubuntu.com/~cking/lp1709889/

Let me know if this helps with the issue.

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2017-08-15 16:36 EDT-------
I will start a test with that kernel tomorrow. I will be adding my debug code to it, so that I can track delayed I/Os if they occur. I see you posted source code, so I can do that easily.

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-16 07:23 EDT-------
Oh, there is no source code in the linux-source package. I will have to add those patches to my own source - which won't exactly test your kernel.

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-16 08:17 EDT-------
Even with all three of those patches, I am still seeing delayed I/Os. The pattern looks the same, but I will run for 12+ hours to collect more data. At this point, though, I believe that "cfq" should not be the default scheduler. Are there reasons that it should be the default? What is the background behind the choice to make it the default in Ubuntu? Looking at the upstream code, it seems that if no config parameter is set, the default will be "deadline" (mq-deadline).
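
For reference, the compiled-in default for the running kernel can be checked from its config (the path below assumes the standard Ubuntu layout):

grep -E 'CONFIG_DEFAULT_IOSCHED|CONFIG_DEFAULT_(CFQ|DEADLINE|NOOP)=' /boot/config-$(uname -r)
# e.g. CONFIG_DEFAULT_IOSCHED="cfq" together with CONFIG_DEFAULT_CFQ=y means cfq is the single-queue default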

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-17 08:42 EDT-------
Those three patches, at least in the kernel I am running, actually make things worse. The characteristics have changed, in what appears to be a general slow-down of disk I/O (it took over 12 hours to hit the first set of severe stalls), but the delays - when they do occur - are much worse. I saw I/Os getting delayed for over 40 minutes.

I have double-checked that the patches are installed. But in spite of having the patch for the delay length (5be6b75610cefd1e21b98a218211922c2feb6e08) the behavior is back to what I was seeing before that patch alone.

I'm attaching the combined diff of the changes I made to the kernel. Note, the only difference between the "worse" run and the previous "better" one was the addition of these two patches:

4d608baac5f4e72b033a122b2d6d9499532c3afc "block: Initialize cfqq->ioprio_class in cfq_get_queue()"
142bbdfccc8b3e9f7342f2ce8422e76a3b45beae "cfq: Disable writeback throttling by default"

Which I can't explain, as I don't see how either of those should have made this worse.

Maybe I need the actual source for your test kernel so I can add my debug-monitoring code and run. With 40-minute delays the debug-monitoring code is technically not needed, as HTX will complain. But if, as I was seeing on the previous kernel, the delays are below 10 minutes then HTX will never notice and there will be no obvious indication of the more subtle issue.

Revision history for this message
bugproxy (bugproxy) wrote : Patch with 3 patches plus debugging

------- Comment on attachment From <email address hidden> 2017-08-17 08:44 EDT-------

Here is the patch from my latest run, containing the three proposed patches, getting delays > 40 minutes.

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2017-08-17 15:03 EDT-------
I have started a test run using the binary kernel proposed, and will see if there is evidence of the problem. Results will be available tomorrow.

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-18 07:51 EDT-------
Test results with the binary kernel package show the same symptoms, I/Os getting delayed longer than 10 minutes. It seems that those three patches together cause a regression of the "cfq-iosched: fix the delay of cfq_group's vdisktime under iops mode" patch.

So, in summary, with *only* the patch:

5be6b75610cefd1e21b98a218211922c2feb6e08 "cfq-iosched: fix the delay of cfq_group's vdisktime under iops mode"

I see some improvement of the I/Os delays, although the delays are still too long. But by adding these two patches:

4d608baac5f4e72b033a122b2d6d9499532c3afc "block: Initialize cfqq->ioprio_class in cfq_get_queue()"
142bbdfccc8b3e9f7342f2ce8422e76a3b45beae "cfq: Disable writeback throttling by default"

I see a regression of the delays back to what I was seeing without any patches.

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-22 09:22 EDT-------
Can we get some answers as to why CFQ is the default scheduler? It seems like the expedient fix is to change the default to "deadline".

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-23 08:45 EDT-------
I tried running 4.13-rc6 with "cfq" set as default scheduler. The problem is even worse. I/O delays show up almost immediately. Many exceed the HTX 10-minute limit. It seems that CFQ is even more broken on the latest kernels.

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2017-08-23 09:19 EDT-------
Opened kernel.org bugzilla https://bugzilla.kernel.org/show_bug.cgi?id=196737

Revision history for this message
Colin Ian King (colin-king) wrote : Re: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.

Doug,

"Can we get some answers as to why CFQ is the default scheduler? It seems like the expedient fix is to change the default to "deadline"."

Sure. We went back to CFQ as it was showing itself to be a good general-purpose I/O scheduler for a lot of wide-ranging and general-purpose I/O demands. Unfortunately you seem to have uncovered some cases where it clearly does not behave well for your use case, where deadline would be more appropriate. Like all schedulers, it's hard to choose one that is going to meet every use case and every scenario perfectly, especially across a wide range of devices, device configurations, system memory sizes and file system choices.

We do try to systematically check the performance and latencies of various synthetic test scenarios across a range of file systems on each kernel to try and spot issues: http://kernel.ubuntu.com/~cking/fs-tests/jun-2017/

CFQ has several tunables that may help with your issue; I'd refer you to:

https://www.kernel.org/doc/Documentation/block/cfq-iosched.txt

It may be worth reviewing this document and then seeing if CFQ can be tuned to improve the issues you are seeing.
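
For reference, the per-device CFQ tunables described there live under sysfs (device name is a placeholder), for example:

ls /sys/block/sdX/queue/iosched/
# typically includes slice_idle, group_idle, quantum, low_latency, fifo_expire_sync, fifo_expire_async, ...
cat /sys/block/sdX/queue/iosched/slice_idle
echo 0 > /sys/block/sdX/queue/iosched/slice_idle    # example tweak: disable queue idling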

Revision history for this message
Colin Ian King (colin-king) wrote :

Thanks for also testing with the latest upstream kernel and reporting it to the upstream developers. I've added the link to the bug so we pull in bug updates automatically from the bugzilla.

Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2017-08-23 12:13 EDT-------
By the way, I would not classify this behavior I'm seeing as a performance issue. There are hundreds of I/Os per second on each disk, and most of them are being submitted right away. But a subset of those I/Os are getting delayed for 10 minutes - if not over an hour - which is a huge disparity. The effect this can have on applications could be significant, if not critical. Some of the I/Os are taking at least 3 orders of magnitude longer than the rest. And since no timeouts are in place at this stage, I wonder if any other test frameworks even notice it. You could be hitting this in your test cases but are not aware. Consider what would happen in a database server if the rollback segment I/Os were getting delayed like this.

Some things to think about.

Manoj Iyer (manjo)
tags: added: triage-r
Manoj Iyer (manjo)
tags: added: triage-g
removed: triage-r
Revision history for this message
Andrew Cloke (andrew-cloke) wrote : Re: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.

Marking as "incomplete" until fix lands upstream.

Changed in ubuntu-power-systems:
status: In Progress → Incomplete
Revision history for this message
Steve Langasek (vorlon) wrote :

Ubuntu 17.04 has reached end of life. No further bugfixes will be applied to this version.

Changed in linux (Ubuntu Zesty):
status: In Progress → Won't Fix
Revision history for this message
Manoj Iyer (manjo) wrote :

These patches are available in Artful and will be available in the linux-hwe kernel; please upgrade to the latest linux-hwe and reopen this bug if you are able to reproduce it. Marking it invalid for now.

Changed in linux (Ubuntu):
status: In Progress → Invalid
Changed in ubuntu-power-systems:
status: Incomplete → Invalid
Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2018-05-14 13:40 EDT-------
Note, we are still seeing issues with CFQ on 18.04 (Bionic).

tags: added: targetmilestone-inin1804
removed: targetmilestone-inin1704
bugproxy (bugproxy)
tags: added: severity-high
removed: severity-critical
Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2018-07-16 09:02 EDT-------
*** Bug 169550 has been marked as a duplicate of this bug. ***

Revision history for this message
bugproxy (bugproxy) wrote :

------- Comment From <email address hidden> 2018-07-31 13:10 EDT-------
Doug,
- Do we have patches for this issue? I saw you talked about some, but as I understand it they do not seem satisfactory.
- Can any of the cfq tunables help with this?
- If not, do we have some extended testing/view of the deadline scheduler on Power: would it introduce other issues in place of what it would solve, if it were set as the default?

Canonical,
can we mark this issue as happening on 18.04 too (4.15.0-26-generic)? Launchpad only shows it affects Zesty.

Fred

Revision history for this message
Dimitri John Ledkov (xnox) wrote : Re: Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter for a very long time.

This bug was closed in January 2018. Are you sure this is the same issue / the right ticket? I see that you have recently marked some other bug as a duplicate of this one... maybe you can sync that one? If not, please state what is happening on 18.04, since in January it was believed that this issue was fixed.

Changed in linux (Ubuntu):
status: Invalid → New
importance: Critical → Undecided
assignee: Colin Ian King (colin-king) → nobody
Changed in linux (Ubuntu Zesty):
assignee: Colin Ian King (colin-king) → nobody
Changed in ubuntu-power-systems:
status: Invalid → New
importance: Critical → Undecided
Revision history for this message
Dimitri John Ledkov (xnox) wrote :

For now, reopening the status of this issue in Launchpad as NEW with UNDECIDED priority.

Revision history for this message
Frédéric Bonnard (frediz) wrote :

Dimitri, we've synced the other bug 1785081 that is probably the same issue as this one.

F.

Revision history for this message
Dimitri John Ledkov (xnox) wrote :

But that new LP bug is sparse on details too... can all the comments be pushed through on https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1785081?

What exactly is happening with the bionic kernel, and what is the expectation here?

The comments in this bug report from cking still stand imho. Unless there is something else new?

Revision history for this message
Joseph Salisbury (jsalisbury) wrote :

Updated bug description and added Bionic bug task. Marked bug 1785081 as a duplicate of this bug.

Changed in linux (Ubuntu Bionic):
status: New → Triaged
importance: Undecided → Critical
tags: added: kernel-da-key
summary: - Ubuntu 17.04: Bug in cfq scheduler, I/Os do not get submitted to adapter
+ Ubuntu 18.04: Bug in cfq scheduler, I/Os do not get submitted to adapter
for a very long time.
Manoj Iyer (manjo)
Changed in ubuntu-power-systems:
importance: Undecided → Critical
Changed in linux (Ubuntu):
importance: Undecided → Critical
Changed in ubuntu-power-systems:
status: New → Triaged
Changed in linux (Ubuntu):
assignee: nobody → Canonical Kernel Team (canonical-kernel-team)
Revision history for this message
bugproxy (bugproxy) wrote : Comment bridged from LTC Bugzilla

------- Comment From <email address hidden> 2018-08-06 09:43 EDT-------
We are also seeing this behavior on Bionic 18.04.

I don't understand the request for better comments. The Launchpad comments seem to include a fairly complete description of the problem. Are there specific questions about this problem? I guess there are a lot of comments in the beginning that show the progression of the diagnosis efforts, so perhaps it requires more reading to reach the full problem description.

Revision history for this message
Andrew Cloke (andrew-cloke) wrote :

Is this behaviour also seen with a vanilla upstream 4.15 kernel, or is it unique to the Ubuntu 4.15 bionic kernel?

Manoj Iyer (manjo)
Changed in ubuntu-power-systems:
importance: Critical → Medium
Changed in linux (Ubuntu):
importance: Critical → Medium
Changed in linux (Ubuntu Zesty):
importance: Critical → Medium
Changed in linux (Ubuntu Bionic):
importance: Critical → Medium
Revision history for this message
Andrew Cloke (andrew-cloke) wrote :

After discussion with Michael Ranweiler, moving to medium.

description: updated
Revision history for this message
Christian Ehrhardt  (paelzer) wrote :

I discussed some background with the Kernel Developers today.
Because of that, I sanitized the description a bit to make it clear in the first few lines that this is not only a 4.10 kernel issue.

On a side note, I had to smile, as there is an old LTC bug (LTC22393) that had SUSE switch to deadline on s390x [1], because all the arguments "for cfq" didn't hold true on s390x (with its limited, special use cases compared to x86).
But ppc64le is already more varied in its potential use cases, so I'm not voting for that solution here.

I mostly agree with cking for the bug here, quoting:
"
1. is it a generic issue or a specific issue?
2. is it just a tunable solution or not.
3. is it a specific use case that hits a bug in CFQ, and if so, are there fixes upstream now
"

Independent of this bug, the kernel team will re-check the "which default I/O scheduler" question as a result of our discussion.

For the bug here, it is somewhat unclear to me who currently holds the ball for the next steps; I'll ping Andrew, who updated last, to sort that out (as I know he is good at finding the people responsible :-)

[1]: https://github.com/openSUSE/kernel-source/commit/dc425e5a7544c2feec9ca9a260e47382064eeeb8

Revision history for this message
Frédéric Bonnard (frediz) wrote :

Hi all,

bug #1785081 was not mirrored completely for some reason and I'm working on fixing this: it's just about the bug happening on 18.04, specifically kernel 4.15.0-26. Sorry for the lag.

Also, on ppc64el, RHEL 7.5 with a 4.14 kernel uses deadline, so I guess it's not a completely unsafe path.
I'm not pushing for anything, I just noticed that.

F.

Manoj Iyer (manjo)
Changed in ubuntu-power-systems:
status: Triaged → Incomplete
Revision history for this message
Manoj Iyer (manjo) wrote :

The fixes identified here are available in Bionic:
5be6b75610ce cfq-iosched: fix the delay of cfq_group's vdisktime under iops mode
142bbdfccc8b cfq: Disable writeback throttling by default

Marking this bug as Fix Released. Please retest with the latest Bionic kernel and reopen this bug if this is still an issue.

Changed in linux (Ubuntu):
status: New → Fix Released
Changed in linux (Ubuntu Bionic):
status: Triaged → Fix Released
Changed in ubuntu-power-systems:
status: Incomplete → Fix Released
Brad Figg (brad-figg)
tags: added: cscc