Very bad I/O performance with the Linux kernel in Hardy
| Affects | Status | Importance | Assigned to | Milestone |
|---|---|---|---|---|
| Linux | Confirmed | Unknown | | |
| linux (Ubuntu) | New | Undecided | Unassigned | |
Bug Description
Binary package hint: linux-image-
I see very bad I/O performance when doing a large copy of small files on my laptop.
I'm using dm-crypt for my root filesystem and also on my external USB HDD. When I try to copy around 16 GB of data from my root filesystem to the external USB HDD, the performance is very bad: I eventually end up with a speed of about 50 kB/s, and the system is barely responsive.
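To put that figure in perspective, here is a quick back-of-the-envelope estimate of how long the copy would take at the degraded rate. This is only a sketch; the 16 GB size and the ~50 kB/s rate are taken from the description above, and I treat them as 16 GiB and 50 KiB/s for round numbers.

```shell
# Rough ETA for copying 16 GiB at the degraded ~50 kB/s rate.
bytes=$((16 * 1024 * 1024 * 1024))   # 16 GiB in bytes
rate=$((50 * 1024))                  # ~50 kB/s in bytes per second
secs=$((bytes / rate))
echo "ETA: $((secs / 86400)) days"   # prints "ETA: 3 days"
```

Nearly four days for a copy that should take well under an hour at normal USB 2.0 speeds.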
These are some of the more interesting messages I see in dmesg:
[12485.056379] ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
[12485.056729] ata2.00: cmd a0/00:00:
[12485.056761] cdb 43 00 00 00 00 00 00 00 0c 00 00 00 00 00 00 00
[12485.056823] res 51/20:03:
[12485.056919] ata2.00: status: { DRDY ERR }
[12485.057436] ata2: soft resetting link
[12487.704204] ata2.00: configured for UDMA/33
[12487.704652] ata2: EH complete
[13077.097783] CPU0 attaching NULL sched-domain.
[13077.097911] CPU1 attaching NULL sched-domain.
[13077.122603] CPU0 attaching sched-domain:
[13077.122698] domain 0: span 03
[13077.122731] groups: 01 02
[13077.122825] domain 1: span 03
[13077.122858] groups: 03
[13077.122921] CPU1 attaching sched-domain:
[13077.122954] domain 0: span 03
[13077.122987] groups: 02 01
[13077.123051] domain 1: span 03
[13077.123113] groups: 03
[13257.567642] CPU0 attaching NULL sched-domain.
[13257.567768] CPU1 attaching NULL sched-domain.
[13257.817747] CPU0 attaching sched-domain:
[13257.818066] domain 0: span 03
[13257.818100] groups: 01 02
[13257.818194] CPU1 attaching sched-domain:
[13257.818227] domain 0: span 03
[13257.818260] groups: 02 01
[16918.726319] CPU0 attaching NULL sched-domain.
[16918.726478] CPU1 attaching NULL sched-domain.
[16918.742973] CPU0 attaching sched-domain:
[16918.743068] domain 0: span 03
[16918.743131] groups: 01 02
[16918.743196] domain 1: span 03
[16918.743258] groups: 03
[16918.743322] CPU1 attaching sched-domain:
[16918.743354] domain 0: span 03
[16918.743387] groups: 02 01
[16918.743481] domain 1: span 03
[16918.743513] groups: 03
[17708.858904] CPU0 attaching NULL sched-domain.
[17708.859125] CPU1 attaching NULL sched-domain.
[17708.884150] CPU0 attaching sched-domain:
[17708.884278] domain 0: span 03
[17708.884343] groups: 01 02
[17708.884469] CPU1 attaching sched-domain:
[17708.884501] domain 0: span 03
[17708.884534] groups: 02 01
[18584.899435] CPU0 attaching NULL sched-domain.
[18584.899593] CPU1 attaching NULL sched-domain.
[18584.918651] CPU0 attaching sched-domain:
[18584.918747] domain 0: span 03
[18584.918810] groups: 01 02
[18584.918875] domain 1: span 03
[18584.918937] groups: 03
[18584.918971] CPU1 attaching sched-domain:
[18584.919033] domain 0: span 03
[18584.919065] groups: 02 01
[18584.919129] domain 1: span 03
[18584.919162] groups: 03
LSB Output
rrs@learner:~$ lsb_release -rd
Description: Ubuntu 8.04
Release: 8.04
Kernel:
rrs@learner:~$ uname -a
Linux learner 2.6.24-16-generic #1 SMP Thu Apr 10 13:23:42 UTC 2008 i686 GNU/Linux
Machine:
Dell XPS M1210
Processor: 2.0 GHz
RAM: 2 GB
HDD: SATA 60 GB
The rest should be available from the attached dmesg file.
Changed in linux: status: Unknown → Confirmed
Attached is the output of iostat when the copy was taking place.
The interesting thing to notice is that the I/O starts at a good speed, keeps copying at a good speed, and then the performance gradually degrades.
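The pattern can be checked mechanically from an `iostat -k` log by pulling out the kB_wrtn/s column for the target disk. A minimal sketch, assuming the disk shows up as `sdb`; the log content below is made-up stand-in data, not the attached file:

```shell
# Build a small stand-in for an iostat -k log (values are illustrative only).
cat > /tmp/iostat-sample.log <<'EOF'
Device:            tps    kB_read/s    kB_wrtn/s    kB_read      kB_wrtn
sdb              95.00         0.00     12000.00          0        60000
sdb              80.00         0.00      4000.00          0        20000
sdb               3.00         0.00        50.00          0          250
EOF

# Print the write throughput (kB_wrtn/s, column 4) for sdb over time.
awk '$1 == "sdb" {print $4}' /tmp/iostat-sample.log
```

Run against the real attachment, this makes the collapse from several MB/s down to tens of kB/s easy to see at a glance.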
I'm also adding the output of top, which is pretty interesting:
rrs@learner:~$ top
top - 02:17:22 up 4:11, 1 user, load average: 8.93, 8.99, 5.34
Tasks: 152 total, 4 running, 148 sleeping, 0 stopped, 0 zombie
Cpu(s): 61.3%us, 28.9%sy, 1.2%ni, 0.0%id, 0.0%wa, 0.5%hi, 8.1%si, 0.0%st
Mem: 2074320k total, 2018768k used, 55552k free, 3588k buffers
Swap: 2621432k total, 38436k used, 2582996k free, 1088672k cached
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
6460 rrs 20 0 38224 19m 15m R 57 1.0 11:29.92 kded
7419 rrs 20 0 785m 545m 35m R 49 26.9 7:01.30 kontact
13698 rrs 20 0 181m 67m 26m S 38 3.3 0:46.24 firefox
6613 rrs 20 0 95940 47m 25m S 20 2.4 2:50.79 kopete
5585 root 20 0 79964 51m 5468 R 16 2.6 5:09.88 Xorg
25743 rrs 20 0 2308 1144 856 R 15 0.1 0:04.24 top
6824 rrs 20 0 34048 16m 11m S 7 0.8 0:38.62 konsole
2663 root 15 -5 0 0 0 S 5 0.0 8:57.19 kcryptd
18011 rrs 39 19 93568 30m 10m S 3 1.5 3:50.96 beagled-helper
48 root 15 -5 0 0 0 S 1 0.0 0:09.40 kblockd/1
6455 rrs 20 0 25912 3260 1688 S 1 0.2 0:18.60 dcopserver
6637 rrs 20 0 35012 3724 2920 S 1 0.2 0:03.50 pulseaudio
6642 rrs 20 0 31892 11m 9148 S 1 0.6 0:03.48 klipper
6522 rrs 20 0 31624 8160 5812 S 1 0.4 0:13.23 artsd
1 root 20 0 2844 1692 544 S 0 0.1 0:01.42 init
2 root 15 -5 0 0 0 S 0 0.0 0:00.00 kthreadd
This was captured around 20 seconds after I interrupted the copy (sorry, I couldn't do it any earlier; the machine was not responsive at that stage). An interesting thing to note is why the KDE applications are eating up so many CPU cycles. I'm doing the copy from the KDE file manager, Dolphin.
Also worth pointing out: while the I/O is running, kcryptd takes up a good share of CPU cycles (around 20-25%), but I think that is to be expected, because encryption/decryption itself is expensive. What bothers me most is the degradation of the entire OS during such I/O.