RTL8111/8168B hangs whole PC (r8169) in feisty

Bug #91895 reported by dm
14
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Invalid
Undecided
Unassigned
linux-source-2.6.20 (Ubuntu)
Won't Fix
Undecided
Ben Collins

Bug Description

Binary package hint: linux-image-generic

PC: Asus A6Tc notebook
lspci:
...
01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 01)

kernel: linux-image-2.6.20-9-generic
$ uname -a
Linux phantom 2.6.20-9-generic #2 SMP Mon Feb 26 03:01:44 UTC 2007 i686 GNU/Linux
$ cat /proc/version
Linux version 2.6.20-9-generic (root@rothera) (gcc version 4.1.2 (Ubuntu 4.1.2-0ubuntu3)) #2 SMP Mon Feb 26 03:01:44 UTC 2007

Trying to copy some files over the network from notebook causes system to hang completely. Just power-off/on button helps.
Hangs are at the random position (sometimes it happens when transfered over 300M, sometimes before 50M).

Btw samba (fusesmb too!) is working extremely slow - 10-11Kbps. I think these issues are linked together.

Revision history for this message
Ben Collins (ben-collins) wrote :

Please perform this from a text console and see if the kernel produces a visible crash. Without some sort of crash (or having the hw) we can't do much with this bug.

If the system locks up on text console, then see if you can attach a digital photo of the screen.

Thanks

Changed in linux-source-2.6.20:
assignee: nobody → ben-collins
status: Unconfirmed → Needs Info
Revision history for this message
dm (dmitry-shaposhnik) wrote :

I've noticed that if there is no other network activity big files than I can copy big files (1.5GB) over the network from laptop. With Gaim and Evolution (and listening internet-radio) system crashes.

Will try to operate without X and monitor conditions.

Btw, kernel is 2.6.20-12-generic now.

Revision history for this message
dm (dmitry-shaposhnik) wrote :

Ok, with GDM turned off I see that notebook not halts completely - just network interface is going down. ifdown/ifup helps to restore connection, but anyway transfer is broken.
No records to syslog while going down. Bringing it up produces heap of messages from NetworkManager.

Kernel: 2.6.20-13-generic

Revision history for this message
Phillip Lougher (phillip-lougher) wrote :

Can you attach the messages from NetworkManager. Thanks.

Revision history for this message
dm (dmitry-shaposhnik) wrote :

Well, as interface is going down I wait a few seconds, make copy of logged messages, and than re-insert r8169 module.

Last system update was about a half hour ago.
This bug is mentioned only while transferring data out.

Revision history for this message
dm (dmitry-shaposhnik) wrote :

With 2.6.22 kernel from gutsy link is monitored correctly, but hang-ups are still present.

Revision history for this message
Achim (ach1m) wrote :

please take a look at this bugreport.
https://bugs.launchpad.net/bugs/114171
I think this could be the same problem.

Kernel: 2.6.20-15-generic

Revision history for this message
dm (dmitry-shaposhnik) wrote : Re: [Bug 91895] Re: RTL8111/8168B hangs whole PC (r8169) in feisty

With 2.6.22-3-generic from gutsy samba transfer is about 3,6MB/s for
download and 10Kb/s for upload.
May be the reason is same. And after a few MB of upload my system freezes
and the only way out is to use reset button.

Revision history for this message
Achim (ach1m) wrote :

Maybe someone can take a look at this driver from Realtek.

Direct link to the download.
ftp://209.216.61.149/cn/nic/r8168-8.001.00.tar.bz2

As they say in there release notes, they use a modified linux kernel module.

##################################################################
release date: 2007/04/13
driver version: 8.001.00
1. This driver is modified from the r8169.c in Linux kernel 2.6.18 and
supports Linux 2.6.x only.
2. Support RTL8168C/8111C, the Realtek PCI-E Gigabit Ethernet NIC.
3. Support RTL8168B/8111B, the Realtek PCI-E Gigabit Ethernet NIC.
##################################################################

I have tested this driver and it performances well at download but bad at upload.

I have tested Gutsy Tripe 2 and this bug still exists.

Hopefully someone could find a way to fix this.

I had no system freezes so far, only bad transfer rates over SMBFS/CIFS.

As you can see in this bugreport, this problem doesn't exist if you transfer files over FTP.

https://bugs.launchpad.net/ubuntu/+bug/114171

Has anybody tested filetransfer over FTP?
It would be nice if someone can confirm the filetransfer speed over ftp from about (~25 Mb/s).

best wishes
Achim

Revision history for this message
dm (dmitry-shaposhnik) wrote :

Can't compile under 2.6.22:

root@phantom:~/tmp/r8168-8.001.00# make modules
make -C src/ modules
make[1]: Entering directory `/home/dm/tmp/r8168-8.001.00/src'
make -C /lib/modules/2.6.22-7-generic/build SUBDIRS=/home/dm/tmp/r8168-8.001.00/src modules
make[2]: Entering directory `/usr/src/linux-headers-2.6.22-7-generic'
  CC [M] /home/dm/tmp/r8168-8.001.00/src/r8168_n.o
/home/dm/tmp/r8168-8.001.00/src/r8168_n.c: In function ‘rtl8168_tso_csum’:
/home/dm/tmp/r8168-8.001.00/src/r8168_n.c:2424: error: ‘struct sk_buff’ has no member named ‘nh’
/home/dm/tmp/r8168-8.001.00/src/r8168_n.c: In function ‘rtl8168_init_module’:
/home/dm/tmp/r8168-8.001.00/src/r8168_n.c:3123: warning: implicit declaration of function ‘pci_module_init’
make[3]: *** [/home/dm/tmp/r8168-8.001.00/src/r8168_n.o] Error 1
make[2]: *** [_module_/home/dm/tmp/r8168-8.001.00/src] Error 2
make[2]: Leaving directory `/usr/src/linux-headers-2.6.22-7-generic'
make[1]: *** [modules] Error 2
make[1]: Leaving directory `/home/dm/tmp/r8168-8.001.00/src'
make: *** [modules] Error 2

Will try to hack it.

Revision history for this message
Achim (ach1m) wrote :

I tested this driver only with Linux kernel 2.6.20-16-generic.

Revision history for this message
dm (dmitry-shaposhnik) wrote :

I compiled this driver. At last.
But there is no luck - after 3-5 seconds after starting transmission of file (790MB) interface holds. It logs link status if I'm trying to plug out/in network cable. But can't transmit/receive anything until ifdown/ifup eth1 (form me this is Ethernet card).

There was added two fixes:
# diff ./r8168_n.c ./r8168_n1.c
2424c2424
< const struct iphdr *ip = skb->nh.iph;
---
> const struct iphdr *ip = ip_hdr(skb);
3123c3123
< return pci_module_init(&rtl8168_pci_driver);
---
> return pci_register_driver(&rtl8168_pci_driver);

only after that it compiles well and modinsert-s as it used to.

Revision history for this message
Slavisa (slavisa) wrote :

I have to confirm this problem. Same Hardware (Asus a6tc). The problem only exists when using X. It is not related to the ubuntu kernel, but exists also for non ubuntu kernels including 2.6.21. It seems that the NIC-module (R8169) crashes. Due to IRQ-sharing of NIC and graphics adaptor the whole system does not respond when NIC crashes.

Is there any way to assign a different IRQ to the NIC without disabling ACPI?

Unbelivable anoying bug! I would be very gratefull if anybody could solve it, as this bug makes the computer nearly unusable in a networking environment.

I guess that the following solution-approaches should be examined:

1. Let kernel assign different IRQ (I have no idea if this is possible, as ACPI assigns IRQs - maybe there is someone who is more experienced with this certain topic)
2. Check whether NIC-module crashes or graphics adaptor crashes
3. Repair guilty module (again I'm not enough familiar with linux kernel-modules hacking to do it myself)
4. I'm not sure whether this problem is related to usage of Nvidia proprietary graphics driver - should be checked

I would love to help by testing and reporting. Just ask, but somebody please take this bug serious.

Revision history for this message
Achim (ach1m) wrote :

hi,

I have no ASUS Notebook as you can see in my system specs but the same NIC.
My system does never freeze, I only get slow filetransfer speed.

I have tested the latest kernel-driver from realtek ftp://209.216.61.149/cn/nic/r8168-8.002.00.tar.bz2 which performs to 99% well.
I only get slow upload speed, if I transfer many small files over smbfs/cifs.

Have you tried to install the latest bios from Asus (version BIOS 0806 from 2007/05/07) maybe this helps.

to 4.
I think it is not related to the proprietary nvidia driver. You could test the nv driver to see if it changes anything.

My System
OS: Ubuntu Feisty 7.04 final.
Mainboard: MSI P965 Platinum
NIC: RTL8111 PCI Express Gigabit (onboard)

Revision history for this message
Slavisa (slavisa) wrote :

Am Sonntag, den 15.07.2007, 07:58 +0000 schrieb Achim:
> hi,
>
> I have no ASUS Notebook as you can see in my system specs but the same NIC.
> My system does never freeze, I only get slow filetransfer speed.
Which X.Org driver you are you using?
>
> I have tested the latest kernel-driver from realtek ftp://209.216.61.149/cn/nic/r8168-8.002.00.tar.bz2 which performs to 99% well.
> I only get slow upload speed, if I transfer many small files over smbfs/cifs.
But even with the origianal kernel module you had no freezes? So did
this realtek driver you mentioned help anything?
>
> Have you tried to install the latest bios from Asus (version BIOS 0806
> from 2007/05/07) maybe this helps.
Of course. I'm looking for BIOS-updates every second day :-) I doubt
whether Asus will produce new updates, as windows works well.
>
> to 4.
> I think it is not related to the proprietary nvidia driver. You could test the nv driver to see if it changes anything.
>
I will check that later this day

Greetings,
Slavisa

Revision history for this message
Achim (ach1m) wrote :

> Which X.Org driver you are you using?
>
At the moment I use NVIDIA driver version 100.14.11.
>
> But even with the origianal kernel module you had no freezes?
>
Yes I have no freezes.
>
> So did this realtek driver you mentioned help anything?
>
Yes the realtek driver is much faster than the original kernel module (r8169).
I use the realtek module (r8168) at the moment.
In earlier days the realtek kernel module and the original kernel module were so bad that I had to buy a Intel network card, to get my work done.
The realtek module is making progress.
But as it seems to me the original kernel module isn't making any progress.

I tested Gutsy tribe 2 with the nv driver and couldn't see any difference to the proprietary nvidia driver.
Network speed was still slow.

Have you tried to contact ASUS to get some help, because it seems to me that the freezes are related to the ASUS Laptop.
I read some threads in ubuntuforums.org and couldn't remember that someone had so heave problems.
I only read from slow transfer speeds.

best wishes
Achim

Revision history for this message
Launchpad Janitor (janitor) wrote : This bug is now reported against the 'linux' package

Beginning with the Hardy Heron 8.04 development cycle, all open Ubuntu kernel bugs need to be reported against the "linux" kernel package. We are automatically migrating this bug to the new "linux" package. However, development has already began for the upcoming Intrepid Ibex 8.10 release. It would be helpful if you could test the upcoming release and verify if this is still an issue - http://www.ubuntu.com/testing . If the issue still exists, please update this report by changing the Status of the "linux" task from "Incomplete" to "New". We appreciate your patience and understanding as we make this transition. Thanks!

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

The Ubuntu Kernel Team is planning to move to the 2.6.27 kernel for the upcoming Intrepid Ibex 8.10 release. As a result, the kernel team would appreciate it if you could please test this newer 2.6.27 Ubuntu kernel. There are one of two ways you should be able to test:

1) If you are comfortable installing packages on your own, the linux-image-2.6.27-* package is currently available for you to install and test.

--or--

2) The upcoming Alpha5 for Intrepid Ibex 8.10 will contain this newer 2.6.27 Ubuntu kernel. Alpha5 is set to be released Thursday Sept 4. Please watch http://www.ubuntu.com/testing for Alpha5 to be announced. You should then be able to test via a LiveCD.

Please let us know immediately if this newer 2.6.27 kernel resolves the bug reported here or if the issue remains. More importantly, please open a new bug report for each new bug/regression introduced by the 2.6.27 kernel and tag the bug report with 'linux-2.6.27'. Also, please specifically note if the issue does or does not appear in the 2.6.26 kernel. Thanks again, we really appreicate your help and feedback.

Revision history for this message
shawnlandden (shawnlandden) wrote :

Problems with ethernet confirmed on ECS NFORCE9M-A motherboard (phenom) with latest intrepid kernels (x64 release candidate kernel, x86 most recent off synaptic)

No connection, dhclient never gets response and manual doesnt work either, one time it just randomly worked with x86 intrepid latest kernel, never worked since.

Revision history for this message
Achim (ach1m) wrote :

scientus,

as it seems you have another lan chip.

Realtek RTL8111 Gigabit Fast Ethernet Controller (Co-lay RTL8101E)

http://www.ecs.com.tw/ECSWebSite/Products/ProductsDetail.aspx?detailid=887&CategoryID=1&DetailName=Specification&MenuID=123&LanID=0

You should fill a new bug report with sufficient information. Have you looked at the Realtek website for a driver if the in kernel driver doesn't work properly? You could also ask the Realtek folks for help.

Regards
Achim

Achim (ach1m)
Changed in linux:
status: Incomplete → Invalid
Revision history for this message
Timothy Miller (theosib) wrote :

I have this same RTL8111B chip on my MSI X48 Platinum, and I have problems with it too. I've been looking around, and I have found some other discussions about this. When using this chip with either Ubuntu (Gutsy was the last version I had on there) and Gentoo, transferring a moderate amount of data (say, downloading something from the internet) will cause the chip to hang. Just the network and anything dependent on the network freezes. Otherwise, the system is fine. A reboot always clears up the problem. Fortunately, I have an Intel e1000 on there as well, but I would like to use both NICs. (I guess I'll buy a switch instead.)

Someone on a Fedora forum had some interesting comments on it (read both comments):
http://forums.fedoraforum.org/showthread.php?t=160512

Here's a kernel bug report:
https://lists.linux-foundation.org/pipermail/bugme-new/2006-October/015144.html

There's a warning here about a mobo that uses it and how it has issues with both the in-kernel driver and the Realtek driver:
http://techreport.com/forums/viewtopic.php?f=7&t=50896

This page lists the chip multiple times, each time mentioning something about issues or instability:
http://people.ubuntu.com/~brian/reports/gt5comments/linux.html

Interestingly, the Realtek URL everyone points to for getting their driver (which also tends to be flaky) appears to no longer exist. I haven't tried searching their site to see if they moved it anywhere else.

It appears that Realtek's support for this chip outside of Windows is lousy, and kernel developers have had trouble with hidden features of the chip biting them.

Revision history for this message
Achim (ach1m) wrote :

Timothy have you tried to use hardy or intrepid?

You are referring to relative old reports, the kernel bug report is from 2006!
If you have still problems with your chip in combination with hardy or intrepid you should fill a new report with sufficient information.

Maybe you have another version of that chip.

$ lspci | grep Ethernet
05:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 01)

Do you have exactly the same lan chip?
I don't have any problems with it. I transfer very big files over gigabit ethernet.

Regards
Achim

Revision history for this message
Timothy Miller (theosib) wrote :

I haven't tried Hardy or Intrepid because I switched to Gentoo on that box. (Ubuntu on the desktop, but I wanted a totally stripped system for my compute machine.)

Anyhow, here's what lspci tells me:
0a:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 01)

This is my system information:
Linux compute0 2.6.26-gentoo #1 SMP Fri Jul 18 20:58:53 EDT 2008 x86_64 Intel(R) Core(TM)2 Quad CPU Q9450 @ 2.66GHz GenuineIntel GNU/Linux

I'm sure Intrepid has a slightly newer kernel. Does Ubuntu use the FOSS driver or does it use a binary one from Realtek? I may consider switching to Ubuntu server if it can solve my problem. (I should probably also file a bug report with Gentoo.)

Revision history for this message
Achim (ach1m) wrote :

Ubuntu uses the OSS driver which is in the kernel.

You could try a liveCD/USB for testing purpose and see if the problem disappears.
You could also update your BIOS and see if that helps.

I cannot remember that my system froze because of the network chip. Maybe some other hardware in your system is producing this problem.

Regards
Achim

Revision history for this message
Timothy Miller (theosib) wrote :

I'll try the live CD. My evidence that it was the NIC is that when all I did was switch to using the Intel NIC, I no longer had any trouble. Mind you, there could be a defect in the board. But the hang I observe seems so much like the others that I don't want to call it a coincidence.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers