[Edgy] kernel panic with Realtek RTL 8168/8111

Bug #85738 reported by Mathias Kende
6
Affects Status Importance Assigned to Milestone
linux-source-2.6.17 (Ubuntu)
Won't Fix
Medium
Unassigned
linux-source-2.6.20 (Ubuntu)
Fix Released
Undecided
Unassigned

Bug Description

Hello,

I've got a P5B motherboard with a Core 2 Duo E6600 processor running Edgy Eft AMD64 (with all updates done).
The card has a Realtek RTL 8168/8111 Ethernet Network Adaptater, and I'm using SATA hard drives on the Intel ICH8 controller (I'm not using the JMicron's one). I'm using the normal generic kernel 2.6.17-11 (but the bug was already there with the 2.6.17-10 kernel). The only modification that I've done is installing the nvidia driver from their website, but I think it is not loaded when my bug happens.

From times to times I'm experiencing a Kernel Panic. I've checked my RAM with memtest without problem. Here is the Kernel output (I've copied a part of it as I've got no camera, and no log is written to the disk) :

"..." stands for hexadecimal output that I assume being of no interest.

Realtek RTL 8168/8111 Family PCI-E Gigabit Ethernet Network Adaptater
Driver version: 1.02
Release date: 2006/02/23
I/O Base: 0xB800 (I/O port)
IRQ: 177

Then there is the initialisation of the SD drives (I suppose that it is my hard drives), and then :

skb_over_panic: text: fff... len: -4 put: -4 head: ff... data: ff... tail: ff... end: ff... dev: eth0
------[cut here]------[please bite here]------
Kernel BUG at net/core/skbuff.c:94
invalid opcode: 0000 [1] SMP
CPU 0
Module linked in: [here lie a long list of module, I suppose it's juste the default ones as I have not touched to my kernel]]
Pid: 4056, comm: ifconfig Not tainted 2.6.17-11-generic #2
Rip: 0010:[<...>] <...> {skb_over_panic+92}

Then there is the machine state, I'm not sure that it is useful except for the call trace, I'm listing only the function name and not the hexadecimal adress :
Call trace: <IRQ> {r1000:r1000_interupt+459}
  handle_IRQ_event+44 __do_IRQ+188
  __do_IRQ+66 ret_from_intr+0 <EOI>
  cache_grow+779 kmem_cahce_alloc+131
  __alloc_skb+75 r1000:r1000_open+436
  dev_open+47 dev_change_flags+104
  devinet_ioctl+741 sock_ioctl+540
  do_ioctl+47 vfs_ioctl+659
  sys_ioctl+108 system_call+126
Code ...
Rip <...> {skb_over_panic+92} RSP <...>
<0> Kernel panic - not syncing: Aiee, killing interrupt handler!

If there is not enough information let me now and i'll try to find a camera or something.

I've got the impression that the bug appen when the eth0 driver is loaded before the SD drives, but I'm not sure that it is the real reason. The interesting point is that when this bug happens, if I just reboot the computer by pressing the reboot button, the bug re-happens nearly always. I need to completely shut it down for a while before rebooting it, in order for the system to boot properly. So I suppose that it is maybe one of the component (the Ethernet adaptater I suppose) that is left in an improper state and that is causing the problem.

Revision history for this message
Cristian Aravena Romero (caravena) wrote :

Thanks for taking the time to report this bug. Unfortunately we can't fix it, because your description didn't include enough information.

Please include the following additional information, if you have not already done so (please pay attention to lspci's additional options), as required by the Ubuntu Kernel Team:
1. Please include the output of the command "uname -a" in your next response. It should be one, long line of text which includes the exact kernel version you're running, as well as the CPU architecture.
2. Please run the command "dmesg > dmesg.log" and attach the resulting file "dmesg.log" to this bug report.
3. Please run the command "lspci -vvnn > lspci-vvnn.log" and attach the resulting file "lspci-vvnn.log" to this bug report.

For your reference, the full description of procedures for kernel-related bug reports is available here: <http://wiki.ubuntu.com/DebuggingKernelProblems> Thanks!

Revision history for this message
Mathias Kende (mathias-kende) wrote :

Thank you for the precision, I did not know this page.

The result of the uname -a command is :

Linux MATHIAS-ENS 2.6.17-11-generic #2 SMP Thu Feb 1 18:03:05 UTC 2007 x86_64 GNU/Linux

The output of the dmesg and lspci -vvnn command are included. But no log is written when there is the bug (there is no error message in dmesg.log)

Revision history for this message
Mathias Kende (mathias-kende) wrote :
Changed in linux-source-2.6.17:
importance: Undecided → Medium
status: Unconfirmed → Confirmed
Revision history for this message
Christian Rishøj (christian-rishoj) wrote :

I can reproduce the crash on an pair of Opteron 1218 boxes interconnected with an 8168-based Gigabit crossover simply by scp'ing a collection of large files. The receiving side crashes after no more than 10 seconds. Running Edgy.

uname -a:

Linux ska 2.6.17-11-server #2 SMP Thu Feb 1 18:03:56 UTC 2007 x86_64 GNU/Linux.

Revision history for this message
Christian Rishøj (christian-rishoj) wrote :
Revision history for this message
Christian Rishøj (christian-rishoj) wrote :

Just tried with the driver from RealTek: ftp://210.51.181.211/cn/nic/r1000_v1.05.tgz. Still panics.

Revision history for this message
Soren Hansen (soren) wrote :

http://bugzilla.kernel.org/show_bug.cgi?id=6807 contains a working fix. Thanks to Christian Rishøj for pointing this out.

Revision history for this message
Mathias Kende (mathias-kende) wrote :

This is maybe a fix to Christian bug, but not to the bug I reported here since it is a boot time bug.
Reading my original message, I see now that this was not very clear.

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Now that the 7.10 Gutsy Gibbon release of Ubuntu is out, we were wondering if you can still reproduce this issue. Could you please download and try the new version of Ubuntu from http://www.ubuntu.com/getubuntu/download and report back your results. If the issue is still present in the new release, please attach the following information:

* uname -a > uname-a.log
* cat /proc/version_signature > version.log
* dmesg > dmesg.log
* sudo lspci -vvnn > lspci-vvnn.log

Please be sure to attach each file as a separate attachment. Digital photos will also be accepted if you are unable to access the actual logs for things like dmesg output. For more information regarding the kernel team bug policy, please refer to https://wiki.ubuntu.com/KernelTeamBugPolicies . Thanks again and we appreciate your help and feedback.

Changed in linux-source-2.6.20:
status: New → Incomplete
Revision history for this message
Mathias Kende (mathias-kende) wrote :

I updated to 7.10 with the RC a week ago and I have not experienced this issue since, but it has never been a frequent problem.
I will report any new occurrences of this bug if they happen.

Revision history for this message
Brian Murray (brian-murray) wrote :

I am assigning this bug to the 'ubuntu-kernel-team' per their bug policy. For future reference you can learn more about their bug policy at https://wiki.ubuntu.com/KernelTeamBugPolicies .

Changed in linux-source-2.6.17:
assignee: nobody → ubuntu-kernel-team
Revision history for this message
Sandy Harris (sandyinchina) wrote :

righ at the start this says:

> I've got a P5B motherboard with a Core 2 Duo E6600 processor running Edgy Eft AMD64 (with all updates done).

Why an AMD kernel on an Intel CPU?

Revision history for this message
Mathias Kende (mathias-kende) wrote :

Because there is no such thing as an 'intel' kernel or an 'amd' one. AMD64 is the name of the architecture of the CPU. Intel use the name INTEL64 (formerly EM64T). But there is only one kernel for the two vendors.

I think that this bug should be closed if nobody is still experiencing this bug.

Revision history for this message
Leann Ogasawara (leannogasawara) wrote :

Since Mathias is the original bug reporter and has commented that this appears to have been resolved I'm closing this report. Thanks.

Changed in linux-source-2.6.20:
status: Incomplete → Fix Released
Changed in linux-source-2.6.17:
status: Confirmed → Won't Fix
Revision history for this message
Launchpad Janitor (janitor) wrote : Kernel team bugs

Per a decision made by the Ubuntu Kernel Team, bugs will longer be assigned to the ubuntu-kernel-team in Launchpad as part of the bug triage process. The ubuntu-kernel-team is being unassigned from this bug report. Refer to https://wiki.ubuntu.com/KernelTeamBugPolicies for more information. Thanks.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.