RTL8102EL / Ubuntu 8.04 intermitent failure

Bug #240470 reported by Bogdan Butnaru on 2008-06-16
22
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Undecided
Unassigned

Bug Description

This is probably related to bug #223656. I just finished building a new system with the Atom-based D945GCLF motherboard, which includes a Realtek RTL8102EL Ethernet chip. (Actually, it's just the motherboard and a hard drive for now.)

I managed to install Ubuntu Hardy Server on it, with the kernel 2.6.24-19.33/amd64. (I hacked the kernel by hand into the server NetBoot image, since I don't have a CD drive in that computer. Incidentally, this means that the network card itself works perfectly when it's running, as I installed the entire system through it; but see below.)

The problem now is that sometimes the network works perfectly, but sometimes it doesn't seem to work at all: there are no messages I can see in /var/log or dmesg, ifconfig shows the network as usually, the leds light up in the network connectors, but no packet passes by. I have it setup to get an address via DHCP from my laptop (the two are connected directly via ethernet cable), and when this happened I thought perhaps my server was misconfigured. So I fired up Wireshark and it shows _nothing_ passing through that cable; when the network works, I can see everything working beautifully.

I can't figure it out at all; nothing seems to be wrong, I can't see any errors anywhere, but nothing seems to go through the cable... I can't figure out a pattern exactly either. It seems that it depends on whether I used "shutdown -r now" or "shutdown -h now" last time I shut down the Atom, but I can't decide on an exact pattern.

Any ideas where to look next? What debug info should I post?

[Tue Jun 17 00:11:46 CEST 2008 – Edited to add that I'm using the amd64 edition of Ubuntu.]

Bogdan Butnaru (bogdanb) wrote :

I just realized I didn't mention this explicitly: the kernel that comes in the installer images (the netboot ones at least) broke during boot, exactly as it's described in the bug I linked above. That's why I needed to use the -19.33 kernel.

Bob Gotthardt (dell-gotthardt) wrote :

I'll be testing 8.04.1 [AMD64] version "live-cd" later today. Not much in common [other than multi-threading]; but, I'll let you know.

Bob Gotthardt (dell-gotthardt) wrote :

No help.

Bogdan Butnaru (bogdanb) on 2008-06-16
description: updated
Oliver Grawert (ogra) wrote :

are you aware that Atom is not a 64bit architecture ?

Bogdan Butnaru (bogdanb) wrote :

No, but it seems that neither is itself, so it works:

bogdanb@tanelorn:~$ sudo lshw -class processor
  *-cpu
       description: CPU
       product: Intel(R) Atom(TM) CPU 230 @ 1.60GHz
       vendor: Intel Corp.
       physical id: 0
       bus info: cpu@0
       version: Intel(R) Atom(TM) CPU 230 @ 1.60GHz
       slot: U1PR
       size: 1600MHz
       capacity: 4GHz
       width: 64 bits
       clock: 133MHz
       capabilities: fpu fpu_exception wp vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx x86-64 constant_tsc arch_perfmon pebs bts rep_good pni monitor ds_cpl tm2 ssse3 cx16 xtpr lahf_lm
bogdanb@tanelorn:~$ uname -a
Linux tanelorn 2.6.24-19-server #1 SMP Wed Jun 18 14:44:47 UTC 2008 x86_64 GNU/Linux

Bogdan Butnaru (bogdanb) wrote :

I meant the 64-bit kernel works. The Ethernet chip is still acting funny.

DarkMace (darkmace) wrote :

I'm experiencing a likewise problem with my new Intel D945GCLF.
Installed Ubuntu Server 8.04.1 x64 without any kernel hacks or something.
After connecting the board to the router, I cannot get a connection.
IP is configured static, but neither ping nor anything else gets a connection. DHCP doesn't work either...

awe_cz (martin-nedbal) wrote :

Same here - installed Hardy on Intel Little Falls today, network card is recognized as different model (there's RTL8102EL written on the chip):

$ dmesg | grep eth0

eth0: RTL8169 at 0xffffc200004e8000, 00:1c:c0:5e:6e:bf, XID 24a00000 IRQ 2300
r8169: eth0: link up
eth0: no IPv6 routers present

$ lspci -vvn
01:00.0 0200: 10ec:8136 (rev ff) (prog-if ff)
        !!! Unknown header type 7f

$ lspci -vv
01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E PCI Express Fast Ethernet controller (rev ff) (prog-if ff)
        !!! Unknown header type 7f

$ uname -a
Linux tv 2.6.24-19-server #1 SMP Fri Jul 11 21:50:43 UTC 2008 x86_64 GNU/Linux

But - it does not work. Orange light is on, green is flashing when sending packets out - but packets are neither sent or received.

awe_cz (martin-nedbal) wrote :

I just updated BIOS to LF0099 version (from 2008/8/1) and it seem to fix the problem - let us know if it works for you too...

DarkMace (darkmace) wrote :

Recently updated BIOS to LF0099. Connected to another PC with crossover cable and made a setup with matching IP network. Still no ping from one PC to another.

Checked the information for dmesg and lspci with that given by awe_cz two posts above: except to the MAC-adress there are equal listings after the BIOS update.

Steffen Rusitschka (rusi) wrote :

Same problem here. Latest BIOS (LF0099) and Hardy kernel... On D945GCLF as well...

awe_cz (martin-nedbal) wrote :

Aha! The problem is that my systems somehow fails to initialize the card upon power-on. It start to work after consequent warm reboot. After powering it on I have:

$ lspci -vv
01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E PCI Express Fast Ethernet controller (rev ff) (prog-if ff)
        !!! Unknown header type 7f

$ reboot now

$ lspci -vv

01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E PCI Express Fast Ethernet controller (rev 02)
        Subsystem: Intel Corporation Unknown device 0001
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR-
        Latency: 0, Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 2300
        Region 0: I/O ports at 1000 [size=256]
        Region 2: Memory at 48100000 (64-bit, non-prefetchable) [size=4K]
        Region 4: Memory at 48000000 (64-bit, prefetchable) [size=64K]
        Expansion ROM at 48020000 [disabled] [size=128K]
        Capabilities: [40] Power Management version 3
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=375mA PME(D0+,D1+,D2+,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [50] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+
                Address: 00000000fee0300c Data: 4191
        Capabilities: [70] Express Endpoint IRQ 1
                Device: Supported: MaxPayload 256 bytes, PhantFunc 0, ExtTag-
                Device: Latency L0s <512ns, L1 <8us
                Device: AtnBtn- AtnInd- PwrInd-
                Device: Errors: Correctable- Non-Fatal- Fatal- Unsupported-
                Device: RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop-
                Device: MaxPayload 128 bytes, MaxReadReq 512 bytes
                Link: Supported Speed 2.5Gb/s, Width x1, ASPM L0s L1, Port 0
                Link: Latency L0s <512ns, L1 <64us
                Link: ASPM Disabled RCB 64 bytes CommClk- ExtSynch-
                Link: Speed 2.5Gb/s, Width x1
        Capabilities: [ac] MSI-X: Enable- Mask- TabSize=2
                Vector table: BAR=4 offset=00000000
                PBA: BAR=4 offset=00000800
        Capabilities: [cc] Vital Product Data

It looks like BIOS issue to me. There is a new BIOS available (LF0103P) which does not seem to have even this issue. It works for me with this new one - try it and let us know.

awe_cz (martin-nedbal) wrote :

Sorry for my overexcitement, after doing the restart loop couple of times over I'm back on "0x7f". Damn!

The Ubuntu Kernel Team is planning to move to the 2.6.27 kernel for the upcoming Intrepid Ibex 8.10 release. As a result, the kernel team would appreciate it if you could please test this newer 2.6.27 Ubuntu kernel. There are one of two ways you should be able to test:

1) If you are comfortable installing packages on your own, the linux-image-2.6.27-* package is currently available for you to install and test.

--or--

2) The upcoming Alpha5 for Intrepid Ibex 8.10 will contain this newer 2.6.27 Ubuntu kernel. Alpha5 is set to be released Thursday Sept 4. Please watch http://www.ubuntu.com/testing for Alpha5 to be announced. You should then be able to test via a LiveCD.

Please let us know immediately if this newer 2.6.27 kernel resolves the bug reported here or if the issue remains. More importantly, please open a new bug report for each new bug/regression introduced by the 2.6.27 kernel and tag the bug report with 'linux-2.6.27'. Also, please specifically note if the issue does or does not appear in the 2.6.26 kernel. Thanks again, we really appreicate your help and feedback.

DarkMace (darkmace) wrote :

Installed amd64 Ubuntu Server 8.10 Alpha 4 on my Intel D945GCLF!
Networking with DHCP worked now!!! Best regards, DarkMace

awe_cz (martin-nedbal) wrote :

I went for Interpid alpha - problem seems to be solved with 2.6.26-5

$ uname -a
Linux tv 2.6.26-5-server #1 SMP Fri Aug 15 14:32:57 UTC 2008 x86_64 GNU/Linux

Setting this to "Fix Released" based on the decision to stick with the 2.6.27 kernel for Intrepid - https://lists.ubuntu.com/archives/kernel-team/2008-September/003107.html

Changed in linux:
status: New → Fix Released
Steve Murphy (steve-murphy) wrote :

Just tried the 2.6.27-4-generic kernel, same problem, no network connection.

Also looks related to bug 225749

uname -a
Linux pc-64 2.6.27-4-generic #1 SMP Mon Sep 22 04:40:15 UTC 2008 x86_64 GNU/Linux

lspci -vv
01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E/RTL8102E PCI Express Fast Ethernet controller (rev ff) (prog-if ff)
 !!! Unknown header type 7f
 Kernel driver in use: r8169
 Kernel modules: r8169

re:Bug #240470:

Atom-based D945GCLF motherboard, which includes a Realtek RTL8102EL (optically checked on chip)

on performing a 32 bit and 64 bit systems dual boot comparison

32 bit 2.6.27-7-server RTL8102EL works

64 bit 2.6.27-7-server RTL8102EL does not work

david@atomx641:~$ uname -a
Linux atomx641 2.6.27-7-server #1 SMP Fri Oct 10 04:35:01 UTC 2008 x86_64 GNU/Linux

david@atomx641:~$ lspci -vv
01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E/RTL8102E PCI Express Fast Ethernet controller (rev ff) (prog-if ff)
 !!! Unknown header type 7f
 Kernel driver in use: r8169
 Kernel modules: r8169

If I can help in any way please just ask and I will perform any tests
or provide any diagnostics that may help!

re:Bug #240470:

Atom-based D945GCLF motherboard, which includes a Realtek RTL8102EL (optically checked on chip)

on performing a 32 bit and 64 bit systems dual boot comparison

32 bit 2.6.27-7-server RTL8102EL works

64 bit 2.6.27-7-server RTL8102EL does not work

david@atomx641:~$ uname -a
Linux atomx641 2.6.27-7-server #1 SMP Fri Oct 10 04:35:01 UTC 2008 x86_64 GNU/Linux

david@atomx641:~$ lspci -vv
01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E/RTL8102E PCI Express Fast Ethernet controller (rev ff) (prog-if ff)
 !!! Unknown header type 7f
 Kernel driver in use: r8169
 Kernel modules: r8169

If I can help in any way please just ask and I will perform any tests
or provide any diagnostics that may help!
--------------------------------------------------------------------------------
some further tests on i686 32 bit illustrating working system

david@atomx321:~$ uname -a
Linux atomx321 2.6.27-7-server #1 SMP Fri Oct 10 04:50:54 UTC 2008 i686 GNU/Linux

01:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8101E/RTL8102E PCI Express Fast Ethernet controller (rev 02)
 Subsystem: Intel Corporation Device 0001
 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
 Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
 Latency: 0, Cache Line Size: 32 bytes
 Interrupt: pin A routed to IRQ 220
 Region 0: I/O ports at 1000 [size=256]
 Region 2: Memory at 90100000 (64-bit, non-prefetchable) [size=4K]
 Region 4: Memory at 90000000 (64-bit, prefetchable) [size=64K]
 Expansion ROM at 90020000 [disabled] [size=128K]
 Capabilities: <access denied>
 Kernel driver in use: r8169
 Kernel modules: r8169

david@atomx321:~$ sudo lshw -c network
[sudo] password for david:
  *-network
       description: Ethernet interface
       product: RTL8101E/RTL8102E PCI Express Fast Ethernet controller
       vendor: Realtek Semiconductor Co., Ltd.
       physical id: 0
       bus info: pci@0000:01:00.0
       logical name: eth0
       version: 02
       serial: 00:1c:c0:47:e4:f5
       size: 100MB/s
       capacity: 1GB/s
       width: 64 bits
       clock: 33MHz
       capabilities: pm msi pciexpress msix vpd bus_master cap_list ethernet physical tp 10bt 10bt-fd 100bt 100bt-fd 1000bt-fd autonegotiation
       configuration: autonegotiation=on broadcast=yes driver=r8169 driverversion=2.3LK-NAPI duplex=full ip=192.168.1.103 latency=0 link=yes module=r8169 multicast=yes port=twisted pair speed=100MB/s

awe_cz (martin-nedbal) wrote :

...and broken again in 2.6.27-7-server;(

awe_cz (martin-nedbal) wrote :

...and broken even in today's 2.6.27-9 (64bit);(

S.R.Parda (linux-qbox) wrote :

Same problem,

Network not working on D945GCLF with Ubuntu 8.10 64bits (on a clean install).

After download all patches to 19-1-2008 with an USB adapter it doesn't work neitehr.

awe_cz (martin-nedbal) wrote :

...and broken even in last 2.6.27-11(-server) - 8.10, of course.

Barry Morrison (basketcase) wrote :

:~$ sudo lshw -c network
  *-generic
       description: Ethernet interface
       product: Illegal Vendor ID
       vendor: Illegal Vendor ID
       physical id: 0
       bus info: pci@0000:01:00.0
       logical name: eth0
       version: ff
       serial: 00:21:85:e0:4d:f3
       size: 100MB/s
       capacity: 100MB/s
       width: 32 bits
       clock: 66MHz
       capabilities: bus_master vga_palette cap_list ethernet physical tp mii 10bt 10bt-fd 100bt 100bt-fd autonegotiation
       configuration: autonegotiation=on broadcast=yes driver=r8169 driverversion=2.3LK-NAPI duplex=full latency=255 link=yes maxlatency=255 mingnt=255 module=r8169 multicast=yes port=MII speed=100MB/s
  *-network
       description: Wireless interface
       product: AR242x 802.11abg Wireless PCI Express Adapter
       vendor: Atheros Communications Inc.
       physical id: 0
       bus info: pci@0000:02:00.0
       logical name: wifi0
       version: 01
       serial: 00:1d:7d:78:2c:f8
       width: 64 bits
       clock: 33MHz
       capabilities: pm msi pciexpress msix bus_master cap_list logical ethernet physical wireless
       configuration: broadcast=yes driver=ath_pci ip=10.11.103.119 latency=0 module=ath_pci multicast=yes wireless=IEEE 802.11g
  *-network DISABLED
       description: Ethernet interface
       physical id: 1
       logical name: pan0
       serial: 46:43:46:d8:39:c6
       capabilities: ethernet physical
       configuration: broadcast=yes driver=bridge driverversion=2.3 firmware=N/A link=yes multicast=yes

___

 uname -a
Linux ubuntu-wind 2.6.27-11-generic #1 SMP Thu Jan 29 19:24:39 UTC 2009 i686 GNU/Linux

thripper (thripper) wrote :
Download full text (3.4 KiB)

Same problem here , here is some info :

up-to-date ubuntu 8.10

Linux user-laptop 2.6.27-11-generic #1 SMP Thu Jan 29 19:24:39 UTC 2009 i686 GNU/Linux

 removing the modules seems to lead to this :

[ 186.805113] ------------[ cut here ]------------
[ 186.805143] WARNING: at /build/buildd/linux-2.6.27/net/sched/sch_generic.c:219 dev_watchdog+0x21a/0x230()
[ 186.805157] NETDEV WATCHDOG: eth0 (r8169): transmit timed out
[ 186.805167] Modules linked in: ipv6 af_packet i915 drm binfmt_misc sco bridge stp bnep rfcomm l2cap bluetooth ppdev acpi_cpufreq cpufreq_conservative cpufreq_ondemand cpufreq_userspace cpufreq_stats freq_table cpufreq_powersave wmi container pci_slot sbs sbshc iptable_filter ip_tables x_tables nls_iso8859_1 nls_cp437 vfat fat parport_pc lp parport joydev snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm uvcvideo compat_ioctl32 videodev v4l1_compat snd_seq_dummy evdev video output snd_seq_oss snd_seq_midi snd_rawmidi serio_raw psmouse snd_seq_midi_event ath_pci snd_seq wlan ath_hal(P) snd_timer snd_seq_device battery button ac snd pcspkr iTCO_wdt iTCO_vendor_support soundcore intel_agp snd_page_alloc agpgart shpchp pci_hotplug ext3 jbd mbcache sd_mod crc_t10dif ata_piix sg pata_acpi usb_storage libusual usbhid hid ata_generic ahci libata scsi_mod dock ehci_hcd r8169 mii uhci_hcd usbcore thermal processor fan fbcon tileblit font bitblit softcursor fuse
[ 186.805488] Pid: 0, comm: swapper Tainted: P 2.6.27-11-generic #1
[ 186.805501] [<c0131e15>] warn_slowpath+0x65/0x90
[ 186.805523] [<c0240030>] ? get_request+0xc0/0x360
[ 186.805542] [<c012990d>] ? find_busiest_group+0x15d/0x7c0
[ 186.805560] [<c0121007>] ? enqueue_task+0x57/0x70
[ 186.805578] [<c012a28e>] ? resched_task+0x1e/0x70
[ 186.805595] [<c024d8d5>] ? __next_cpu+0x15/0x30
[ 186.805611] [<c012990d>] ? find_busiest_group+0x15d/0x7c0
[ 186.805629] [<c0118e38>] ? read_hpet+0x8/0x20
[ 186.805645] [<c014e6eb>] ? getnstimeofday+0x4b/0x100
[ 186.805664] [<c0136a26>] ? set_normalized_timespec+0x16/0x90
[ 186.805682] [<c0154437>] ? timer_stats_update_stats+0x17/0x250
[ 186.805699] [<c0254a19>] ? strlen+0x9/0x20
[ 186.805716] [<c0252a9d>] ? strlcpy+0x1d/0x60
[ 186.805732] [<c02f16a7>] ? netdev_drivername+0x37/0x40
[ 186.805749] [<c03068aa>] dev_watchdog+0x21a/0x230
[ 186.805765] [<c0130039>] ? dup_mmap+0x49/0x290
[ 186.805782] [<c0151d34>] ? clockevents_program_event+0x14/0x150
[ 186.805800] [<c014b84e>] ? ktime_get+0x1e/0x40
[ 186.805816] [<c013c038>] run_timer_softirq+0x138/0x210
[ 186.805833] [<c0306690>] ? dev_watchdog+0x0/0x230
[ 186.805850] [<c0306690>] ? dev_watchdog+0x0/0x230
[ 186.805867] [<c0137732>] __do_softirq+0x92/0x120
[ 186.805882] [<c013781d>] do_softirq+0x5d/0x60
[ 186.805896] [<c0137995>] irq_exit+0x55/0x90
[ 186.805909] [<c0113f8d>] smp_apic_timer_interrupt+0x5d/0x90
[ 186.805927] [<c01050f8>] apic_timer_interrupt+0x28/0x30
[ 186.805944] [<c01700d8>] ? __audit_ipc_set_perm+0x28/0x80
[ 186.805985] [<f8860800>] ? acpi_idle_enter_bm+0x268/0x2b7 [processor]
[ 186.806033] [<c02dcc2b>] cpuidle_idle_call+0x7b/0xd0
[ 186.806049] [<c010288d>] cpu_idle+0x7d/0x140
[ 186.806063] [<c037b471>] start_s...

Read more...

Dušan Miletić (karl3) wrote :

same here. worked just fine until ~3 weeks ago, some update must have messed it up.

Dušan Miletić (karl3) on 2009-02-18
Changed in linux:
status: Fix Released → Confirmed

Seems the latest regression is a duplicate of bug 326891. Please continue to follow up there. Thanks.

awe_cz (martin-nedbal) wrote :

Leann: I'm not sure this really a duplicate - as you can see it was not working for us even with 27-7...

To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers