e1000e needs updating due to 82574L keeps dropping RX packets

Reported by John Strömstedt on 2012-06-27

This bug affects 19 people
Affects: linux (Ubuntu)
Importance: Medium
Assigned to: Unassigned

Bug Description

Description: Ubuntu 12.04 LTS
Release: 12.04
The e1000e networking driver needs updating because the 82574L (and others?) keeps dropping RX packets.
Information about how to fix it can be found here (http://www.doxer.org/learn-linux/resolved-intel-e1000e-driver-bug-on-82574l-ethernet-controller-causing-network-blipping/) but including it in the default kernel or official package would be preferable.
This controller is popular in Supermicro servers.

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1018561/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Fabio Marconi (fabiomarconi) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. This bug did not have a package associated with it, which is important for ensuring that it gets looked at by the proper developers. You can learn more about finding the right package at https://wiki.ubuntu.com/Bugs/FindRightPackage . I have classified this bug as a bug in linux.

When reporting bugs in the future please use apport by using 'ubuntu-bug' and the name of the package affected. You can learn more about this functionality at https://help.ubuntu.com/community/ReportingBugs.

affects: ubuntu → linux (Ubuntu)

This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:

apport-collect 1018561

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
tags: added: precise

Cannot run "apport-collect 1018561".

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v3.5 kernel [0] (not a kernel in the daily directory) and install both the linux-image and linux-image-extra .deb packages.

Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag (only that one tag, please leave the other tags). This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text.

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream'.
Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.5-rc5-quantal/

Changed in linux (Ubuntu):
importance: Undecided → Medium
tags: added: needs-upstream-testing
Changed in linux (Ubuntu):
status: Confirmed → Incomplete

Tested 3.4.0-030400-generic #201205210521 SMP Mon May 21 09:22:02 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux and the machine is still dropping packets (RX).
Is it safe to use quantal kernels on 12.04 (safe as in: will it boot up? This is a production server, hehe)?

Johannes Hessellund (osos) wrote :

Could this patch be included in the next Precise kernel update:

http://<email address hidden>/msg10202.html

I have upgraded the kernel to 3.4.0-030400-generic from the PPA and applied the eeprom fix, but incoming packets are still being dropped. The solution linked in my first post, http://www.doxer.org/learn-linux/resolved-intel-e1000e-driver-bug-on-82574l-ethernet-controller-causing-network-blipping/, describes downloading drivers from Intel and fiddling with boot parameters.
Because the 82574L is a popular chip, I think this should be included in the default install.

Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
Wido den Hollander (wido) wrote :

I'm seeing the same issue.

We have an Intel dual NIC in an Ubuntu 12.04.1 machine with LACP bonding configured.

I tried this kernel: 3.5.0-030500-generic

Still having packets being dropped on eth0, eth1 and bond0.

Changed in linux (Ubuntu):
status: Expired → Confirmed
spbike (bill-broadley) wrote :

I'm seeing the same problem on an Ubuntu 12.10 x86-64 server: random 1 second latencies, and /sbin/ifconfig shows the RX dropped packet counter increasing with each stutter/lag.

An example ping:
100 packets transmitted, 100 received, 0% packet loss, time 98995ms
rtt min/avg/max/mdev = 0.255/28.669/1915.588/210.376 ms, pipe 2

I'm seeing a dropped packet every 30 seconds or so.
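
For anyone who wants to line up the drops with the stutters, here is a minimal sketch that samples the counter directly instead of re-running ifconfig. It assumes the kernel exposes the counter at /sys/class/net/<iface>/statistics/rx_dropped; the function names, the 30 second interval and the interface name are only illustrative and are not something anyone in this report actually ran.

```python
import time
from pathlib import Path


def read_rx_dropped(iface, root="/sys/class/net"):
    """Return the cumulative rx_dropped counter for one interface."""
    path = Path(root) / iface / "statistics" / "rx_dropped"
    return int(path.read_text().strip())


def drop_deltas(samples):
    """Convert cumulative counter readings into per-interval increases."""
    return [later - earlier for earlier, later in zip(samples, samples[1:])]


def watch(iface, interval=30.0, count=5):
    """Take `count` readings `interval` seconds apart and return the increases."""
    readings = []
    for i in range(count):
        readings.append(read_rx_dropped(iface))
        if i < count - 1:
            time.sleep(interval)
    return drop_deltas(readings)
```

Calling watch("eth0") would return a list of four increases, one per 30 second window; a run of non-zero values at roughly that spacing would match the pattern described above.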

I've tried the mentioned fixes of turning off ACPI and pcie_aspm and setting the e1000e InterruptThrottleRate, without fixing it yet.

My kernel is 3.5.0-18-generic

Guilhem Lettron (guilhem-fr) wrote :

Same here.

I upgraded to 3.7 rc6 and it resolved this problem.

http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.7-rc6-raring/

abcslayer (abcslayer) wrote :

Well, it has been over 2 years since I first hit this trouble on a Lucid installation (with a Supermicro X8xxx board) and reported it to both this site and even the Ubuntu bugsquad mailing list.
On Lucid the driver I use is 1.1.9 (1.2.x is not compatible with the kernel and cannot be compiled).

I have not updated the Ubuntu kernel image for over 1.5 years. Installing the kernel, compiling the driver again, and rebuilding the initrd image (I forgot this once and eth0 and eth1 failed completely) for every kernel update is time-consuming, and there is no guarantee that the system will be more stable than with the old kernel config.

Today I decided to upgrade the system to 12.04 LTS, but it seems that I will need to keep holding it back.

spbike (bill-broadley) wrote :

I had hoped 3.7 rc8 would fix the problem at:
  http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.7-rc8-raring/

Sadly I still get frequent drops and, occasionally, very high latencies as a result. This happens with all the workarounds I tried under Ubuntu 12.04 and Ubuntu 12.10.

I've tried pcie_aspm=off, acpi=off, e1000e.IntMode=1,1 e1000e.InterruptThrottleRate=10000,10000, and related settings. I've also tried installing the newest Intel drivers for the e1000e.

The occasional stutter/jitter is noticeable on ssh connections through the firewall as well as any video console, voice calls, and video conferencing.

Michael Danko (michael-danko) wrote :

I am using kernel 3.5.0-21. I just installed the latest driver from Intel, 2.2.14-NAPI, and I have a SIGNIFICANTLY lower number of dropped packets. No more stutter while using SSH and, more importantly, my network applications are not hanging. There is still something going on, as I don't believe I have ever had dropped packets across the local network, but I will investigate further.

James (morris-570) wrote :

SYMPTOMS: I could not see any packet loss from ifconfig, but I, like others, saw network jitter, latency/ping spikes with continuous pings to www.google.com. Example: http://pastebin.com/J7G3fDnN
(though no packet loss with ping, just the jitter)
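
When there is no packet loss and only jitter, the spikes can be picked out of a saved ping log mechanically. The sketch below is one way to do that, assuming the usual Linux ping line format with a "time=... ms" field; the helper names and the "10x the median" threshold are arbitrary illustrative choices, not anything taken from the pastebin.

```python
import re

# Matches the round-trip time field in a typical Linux ping reply line.
TIME_RE = re.compile(r"time=([0-9.]+) ms")


def rtt_values(ping_output):
    """Extract every round-trip time (in ms) from raw ping output."""
    return [float(m.group(1)) for m in TIME_RE.finditer(ping_output)]


def spikes(rtts, factor=10.0):
    """Return (index, rtt) pairs whose RTT exceeds `factor` times the median."""
    if not rtts:
        return []
    ordered = sorted(rtts)
    median = ordered[len(ordered) // 2]  # upper median for even-length input
    return [(i, r) for i, r in enumerate(rtts) if r > factor * median]
```

Feeding it the text of a long ping run, for example spikes(rtt_values(open("ping.log").read())), lists which replies stalled, which is easier to compare between kernels than eyeballing the output.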

SOLUTION (for me):
With latest updates to 12.10 (kernel 3.5) I tried:

Latest drivers from Intel (2.2.14):
http://downloadcenter.intel.com/SearchResult.aspx?lang=eng&ProductFamily=Network+Connectivity&ProductLine=Intel%C2%AE+Desktop+Adapters&ProductProduct=Intel%C2%AE+Gigabit+CT+Desktop+Adapter

THIS DID NOT FIX IT

Then I added the boot kernel parameters:

GRUB_CMDLINE_LINUX_DEFAULT="acpi=off pcie_aspm=off e1000e.IntMode=1,1 e1000e.InterruptThrottleRate=10000,10000 text"

THIS DID NOT FIX IT
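
Before concluding that boot parameters make no difference, it may be worth confirming they actually reached the running kernel (on Ubuntu, edits to /etc/default/grub only take effect after update-grub and a reboot). A minimal sketch, assuming the parameters appear verbatim in /proc/cmdline; the helper names are hypothetical, and the list simply mirrors the parameters quoted above.

```python
# Parameters quoted in this comment (the trailing "text" option is left out).
WANTED = [
    "acpi=off",
    "pcie_aspm=off",
    "e1000e.IntMode=1,1",
    "e1000e.InterruptThrottleRate=10000,10000",
]


def read_cmdline(path="/proc/cmdline"):
    """Return the command line the running kernel was booted with."""
    with open(path) as f:
        return f.read().strip()


def missing_params(cmdline, wanted):
    """Return the wanted parameters that are absent from the command line."""
    present = set(cmdline.split())
    return [p for p in wanted if p not in present]
```

An empty list from missing_params(read_cmdline(), WANTED) means every parameter was passed to the kernel; it does not show whether the driver honoured them.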

Only after installing the latest 3.7 ubuntu kernel from here:
http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.7.9-raring/

Instructions here:

http://www.liberiangeek.net/2013/02/linux-kernel-version-3-7-9-releasedinstall-it-in-ubuntu-12-10-quantal-quetzal/

Was my problem resolved.

Manuel Irrschik (m-irrschik) wrote :

I'm having this issue here on two identical SuperMicro X9DBL-iF based servers with two Intel 82574L chips each.

Both systems are running on 12.04.2 LTS - Kernel 3.5.0-25 x86_64

We tried so far:
-) applying eeprom fix
-) adding kernel parameters (Kernel 3.5)
-) new kernel (3.8.0-25)

nothing helped, still getting packets "occasionally" dropped - e.g. stats for one system running for 2 months:

# ifconfig eth0
eth0 Link encap:Ethernet HWaddr 00:25:90:9b:34:70
          inet addr:10.1.0.2 Bcast:10.1.0.255 Mask:255.255.255.0
          inet6 addr: fe80::225:90ff:fe9b:3470/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
          RX packets:710697486 errors:0 dropped:575281 overruns:0 frame:0
          TX packets:2469906608 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:123575747734 (123.5 GB) TX bytes:3245008116186 (3.2 TB)
          Interrupt:16 Memory:dfa00000-dfa20000

575,281 dropped packets is a lot...
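
To put the counter in proportion, the sketch below pulls the RX line out of ifconfig output in the format pasted above and expresses the drops as a percentage of the RX packet count. It assumes the old net-tools "RX packets:N errors:N dropped:N" layout, and whether dropped frames are themselves included in the "RX packets" total is not something this calculation settles. The function name is illustrative.

```python
import re

# Old net-tools ifconfig layout, as seen in the output pasted in this report.
RX_RE = re.compile(r"RX packets:(\d+) errors:(\d+) dropped:(\d+)")


def rx_drop_percent(ifconfig_output):
    """Return RX dropped as a percentage of RX packets."""
    match = RX_RE.search(ifconfig_output)
    if match is None:
        raise ValueError("no 'RX packets:' line found")
    packets, _errors, dropped = (int(g) for g in match.groups())
    if packets == 0:
        return 0.0
    return 100.0 * dropped / packets
```

For the counters above this comes to roughly 0.08 % of received packets over two months, a small fraction overall but apparently concentrated in bursts that are long enough to upset DRBD and ssh.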

Symptoms are:
-) flickering DRBD-link (connection dropped, then resync and back online again)
-) ssh hangs for 1-2 secs
-) ping times > 1 sec
-) various application specific problems due to packet drops

At the moment we don't know what else we could try. How are the rest of you getting on?

sles (slesru) wrote :

I have the same problem with 12.04:

root@inetgw2:/var/log# ifconfig eth0
eth0 Link encap:Ethernet HWaddr 00:25:90:55:fe:30
          inet addr:192.168.22.254 Bcast:192.168.22.255 Mask:255.255.255.0
          inet6 addr: fe80::225:90ff:fe55:fe30/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
          RX packets:5427654 errors:0 dropped:11971 overruns:0 frame:0

The server was rebooted just an hour ago to update the kernel...
:-(
Any working solution?

Thank you!

John Strömstedt, this bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? If so, could you please test for this with the latest development release of Ubuntu? ISO images are available from http://cdimage.ubuntu.com/daily-live/current/ .

If it remains an issue, could you please run the following command in the development release from a Terminal (Applications->Accessories->Terminal), as it will automatically gather and attach updated debug information to this report:

apport-collect -p linux <replace-with-bug-number>

Also, could you please test the latest upstream kernel available following https://wiki.ubuntu.com/KernelMainlineBuilds ? It will allow additional upstream developers to examine the issue. Please do not test the daily folder, but the one all the way at the bottom. Once you've tested the upstream kernel, please comment on which kernel version specifically you tested. If this bug is fixed in the mainline kernel, please add the following tags:
kernel-fixed-upstream
kernel-fixed-upstream-VERSION-NUMBER

where VERSION-NUMBER is the version number of the kernel you tested. For example:
kernel-fixed-upstream-v3.11-rc5

This can be done by clicking on the yellow circle with a black pencil icon next to the word Tags located at the bottom of the bug description. As well, please remove the tag:
needs-upstream-testing

If the mainline kernel does not fix this bug, please add the following tags:
kernel-bug-exists-upstream
kernel-bug-exists-upstream-VERSION-NUMBER

As well, please remove the tag:
needs-upstream-testing

Once testing of the upstream kernel is complete, please mark this bug's Status as Confirmed. Please let us know your results. Thank you for your understanding.

tags: added: needs-kernel-logs
removed: dropping e1000e rx
Changed in linux (Ubuntu):
status: Confirmed → Incomplete
ChimChim (chrischris) wrote :

I have the same problems with this network driver on Ubuntu-Server 12.04 LTS.

A lot of dropped packets, ssh hangs, >1s pings, nasty drbd connections.

Kernel: 3.5.0-40-generic

Any solutions???

ChimChim (chrischris) wrote :

Solved it with the latest e1000e kernel module 2.5.4 and the Grub Commands above.

Kernel Module
http://sourceforge.net/projects/e1000/files/e1000e%20stable/

Grub options:
pcie_aspm=off e1000e.IntMode=1,1 e1000e.InterruptThrottleRate=10000,10000 acpi=off
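
After building an out-of-tree module it is easy to end up still running the driver that shipped with the kernel (for instance if the initrd was not rebuilt), so it can help to check which version is actually bound to the interface. A minimal sketch that parses the "key: value" output of `ethtool -i`; it assumes ethtool is installed, and the helper names are hypothetical.

```python
import subprocess


def parse_ethtool_info(text):
    """Parse `ethtool -i` style 'key: value' lines into a dict."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.partition(":")
        if sep:
            info[key.strip()] = value.strip()
    return info


def driver_info(iface):
    """Run `ethtool -i <iface>` and return the parsed fields."""
    result = subprocess.run(
        ["ethtool", "-i", iface], capture_output=True, text=True, check=True
    )
    return parse_ethtool_info(result.stdout)
```

With this, driver_info("eth0") returns a dict whose "driver" and "version" entries can be compared against the module version that was just installed.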

Bryce Nesbitt (bryce2) wrote :

Same issue here. Ubuntu 12.04 LTS. e1000e. Kernel 3.2.0-56-generic.

> lspci
06:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
07:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection

See comment #22 for a workaround.

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Changed in linux (Ubuntu):
status: Confirmed → Incomplete

Bryce Nesbitt, please do not mark this a duplicate, or vice versa.

Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
Bryce Nesbitt (bryce2) wrote :

This bug has not been fixed, so should not be closed.

Changed in linux (Ubuntu):
status: Expired → Confirmed

Bryce Nesbitt, thank you for your comment. So that your hardware and problem can be tracked, could you please file a new report with Ubuntu by running the following in a terminal while booted into an Ubuntu repository kernel (not a mainline one):
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Changed in linux (Ubuntu):
status: Confirmed → Incomplete
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired

I expected a patch in an update or backport, not an expiration.

Changed in linux (Ubuntu):
status: Expired → Incomplete

Jan Kellermann, thank you for your comment. So that your hardware and problem can be tracked, could you please file a new report with Ubuntu by running the following in a terminal while booted into an Ubuntu repository kernel (not a mainline one):
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs
