Activity log for bug #968021

Date Who What changed Old value New value Message
2012-03-29 07:10:55 LeForgeron bug added bug
2012-03-29 10:54:16 Daniel Hahler boinc (Ubuntu): status New Incomplete
2012-03-29 10:54:28 Daniel Hahler boinc (Ubuntu): importance Undecided Medium
2012-04-02 07:14:11 LeForgeron attachment added Stderrdae.txt, from 7.0.15, X session closed around 17:00 https://bugs.launchpad.net/ubuntu/+source/boinc/+bug/968021/+attachment/2987079/+files/stderrdae.txt
2012-04-02 07:16:08 LeForgeron attachment added stdoutdae.txt, Session closed about 17:00, no more work https://bugs.launchpad.net/ubuntu/+source/boinc/+bug/968021/+attachment/2987113/+files/stdoutdae.txt
2012-04-05 21:43:52 Gianfranco Costamagna boinc (Ubuntu): status Incomplete Confirmed
2012-04-14 18:06:01 Daniel Hahler boinc (Ubuntu): status Confirmed Triaged
2012-06-27 23:11:25 Daniel Hahler nominated for series Ubuntu Precise
2012-06-27 23:11:25 Daniel Hahler bug task added boinc (Ubuntu Precise)
2012-06-27 23:12:26 Daniel Hahler boinc (Ubuntu Precise): status New Triaged
2012-06-27 23:12:31 Daniel Hahler boinc (Ubuntu Precise): importance Undecided Medium
2012-07-12 16:38:24 Clint Byrum boinc (Ubuntu Precise): assignee Daniel Hahler (blueyed)
2012-07-18 17:09:22 Clint Byrum description There seems to be a file descriptor leaks in the boinc process (client side). After a few days of fine loading the system, it would suddenly stop working. Relaunching it is usually ok (but actively managing a system running boinc is rather not a decent solution). Clue with the following command: $ sudo lsof -p `pidof boinc` The number of open file descriptor will keep increasing as boinc tasks are completed. (more visible when the projects have fast tasks for the hardware, such as sudoku or milkyway/nvidia) A lot of entries are like: boinc 15348 boinc 623r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 624r DIR 8,1 4096 29492173 /var/lib/boinc-client/slots/13 boinc 15348 boinc 625r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 626r DIR 8,1 4096 29492084 /var/lib/boinc-client/slots/8 boinc 15348 boinc 627r DIR 8,1 4096 29492085 /var/lib/boinc-client/slots/9 boinc 15348 boinc 628r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 629r DIR 8,1 4096 29492173 /var/lib/boinc-client/slots/13 boinc 15348 boinc 630r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 632r DIR 8,1 4096 29492018 /var/lib/boinc-client/slots/2 boinc 15348 boinc 633r DIR 8,1 4096 29492040 /var/lib/boinc-client/slots/4 boinc 15348 boinc 634r DIR 8,1 4096 29492018 /var/lib/boinc-client/slots/2 boinc 15348 boinc 635r DIR 8,1 4096 29492062 /var/lib/boinc-client/slots/6 boinc 15348 boinc 636r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 ProblemType: Bug DistroRelease: Ubuntu 11.10 Package: boinc 6.12.33+dfsg-1.1ubuntu0.1 ProcVersionSignature: Ubuntu 3.0.0-17.30-generic 3.0.22 Uname: Linux 3.0.0-17-generic x86_64 NonfreeKernelModules: nvidia ApportVersion: 1.23-0ubuntu4 Architecture: amd64 Date: Thu Mar 29 08:59:09 2012 InstallationMedia: Ubuntu 11.10 "Oneiric Ocelot" - Release amd64 (20111012) PackageArchitecture: all SourcePackage: boinc UpgradeStatus: No upgrade log present (probably fresh install) == SRU Justification == Impact : when opened (oneiric), the bug was/is as described: a long time running boinc system would finally fails to compute more boinc work unit as every computed work unit leads to a leak of 1 file descriptor in the boinc main daemon (irrelevant of the kind of project subscribed). The faster the work units were processed, and the lower the limit of file descriptor in the system, the faster the bug happens (usually within one week of uninterrupted uptime, but might go to months on slower systems). This bug affects all users from Oneiric (boinc 6.12.33+dfsg-1.1ubuntu0.1) to 7.0.27 (excluded, that one is fine). (well, 7.0.23, 7.0.24 & 7.0.25 have another issue: computation error, no more leak; 7.0.26 not tested) Test case: easy, but very long: run boinc for at least one complete work unit (according to project, the unit can be 5 minutes to many hours), then use "lsof" on the boinc daemon and check the end of the listing. When more units have been processed, the list reported by "lsof" should not be longer than before. Computation of each unit must succeed. Regression Potential: I do not know the change, I cannot discuss the impact. But boinc must be able to run unattended for months without such problem, and without reboot, especially on a LTS. == Original Description == There seems to be a file descriptor leaks in the boinc process (client side). After a few days of fine loading the system, it would suddenly stop working. Relaunching it is usually ok (but actively managing a system running boinc is rather not a decent solution). Clue with the following command: $ sudo lsof -p `pidof boinc` The number of open file descriptor will keep increasing as boinc tasks are completed. (more visible when the projects have fast tasks for the hardware, such as sudoku or milkyway/nvidia) A lot of entries are like: boinc 15348 boinc 623r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 624r DIR 8,1 4096 29492173 /var/lib/boinc-client/slots/13 boinc 15348 boinc 625r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 626r DIR 8,1 4096 29492084 /var/lib/boinc-client/slots/8 boinc 15348 boinc 627r DIR 8,1 4096 29492085 /var/lib/boinc-client/slots/9 boinc 15348 boinc 628r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 629r DIR 8,1 4096 29492173 /var/lib/boinc-client/slots/13 boinc 15348 boinc 630r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 boinc 15348 boinc 632r DIR 8,1 4096 29492018 /var/lib/boinc-client/slots/2 boinc 15348 boinc 633r DIR 8,1 4096 29492040 /var/lib/boinc-client/slots/4 boinc 15348 boinc 634r DIR 8,1 4096 29492018 /var/lib/boinc-client/slots/2 boinc 15348 boinc 635r DIR 8,1 4096 29492062 /var/lib/boinc-client/slots/6 boinc 15348 boinc 636r DIR 8,1 4096 29492116 /var/lib/boinc-client/slots/12 ProblemType: Bug DistroRelease: Ubuntu 11.10 Package: boinc 6.12.33+dfsg-1.1ubuntu0.1 ProcVersionSignature: Ubuntu 3.0.0-17.30-generic 3.0.22 Uname: Linux 3.0.0-17-generic x86_64 NonfreeKernelModules: nvidia ApportVersion: 1.23-0ubuntu4 Architecture: amd64 Date: Thu Mar 29 08:59:09 2012 InstallationMedia: Ubuntu 11.10 "Oneiric Ocelot" - Release amd64 (20111012) PackageArchitecture: all SourcePackage: boinc UpgradeStatus: No upgrade log present (probably fresh install)
2012-07-18 17:17:46 Clint Byrum boinc (Ubuntu Precise): status Triaged Fix Committed
2012-07-18 17:17:48 Clint Byrum bug added subscriber Ubuntu Stable Release Updates Team
2012-07-18 17:17:51 Clint Byrum bug added subscriber SRU Verification
2012-07-18 17:17:54 Clint Byrum tags amd64 apport-bug oneiric amd64 apport-bug oneiric verification-needed
2012-07-25 09:40:32 Gianfranco Costamagna tags amd64 apport-bug oneiric verification-needed amd64 apport-bug oneiric verification-done
2012-07-25 23:23:16 Launchpad Janitor boinc (Ubuntu Precise): status Fix Committed Fix Released
2012-07-26 19:16:27 Gianfranco Costamagna boinc (Ubuntu): status Triaged Fix Released