Bug #135038 “postgrey fails regularly : “fatal: Can't call metho...” : Bugs : db4.4 package : Ubuntu

loafer (launchpad-net-iloaf) on 2007-08-27

description:

updated

Revision history for this message

loafer (launchpad-net-iloaf) wrote on 2007-08-28: Re: postgrey fails to run after ?logrotate? - stops nightly takes out postfix too.t

#1

Oh and I've got Net::Server up to date too. ;)

ii libnet-server-perl 0.94-1 An extensible, general perl server engine

Revision history for this message

Lionel Porcheron (lionel.porcheron) wrote on 2007-09-04: Re: postgrey fails to run after ?logrotate? - stops nightly takes out postfix too.

#2

Thanks for taking time to report bug in Ubuntu. With the same configuration I can not reproduce it. The time you give does not appear to be logrotate.
Do you use a out of the box configuration for postgrey or a personalised one?
Could you launch logrotate by hand (" /usr/sbin/logrotate /etc/logrotate.conf") and see if postgrey is still running after.
Thanks

Changed in postgrey:
status:	New → Incomplete

Revision history for this message

loafer (launchpad-net-iloaf) wrote on 2007-09-05:

#3

Download full text (5.8 KiB)

This is now a busy mailserver with about 100spam/min at busy times. It looks like it's not syslog related now.

The postgrey config is:
POSTGREY_OPTS="--inet=127.0.0.1:60000 --delay=40"
POSTGREY_TEXT="Server is not keen on you, see http://greylist.example.com/"

Here are the syslog entries from when it ides.

Sep 3 06:52:04 localhost postgrey[10558]: cleaning main database finished. before: 274048, after: 63815
Sep 3 06:52:04 localhost postgrey[10558]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 4 02:04:10 localhost postgrey[12464]: cleaning main database finished. before: 289400, after: 49192
Sep 4 02:04:10 localhost postgrey[12464]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 4 03:11:01 localhost postgrey[8868]: cleaning main database finished. before: 290126, after: 48884
Sep 4 03:11:01 localhost postgrey[8868]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 4 04:21:39 localhost postgrey[10136]: cleaning main database finished. before: 290793, after: 48493
Sep 4 04:21:39 localhost postgrey[10136]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 4 05:31:11 localhost postgrey[11328]: cleaning main database finished. before: 291073, after: 47722
Sep 4 05:31:11 localhost postgrey[11328]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 3 02:13:29 localhost postgrey[14053]: cleaning main database finished. before: 271150, after: 65691
Sep 3 02:13:29 localhost postgrey[14053]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 3 03:20:52 localhost postgrey[6832]: cleaning main database finished. before: 271584, after: 64917
Sep 3 03:20:52 localhost postgrey[6832]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 3 04:31:01 localhost postgrey[7976]: cleaning main database finished. before: 272450, after: 64504
Sep 3 04:31:01 localhost postgrey[7976]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 3 05:40:56 localhost postgrey[9311]: cleaning main database finished. before: 273488, after: 64383
Sep 3 05:40:56 localhost postgrey[9311]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 2 02:30:27 localhost postgrey[3926]: cleaning main database finished. before: 242519, after: 69659
Sep 2 02:30:27 localhost postgrey[3926]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 2 03:40:58 localhost postgrey[10336]: cleaning main database finished. before: 243533, after: 69331
Sep 2 03:40:58 localhost postgrey[10336]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 2 04:50:44 localhost postgrey[11620]: cleaning main database finished. before: 244632, after: 69420
Sep 2 04:50:44 localhost postgrey[11620]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep 2 06:00:5...

This is now a busy mailserver with about 100spam/min at busy times. It looks like it's not syslog related now.

The postgrey config is:
POSTGREY_OPTS="--inet=127.0.0.1:60000 --delay=40"
POSTGREY_TEXT="Server is not keen on you, see http://greylist.example.com/"

Here are the syslog entries from when it ides.

Sep  3 06:52:04 localhost postgrey[10558]: cleaning main database finished. before: 274048, after: 63815
Sep  3 06:52:04 localhost postgrey[10558]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  4 02:04:10 localhost postgrey[12464]: cleaning main database finished. before: 289400, after: 49192
Sep  4 02:04:10 localhost postgrey[12464]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  4 03:11:01 localhost postgrey[8868]: cleaning main database finished. before: 290126, after: 48884
Sep  4 03:11:01 localhost postgrey[8868]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223. 
Sep  4 04:21:39 localhost postgrey[10136]: cleaning main database finished. before: 290793, after: 48493
Sep  4 04:21:39 localhost postgrey[10136]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  4 05:31:11 localhost postgrey[11328]: cleaning main database finished. before: 291073, after: 47722
Sep  4 05:31:11 localhost postgrey[11328]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  3 02:13:29 localhost postgrey[14053]: cleaning main database finished. before: 271150, after: 65691
Sep  3 02:13:29 localhost postgrey[14053]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  3 03:20:52 localhost postgrey[6832]: cleaning main database finished. before: 271584, after: 64917
Sep  3 03:20:52 localhost postgrey[6832]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223. 
Sep  3 04:31:01 localhost postgrey[7976]: cleaning main database finished. before: 272450, after: 64504
Sep  3 04:31:01 localhost postgrey[7976]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223. 
Sep  3 05:40:56 localhost postgrey[9311]: cleaning main database finished. before: 273488, after: 64383
Sep  3 05:40:56 localhost postgrey[9311]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223. 
Sep  2 02:30:27 localhost postgrey[3926]: cleaning main database finished. before: 242519, after: 69659
Sep  2 02:30:27 localhost postgrey[3926]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223. 
Sep  2 03:40:58 localhost postgrey[10336]: cleaning main database finished. before: 243533, after: 69331
Sep  2 03:40:58 localhost postgrey[10336]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  2 04:50:44 localhost postgrey[11620]: cleaning main database finished. before: 244632, after: 69420
Sep  2 04:50:44 localhost postgrey[11620]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  2 06:00:55 localhost postgrey[12791]: cleaning main database finished. before: 245618, after: 68625
Sep  2 06:00:55 localhost postgrey[12791]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  1 02:22:03 localhost postgrey[21649]: cleaning main database finished. before: 207387, after: 58805
Sep  1 02:22:03 localhost postgrey[21649]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  1 03:31:20 localhost postgrey[14639]: cleaning main database finished. before: 208601, after: 58220
Sep  1 03:31:20 localhost postgrey[14639]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  1 04:41:26 localhost postgrey[15918]: cleaning main database finished. before: 210020, after: 58774
Sep  1 04:41:26 localhost postgrey[15918]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Sep  1 05:51:18 localhost postgrey[17204]: cleaning main database finished. before: 210987, after: 59741
Sep  1 05:51:18 localhost postgrey[17204]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Aug 31 02:48:45 localhost postgrey[12033]: cleaning main database finished. before: 174742, after: 54620
Aug 31 02:48:45 localhost postgrey[12033]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Aug 31 03:50:31 localhost postgrey[17732]: cleaning main database finished. before: 175996, after: 55874
Aug 31 03:50:31 localhost postgrey[17732]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Aug 31 05:00:32 localhost postgrey[19023]: cleaning main database finished. before: 177071, after: 56949
Aug 31 05:00:32 localhost postgrey[19023]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Aug 31 06:10:46 localhost postgrey[20385]: cleaning main database finished. before: 178708, after: 58586
Aug 31 06:10:46 localhost postgrey[20385]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.
Aug 30 02:56:14 localhost postgrey[6431]: cleaning main database finished. before: 150449, after: 59619
Aug 30 02:56:14 localhost postgrey[6431]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223. 
Aug 30 04:00:25 localhost postgrey[20402]: cleaning main database finished. before: 152087, after: 61257
Aug 30 04:00:25 localhost postgrey[20402]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.

I've a watchdog script kicking it back into life at the moment.

Changed in postgrey:
status:	Incomplete → New

Revision history for this message

Daniel Mueller (ubuntu-danm) wrote on 2007-10-01:

#4

I can confirm this behavior. It appears on my (brand-new!) Ubuntu "gutsy" server as well. (The server struggles with about 10 SMTP connections per minute). Every night between 1:00am and 7:00am postgrey dies:

Oct 1 02:44:04 x postgrey[x]: cleaning up old logs...
Oct 1 02:44:04 x postgrey[x]: cleaning up old entries...
Oct 1 02:44:04 x postgrey[x]: cleaning main database finished. before: x, after: x
Oct 1 02:44:04 x postgrey[x]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.

My server does not run any cron-daemon/logrotate at all. After a short look in postgrey's source I found this at line 160:

    [..]
    # remove old keys
    # this is very expensive: We might refuse to speak to postfix for too
    # long, after which clients will start getting "450 Server configuration
    # problem" errors... do it only during the night and only if at least one
    # day has passed
    my $hour = (localtime($now))[2];
    if($hour > 1 and $hour < 7 and
        $now - $self->{postgrey}{last_maint_keys} >= 82800)
    [..]

I would say it is an error in postgrey's do_maintenance function. The newest version of postgrey (1.31) runs very stable on my other mailhosts (OpenSuSE). I have just replaced the exiting /usr/sbin/postgrey with v.1.31 (and copied the needed configuration files to /etc/postfix). If the error occurs again it has probably something to do with the perl-db modules. I'll report any results here.

bye,
Daniel

Revision history for this message

Daniel Mueller (ubuntu-danm) wrote on 2007-10-02:

#5

Hmm, the same error with the newest version of postgrey:

Oct 2 02:11:32 x postgrey[x]: cleaning up old logs...
Oct 2 02:11:32 x postgrey[x]: cleaning up old entries...
Oct 2 02:11:32 x postgrey[x]: cleaning main database finished. before: x, after: x
Oct 2 02:11:32 x postgrey[x]: FATAL: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 289.

bye,
Daniel

Revision history for this message

podslatt (garymcm) wrote on 2007-10-02:

#6

Just wanted to say I am seeing this same error at postgrey 1.27, gutsy.

Oct 2 02:53:16 bluejay postgrey[4121]: fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223.

Revision history for this message

loafer (launchpad-net-iloaf) wrote on 2007-10-02:

#7

Thanks for your input guys. I think the importance needs raising on this bug.

Very lame-ass workaround:
*/2 * * * * sleep 10; pidof postgrey >/dev/null || /etc/init.d/postgrey start

Revision history for this message

podslatt (garymcm) wrote on 2007-10-02:

#8

I just installed monit and turned monitoring postfix and postgrey, because it takes down postfix. I agree this needs to be fixed for this reason alone.

Revision history for this message

Daniel Mueller (ubuntu-danm) wrote on 2007-10-02:

#9

Hi,

yesterday I have installed debian's libdb4.4 version 4.4.20-10 from the 'unstable' branch [1]. I read the debian bugtracker and found out that they have raised a ticket[2] against postgrey as well. Let's hope that the updated library solves the nightly postgrey-crash. I will report my results tomorrow - it's about 0:12am here and I don't want to wait till 2:00am :-)

[1] http://packages.debian.org/sid/libdb4.4
[2] http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=441152
[3] http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=441069 (with reference to Ubuntu #135038 :-)

bye,
Daniel

Revision history for this message

Daniel Mueller (ubuntu-danm) wrote on 2007-10-04:

#10

With libdb4.4 version 4.4.20-10 from debian 'unstable' postgrey is working stable for two days now.

Oct 4 02:15:55 x postgrey[x]: cleaning up old logs...
Oct 4 02:15:55 x postgrey[x]: cleaning up old entries...
Oct 4 02:15:55 x postgrey[x]: cleaning main database finished. before: x, after: x
Oct 4 02:15:55 x postgrey[x]: cleaning clients database finished. before: x, after: x
[..]
Oct 4 02:22:36 x postgrey[x]: action=pass, reason=triplet found, delay=x, client_name=x, client_address=x.x.x.x, recipient=info@xxx
[..]

I think updating Ubuntu's libdb4.4 would solve the issue.

bye,
Daniel

Matt LaPlante (cybrmatt) on 2007-10-18

Changed in postgrey:
status:	New → Confirmed

Revision history for this message

Matt LaPlante (cybrmatt) wrote on 2007-10-18:

#11

Why is it taking so long to get a response on this? postgrey is very much broken in gutsy, and Debian already addressed the problem.

Revision history for this message

Martijn Heemels (yggdrasil) wrote on 2007-10-25:

#12

I have to second Matt's remark that this is taking ages. This is a imho a critical bug since it brings down the server's mail delivery completely and regularly, yet has been open for 2 months now.

Is Daniel Mueller's suggestion of updating libdb4.4 working? If so, please release it.

Regards, Martijn

Siegfried Gevatter (rainct) on 2007-10-25

Changed in postgrey:
importance:	Undecided → Medium

Revision history for this message

Scott Kitterman (kitterman) wrote on 2007-10-25:

#13

Ironically enough, my reading of the Debian bug is that postrgrey isn't broken, it's libdb4.4. Adding libdb4.4 task.

Changed in postgrey:
importance:	Medium → Undecided
status:	Confirmed → Invalid

Siegfried Gevatter (rainct) on 2007-10-25

Changed in db4.4:
importance:	Undecided → Medium

Revision history for this message

Ronald Hummelink (ronald-hummelink) wrote on 2007-10-26:

#14

My postgrey dies 4 times each night. a random minute of each 3am, 4am, 5am and 6am.

I've been running a cronjob every minute that restarts postgrey if its down and mails me for half a week.

Machine is gutsy upgraded from feisty (and it was stable on feisty). I have little time to investigate the causes, as i'm in the process of moving.

Bug Watch Updater (bug-watch-updater) on 2007-10-27

Changed in db4.4:
status:	Unknown → Fix Released

Revision history for this message

Gustavo A. Díaz (gdiaz) wrote on 2007-11-02:

#15

Same for me, and running Gutsy as well...
I was wondering why my email server stops to send emails once per day almost... and now i discovered this bug report.. which is the same problem happening to me... and since I've upgraded to gutsy... with feisty, zero problems.

Revision history for this message

Brendan Cully (bcully) wrote on 2007-11-02:

#16

I'm me-tooing this bug to get updates.

Revision history for this message

Raoul Merki (raoul) wrote on 2007-11-05:

#17

Same with me. Brand new installed and updated Gutsy.

Implemented the "Cron Fix" described above.

Revision history for this message

Ronald Hummelink (ronald-hummelink) wrote on 2007-11-05:

#18

Installed the updated libdb4.4 package from debian/lenny. Fixes the problem (no more mails from my cronscript). Can we expect an updated package from Ubuntu any time soon?

(People following this 'advise': at your risk off course)

Daniel Hahler (blueyed) on 2007-11-05

Changed in db4.4:
assignee:	nobody → blueyed
status:	New → In Progress

Revision history for this message

Daniel Mueller (ubuntu-danm) wrote on 2007-11-05:

#19

My system runs stable since 2007-10-02 (after replacing Ubuntu's libdb4.4 package with Debian's 4.4.20-10). I am really disappointed that nobody at Canonical Ltd. feels responsible for this bug. I mean - I've installed the Gutsy server version and postfix/postgrey is not an exotic combination, is it?

If you have in mind to replace the libdb4.4 package make sure that there's no other application using that particular Berkeley DB version. Otherwise you should backup your database files, update libdb4.4, install db4.4-util, run db4.4_recover -h <your db env> and cross the fingers.

Revision history for this message

Daniel Hahler (blueyed) wrote on 2007-11-05:

#20

I'll make this a dupe of bug 153996 (which seems better for a meta-bug).
This affects subversion (with --fs-type bdb), too.

Changed in db4.4:
assignee:	blueyed → nobody
status:	In Progress → Confirmed

	Status	Importance	Assigned to
db4.4 (Debian)	Fix Released	Unknown	debbugs #441152
db4.4 (Ubuntu)	Confirmed	Medium	Unassigned
postgrey (Ubuntu)	Invalid	Undecided	Unassigned

Ubuntu
db4.4 package

postgrey fails regularly : "fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223."

Bug Description

Other bug subscribers

Remote bug watches

Ubuntudb4.4 package

postgrey fails regularly : "fatal: Can't call method "txn_commit" on an undefined value at /usr/sbin/postgrey line 223."

Bug Description

Other bug subscribers

Remote bug watches

Ubuntu
db4.4 package