Comment 7 for bug 435886

Tom Haddon (mthaddon) wrote : Re: Need a way to monitor mailman via nagios

It's an interesting one, this. The logfile heartbeat helps us monitor the process to some extent in regular usage, but it doesn't let us use the check as part of a deployment. The reason is that during a deployment we need to be able to stop a service, verify it's down, then restart it with new code and verify it's up. Looking for an entry in a logfile within a certain interval tells us nothing directly about whether the process is running right now. For example, it heartbeats every minute, so we have a check that says "has it made a heartbeat entry in the logfile within the last minute". For this to work in a deployment scenario we'd have to wait at least a minute after stopping the service before we could tell it was down, and then wait another minute after restarting it to verify it's up again.
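
To make the difference concrete, here is a rough sketch of the two kinds of check, not how mailman or our nagios plugins actually do it. The function names are made up for illustration, the heartbeat check assumes the logfile's modification time is a good enough stand-in for "last heartbeat entry", and the process check assumes a POSIX system where we already know the PID (for example from a pidfile).

```python
import os
import time


def heartbeat_is_fresh(log_path, max_age_seconds=60, now=None):
    """Return True if log_path was modified within max_age_seconds.

    Assumption: the file's mtime approximates the time of the last
    heartbeat entry. A missing or unreadable file counts as stale.
    """
    now = time.time() if now is None else now
    try:
        return now - os.path.getmtime(log_path) <= max_age_seconds
    except OSError:
        return False


def process_is_running(pid):
    """Return True if a process with this PID exists right now (POSIX only).

    Signal 0 delivers nothing; the kernel only performs the existence
    and permission checks.
    """
    try:
        os.kill(pid, 0)
    except ProcessLookupError:
        return False
    except PermissionError:
        # The process exists but belongs to another user.
        return True
    return True
```

The first check can keep answering "fresh" for up to max_age_seconds after the service has stopped, which is exactly the wait described above. The second answers immediately, so a deployment script could call it straight after the stop and again straight after the restart.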