XVM

XVM Nagios monitoring sucks

Bug #307357 reported by Evan Broder on 2008-12-12
4
Affects Status Importance Assigned to Milestone
XVM
Critical
Unassigned

Bug Description

We currently have an absolutely minimal Nagios setup for monitoring XVM. We should expand its configuration to monitor for specific services - the DNS server, the VNC proxy server, etc.

Duane Hinnen (duanedesign) wrote :

Thank you for taking the time to make Ubuntu better. Since what you submitted is not really a bug, or a problem, but rather an Feature Request to improve Ubuntu, you are invited to post your idea in Ubuntu Brainstorm at [WWW] https://brainstorm.ubuntu.com/ where it can be discussed, voted by the community and reviewed by developers. Thanks for taking the time to share your opinion!

Hi - um...huh? What? This is /not/ an Ubuntu bug, this is a bug with the
XVM project I'm not entirely clear why you responded to it.

- Evan

duanedesign wrote:
> Thank you for taking the time to make Ubuntu better. Since what you
> submitted is not really a bug, or a problem, but rather an Feature
> Request to improve Ubuntu, you are invited to post your idea in Ubuntu
> Brainstorm at [WWW] https://brainstorm.ubuntu.com/ where it can be
> discussed, voted by the community and reviewed by developers. Thanks for
> taking the time to share your opinion!
>
>

Evan Broder (broder) wrote :

Two of XVM's three outages were discovered by fortunate accident, when they should have been discovered by monitoring. This is a serious issue.

Changed in xvm:
importance: Undecided → Critical
status: New → Confirmed
Greg Price (gregprice) wrote :

On Sat, Dec 27, 2008 at 07:06:11AM -0000, Evan Broder wrote:
> Two of XVM's three outages were discovered by fortunate accident, when
> they should have been discovered by monitoring. This is a serious issue.

E.g., we should check that the hosts can read their disks, which is
what failed last night.

Greg

Quentin Smith (quentin-mit) wrote :

As of tonight, we are now monitoring DNS, DHCP, and VNC. Things that we still need to configure monitoring for include:

- xend
- conserver
- the RAID

All of these require running something on each machine being monitored, which means they need to run a daemon of some kind. The canonical solution is called NRPE. We should figure out what a secure configuration looks like and set it up.

To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers