New Prometheus-openstack-exporter snap refresh breaks exporter
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
prometheus-node-exporter (Ubuntu) |
New
|
Undecided
|
Unassigned |
Bug Description
Edge Prometheus snap refreshed on a few managed clouds and saw them all loose communication with the exporter. The primary alert we received was the following:
HTTP CRITICAL - No data received from host
jhebden rolled beta, edge, and candidate channels back to revision 16 to fix.
recreating the bug was more difficult than expected.
Once the edge snap had been rolled back, I ran 'snap refresh' on all 4 clouds and saw the majority of alerts change to:
connect to address 127.0.0.1 and port 9183: Connection refused
I see nothing listening on port 9183 and I see multiple exporter processes running (this might be normal but not sure):
systemctl list-units|grep openstack
run-snapd-
snap-
snap-
snap-
jujud-
summary: |
- New Prometheus snap refresh brakes exporter + New Prometheus snap refresh breaks exporter |
summary: |
- New Prometheus snap refresh breaks exporter + New Prometheus-openstack-exporter snap refresh breaks exporter |
This is from log when restarting prometheus- openstack- exporter service
Dec 6 12:39:33 xxx-prometheus-1 prometheus- openstack- exporter. prometheus- openstack- exporter[ 11099]: python2: relocation error: /snap/prometheu s-openstack- exporter/ 16/lib/ x86_64- linux-gnu/ libdl.so. 2: symbol _dl_catch_error, version GLIBC_PRIVATE not defined in file libc.so.6 with link time reference -openstack- exporter. prometheus- openstack- exporter. service: Main process exited, code=exited, status=127/n/a -openstack- exporter. prometheus- openstack- exporter. service: Unit entered failed state. -openstack- exporter. prometheus- openstack- exporter. service: Failed with result 'exit-code'. -openstack- exporter. prometheus- openstack- exporter. service: Service hold-off time over, scheduling restart. openstack- exporter. prometheus- openstack- exporter. openstack- exporter. prometheus- openstack- exporter. snap[11117] : cmd.go:118: DEBUG: restarting into "/snap/ core/current/ usr/bin/ snap"
Dec 6 12:39:33 xxx-prometheus-1 systemd[1]: snap.prometheus
Dec 6 12:39:33 xxx-prometheus-1 systemd[1]: snap.prometheus
Dec 6 12:39:33 xxx-prometheus-1 systemd[1]: snap.prometheus
Dec 6 12:39:33 xxx-prometheus-1 systemd[1]: snap.prometheus
Dec 6 12:39:33 xxx-prometheus-1 systemd[1]: Stopped Service for snap application prometheus-
Dec 6 12:39:33 xxx-prometheus-1 systemd[1]: Started Service for snap application prometheus-
Dec 6 12:39:33 xxx-prometheus-1 /usr/bin/