ceilometer connection fails trying to connect to gnocchi-vip

Bug #1748286 reported by John George
14
This bug affects 2 people
Affects Status Importance Assigned to Milestone
Gnocchi Charm
Confirmed
Critical
Unassigned
OpenStack Ceilometer Charm
New
Undecided
Unassigned

Bug Description

ERROR ceilometer ConnectFailure: Unable to establish connection to http://192.168.33.7:8041/v1/resource_type/ceph_account: HTTPConnectionPool(host='192.168.33.7', port=8041): Max retries exceeded with url: /v1/resource_type/ceph_account (Caused by NewConnectionError('<requests.packages.urllib3.connection.HTTPConnection object at 0x7fa0ec1ebd90>: Failed to establish a new connection: [Errno 113] No route to host',))

However, in the unit-hacluster-gnocchi-1.log it appears that the VIP may not actually be ready.

2018-02-08 05:59:15 DEBUG juju-log ha:57: Configuring and (maybe) restarting corosync

Revision history for this message
John George (jog) wrote :
Revision history for this message
John George (jog) wrote :
Revision history for this message
Nobuto Murata (nobuto) wrote :

LP: #1748286 and LP: #1746548 might have be related each other.

tags: added: cpe-onsite foundations-engine
Revision history for this message
Chris Gregan (cgregan) wrote :

subscribed to field high due to FEs experiencing it in field deploys

Christian Reis (kiko)
summary: - celometer connection fails trying to connect to gnocchi-vip
+ ceilometer connection fails trying to connect to gnocchi-vip
Revision history for this message
Jason Hobbs (jason-hobbs) wrote :

We think this may be related to bug 1749280 - see https://bugs.launchpad.net/charm-ceilometer/+bug/1749280/comments/4

Also maybe related to bug 1746548.

David Ames (thedac)
Changed in charm-gnocchi:
status: New → Confirmed
importance: Undecided → Critical
milestone: none → 18.02
Revision history for this message
David Ames (thedac) wrote :

While digging through the bundle I noticed we have aa-profile-mode: enforce for ceph-osd. James Page recently ran into a bug with ceph-osd and apparmor that caused gnocchi to lock up and cause all kinds of cascading failures.

I'll confer with James Page to confirm.

If anyone is able please test a deploy with aa-profile-mode disabled for ceph-osd and see if there is a difference in behavior.

All of the similar bugs may be due to this root cause. I'll do my own testing as well.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.