keystone is down when rabbitmq in "Stopped" status

Bug #1540085 reported by Oleksiy Butenko
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Mirantis OpenStack
Status tracked in 10.0.x
10.0.x
Invalid
Medium
Unassigned
7.0.x
Won't Fix
Medium
MOS Keystone
8.0.x
Won't Fix
Medium
MOS Keystone
9.x
Won't Fix
Medium
Unassigned

Bug Description

Steps:

1. Deploy ha env with 3 controllers.
2. Destroy two controllers
3. On controller: check pcs status
( Master/Slave Set: master_p_rabbitmq-server [p_rabbitmq-server]
     Stopped: [ node-1.test.domain.local ])
3. Try get new token

Actual result:
Authorization Failed: Unable to establish connection to http://192.168.0.2:5000/v2.0/tokens

http://paste.openstack.org/show/485523/

Revision history for this message
Oleksiy Butenko (obutenko) wrote :

[root@nailgun ~]# cat /etc/fuel/8.0/version.yaml
VERSION:
  feature_groups:
    - mirantis
  production: "docker"
  release: "8.0"
  api: "1.0"
  build_number: "478"
  build_id: "478"
  fuel-nailgun_sha: "ae949905142507f2cb446071783731468f34a572"
  python-fuelclient_sha: "4f234669cfe88a9406f4e438b1e1f74f1ef484a5"
  fuel-agent_sha: "481ed135de2cb5060cac3795428625befdd1d814"
  fuel-nailgun-agent_sha: "b2bb466fd5bd92da614cdbd819d6999c510ebfb1"
  astute_sha: "b81577a5b7857c4be8748492bae1dec2fa89b446"
  fuel-library_sha: "420c6fa5f8cb51f3322d95113f783967bde9836e"
  fuel-ostf_sha: "ab5fd151fc6c1aa0b35bc2023631b1f4836ecd61"
  fuel-mirror_sha: "b62f3cce5321fd570c6589bc2684eab994c3f3f2"
  fuelmenu_sha: "fac143f4dfa75785758e72afbdc029693e94ff2b"
  shotgun_sha: "63645dea384a37dde5c01d4f8905566978e5d906"
  network-checker_sha: "9f0ba4577915ce1e77f5dc9c639a5ef66ca45896"
  fuel-upgrade_sha: "616a7490ec7199f69759e97e42f9b97dfc87e85b"
  fuelmain_sha: "6c6b088a3d52dd0eaf43d59f3a3a149c93a07e7e

Revision history for this message
Oleksiy Butenko (obutenko) wrote :

Reproduced on env with one controller:
root@node-1:~# pcs resource disable p_rabbitmq-server
root@node-1:~# rabbitmqctl stop
root@node-1:~# keystone --debug token-get

On ISO 8.0 478, ISO 7.0 301

summary: - keystone down when rabbitmq in "Stopped" status
+ keystone is down when rabbitmq in "Stopped" status
Changed in mos:
assignee: nobody → MOS Keystone (mos-keystone)
Revision history for this message
Boris Bobrov (bbobrov) wrote :

How is this a critical bug?

Revision history for this message
Oleksiy Butenko (obutenko) wrote :
Changed in mos:
status: New → Incomplete
assignee: MOS Keystone (mos-keystone) → Oleksiy Butenko (obutenko)
Revision history for this message
Roman Podoliaka (rpodolyaka) wrote :

Oleksiy, I'm not saying we shouldn't find the root cause of this, but with 3 controllers running in HA mode we can only lose 1 - if 2 controllers are lost, it's like a split-brain, i.e. our cluster will not be available, and that's *expected*.

At the same time, this does not explain why Keystone goes down ( it is not managed by Pacemaker).

I suggest we downgrade this to Medium and do the RCA, as this use case is not supported and High/Critical seem to be an overkill to me.

Revision history for this message
Timur Nurlygayanov (tnurlygayanov) wrote :

Hi Roman, I agree with you about Medium priority of this issue because it is not a blocker for customers, if RabbitMQ dies all OpenStack services will crash.

But in the same time it looks like we should check the code of Keystone and identify why it crashed with tracebacks if it can't connect to RabbitMQ (other services don't crash in this case)

Changed in mos:
importance: Critical → Medium
status: Incomplete → Confirmed
assignee: Oleksiy Butenko (obutenko) → MOS Keystone (mos-keystone)
Revision history for this message
Roman Podoliaka (rpodolyaka) wrote :

We no longer fix Medium bugs in 8.0, closing as Won't Fix

Changed in mos:
status: Confirmed → Won't Fix
Revision history for this message
Alexander Makarov (amakarov) wrote :

If keystone notifications use RabbitMQ as a transport then keystone will hang on sending the message - that's the expected behavior.

Revision history for this message
Alexander Makarov (amakarov) wrote :

What is the expected result?

Revision history for this message
Dina Belova (dbelova) wrote :

Added move-to-10.0 tag due to the fact bug was transferred from 9.0 to 10.0

tags: added: area-keystone move-to-10.0
removed: keystone
Roman Rufanov (rrufanov)
tags: added: customer-found support
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.