pacemaker restarts rabbitmq due 'rabbitmqctl list_channels' timed out.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Alexey Lebedeff | ||
7.0.x |
Fix Released
|
High
|
Rodion Tikunov | ||
8.0.x |
Fix Released
|
High
|
Alexey Lebedeff | ||
Future |
Invalid
|
Undecided
|
Alexey Lebedeff |
Bug Description
During boot_and_
from rally.log: http://
from mysql by instance uuid: http://
from nova-compute: http://
from haproxy by request to neutron: http://
from neutron-all on node-197: http://
Rabbitmq was stopped on node-77 and node-198. On node-198 - firstly: http://
from pacemaker.log on node-198: http://
Cluster configuration:
Baremetal,
Controllers:3 Computes:178 Copmutes+Ceph:20
api: '1.0'
astute_sha: 6c5b73f93e24cc7
auth_required: true
build_id: '301'
build_number: '301'
feature_groups:
- mirantis
fuel-agent_sha: 50e90af6e3d560e
fuel-library_sha: 5d50055aeca1dd0
fuel-nailgun-
fuel-ostf_sha: 2cd967dccd66cfc
fuelmain_sha: a65d453215edb02
nailgun_sha: 4162b0c15adb425
openstack_version: 2015.1.0-7.0
production: docker
python-
release: '7.0'
Diagnostic Snapshot: http://
Changed in fuel: | |
assignee: | nobody → MOS Nova (mos-nova) |
status: | New → Confirmed |
importance: | Undecided → High |
milestone: | none → 7.0-updates |
tags: | added: area-mos |
no longer affects: | fuel/8.0.x |
description: | updated |
Changed in fuel: | |
assignee: | MOS Oslo (mos-oslo) → Alexey Lebedeff (alebedev-a) |
Changed in fuel: | |
milestone: | 8.0 → 9.0 |
status: | Confirmed → New |
Changed in fuel: | |
status: | New → Confirmed |
Changed in fuel: | |
status: | In Progress → Fix Committed |
Changed in fuel: | |
status: | Fix Committed → Fix Released |
My and Leontiy observations: once boot_and_ delete_ server_ with_secgroups starts, RabbitMQ CPU usage raises from 300% to 1800% (visible in atop logs). Also, that is the time when 'rabbitmqctl list_channels' starts to time out.
Current plan: we don't understand what exactly causes the issue. It is either big message count or big messages passing through the RabbitMQ. We are going to implement a logging for messages sizes and reproduce the issue once more.