controller node hangs with zombie process
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Mirantis OpenStack |
Won't Fix
|
High
|
Pavel Boldin |
Bug Description
[root@fuel ~]# fuel --fuel-version
api: '1.0'
astute_sha: f7cda2171b0b677
auth_required: true
build_id: 2015-02-07_20-50-01
build_number: '76'
feature_groups:
- mirantis
fuellib_sha: 64f3ebe9fcbd18b
fuelmain_sha: c799e3a6d88289e
nailgun_sha: 2ef819732a3ee7a
ostf_sha: 3b57985d4d21555
production: docker
release: 6.0.1
release_versions:
2014.2-6.0.1:
VERSION:
api: '1.0'
astute_sha: f7cda2171b0b677
build_id: 2015-02-07_20-50-01
build_number: '76'
feature_
- mirantis
fuellib_sha: 64f3ebe9fcbd18b
fuelmain_sha: c799e3a6d88289e
nailgun_sha: 2ef819732a3ee7a
ostf_sha: 3b57985d4d21555
production: docker
release: 6.0.1
Baremetal,Ubuntu, HA, Neutron-
Controllers:3 Computes:96
deployment was successfull, but during rally tests controller node was hunged.
I couldn't perform any bash command:
ls -la
-bash: fork: Cannot allocate memory
I couldn't open nw ssh connection. I couldn't reach bash of the node even ipmi.
Beheviour very similar with https:/
from atop dump I can see that at 2015/02/21 10:44:01 we had zombie process, but I can't detect the process.
Also tslpi grows from Feb 21 10:43
Keystone and most of logs stopped at 10:43.
atop is here https:/
I'll upload DG asap
no longer affects: | fuel |
Changed in mos: | |
assignee: | nobody → Pavel Boldin (pboldin) |
DG: https:/ /drive. google. com/a/mirantis. com/file/ d/0Bx4ptZV1Jt7h UGlSNHBPYWhxZzg /view?usp= sharing