tripleo

virtualbmc processes silently die over time

Bug #1734831 reported by Steven Hardy on 2017-11-28

This bug affects 2 people

Affects		Status	Importance	Assigned to	Milestone
	tripleo	Expired	Medium	Unassigned

Bug Description

I've been seeing this for a while now, so I'm filing a bug where we can hopefully capture more evidence and figure out the root cause.

My quickstart-deployed VM development environment gradually loses vbmc processes over time, so every couple of days I need to restart the service, e.g.:

(undercloud) [stack@undercloud ~]$ ps ax | grep bmc
2321 ? Sl 4:33 /usr/bin/python2 /usr/bin/vbmc start compute_0
2522 ? Sl 4:32 /usr/bin/python2 /usr/bin/vbmc start compute_2
18168 pts/0 R+ 0:00 grep --color=auto bmc
(undercloud) [stack@undercloud ~]$ sudo systemctl restart virtualbmc
(undercloud) [stack@undercloud ~]$ ps ax | grep bmc
18223 ? Sl 0:00 /usr/bin/python2 /usr/bin/vbmc start compute_0
18233 ? Sl 0:00 /usr/bin/python2 /usr/bin/vbmc start compute_1
18241 ? Sl 0:00 /usr/bin/python2 /usr/bin/vbmc start compute_2
18250 ? Sl 0:00 /usr/bin/python2 /usr/bin/vbmc start control_0
18259 ? Sl 0:00 /usr/bin/python2 /usr/bin/vbmc start control_1
18279 ? Sl 0:00 /usr/bin/python2 /usr/bin/vbmc start control_2
18282 pts/0 S+ 0:00 grep --color=auto bmc
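
To capture evidence of which processes disappear and when, a periodic check could compare the running vbmc processes against the expected set. The following is a minimal sketch, not part of virtualbmc: it assumes the `ps ax` command column looks like the listings above, and the function names and the `EXPECTED` list (taken from the post-restart listing) are illustrative only.

```python
import re

# Matches the command column of `ps ax` lines such as
# "/usr/bin/python2 /usr/bin/vbmc start compute_0".
VBMC_START = re.compile(r"\bvbmc start (\S+)")


def running_vbmc_domains(ps_output):
    """Return the set of domain names that have a `vbmc start` process."""
    domains = set()
    for line in ps_output.splitlines():
        match = VBMC_START.search(line)
        if match:
            domains.add(match.group(1))
    return domains


def missing_vbmc_domains(ps_output, expected):
    """Return the expected domains with no vbmc process, sorted by name."""
    return sorted(set(expected) - running_vbmc_domains(ps_output))


# Sample copied from the first `ps ax | grep bmc` listing above.
SAMPLE_PS = """\
2321 ? Sl 4:33 /usr/bin/python2 /usr/bin/vbmc start compute_0
2522 ? Sl 4:32 /usr/bin/python2 /usr/bin/vbmc start compute_2
18168 pts/0 R+ 0:00 grep --color=auto bmc
"""

# Assumption: the six domains seen after the restart are the full set.
EXPECTED = ["compute_0", "compute_1", "compute_2",
            "control_0", "control_1", "control_2"]

if __name__ == "__main__":
    print(missing_vbmc_domains(SAMPLE_PS, EXPECTED))
```

Run from cron with real `ps ax` output and a timestamp, this would narrow down when each process dies, which can then be correlated with other logs.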

The system has plenty of memory and swap, so I don't think it's due to the OOM killer, and there's nothing in the logs AFAICS:

(undercloud) [stack@undercloud ~]$ free -m
              total        used        free      shared  buff/cache   available
Mem:          12015        7661        2241           0        2112        3933
Swap:          4095        1187        2908

(undercloud) [stack@undercloud ~]$ sudo cat /var/log/virtualbmc/virtualbmc.log | grep -i error
(undercloud) [stack@undercloud ~]$
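
Free memory at the time of inspection does not rule out an earlier OOM kill, so the kernel log could be checked directly. The sketch below is an assumption-laden helper, not something from the original report: it scans kernel log text (for example from `dmesg` or `journalctl -k`) for OOM-killer lines, and the assumed message wording ("Kill process" / "Killed process" followed by the PID and the process name in parentheses) may differ between kernel versions.

```python
import re

# Assumed wording of OOM-killer messages, e.g.
# "Out of memory: Kill process <pid> (<name>) score ..." and
# "Killed process <pid> (<name>) ...". Other kernels may phrase it differently.
OOM_LINE = re.compile(r"Kill(?:ed)? process (\d+) \(([^)]+)\)")


def oom_kills(log_text, name_fragment="python"):
    """Return sorted, de-duplicated (pid, name) pairs for OOM-killed
    processes whose name contains name_fragment."""
    hits = set()
    for line in log_text.splitlines():
        match = OOM_LINE.search(line)
        if match and name_fragment in match.group(2):
            hits.add((int(match.group(1)), match.group(2)))
    return sorted(hits)
```

An empty result for the vbmc PIDs would support the guess that the OOM killer is not involved; any hit would give a PID to match against the earlier `ps` listings.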

Steven Hardy (shardy) on 2017-11-28

Changed in tripleo:
status:	New → Triaged
importance:	Undecided → Medium
milestone:	none → queens-3

Emilien Macchi (emilienm) on 2018-01-26

Changed in tripleo:
milestone:	queens-3 → queens-rc1

Alex Schultz (alex-schultz) on 2018-02-20

Changed in tripleo:
milestone:	queens-rc1 → rocky-1

Alex Schultz (alex-schultz) on 2018-04-20

Changed in tripleo:
milestone:	rocky-1 → rocky-2

Emilien Macchi (emilienm) on 2018-06-05

Changed in tripleo:
milestone:	rocky-2 → rocky-3

Emilien Macchi (emilienm) on 2018-07-26

Changed in tripleo:
milestone:	rocky-3 → rocky-rc1

Emilien Macchi (emilienm) on 2018-07-26

Changed in tripleo:
milestone:	rocky-rc1 → stein-1

Juan Antonio Osorio Robles (juan-osorio-robles) on 2018-10-30

Changed in tripleo:
milestone:	stein-1 → stein-2

Steven Hardy (shardy) wrote on 2018-10-30:

I haven't seen this in a while, so I'll close this and reopen it if it happens again.

Changed in tripleo:
status:	Triaged → Incomplete
milestone:	stein-2 → none

Launchpad Janitor (janitor) wrote on 2018-12-30:

[Expired for tripleo because there has been no activity for 60 days.]

Changed in tripleo:
status:	Incomplete → Expired
