pcp installation is unreliable
Bug #1943184 reported by
Clark Boylan
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
devstack |
Won't Fix
|
Medium
|
Unassigned |
Bug Description
pcp (performance co-pilot) is being used to replace dstat which is no longer maintained. Unfortunately, the pcp package installations on Ubuntu are unreliable and often fail due to a timeout starting the pmlogger.service. It seems there have been bugs for this in Fedora [0] and while that bug notes a fix was upstreamed that fix does not seem to be sufficient.
You can see the occurence of these failures by querying logstash for:
message:
Based on this logstash data it seems to hit us several times a day. I noticed it because it has reset the gate queue for openstack projects twice in as many days.
Changed in devstack: | |
status: | In Progress → Triaged |
importance: | Undecided → Medium |
To post a comment you must log in.
Digging into this I've discovered that collectl is another similar tool which also has openstack support which might be useful for devstack. I'll propose a change that starts to try and use collectl as a stand in for dstat (though note they are not directly compatible so any switch should properly move things around).