contrail-api eating 100% CPU and not responding to requests

Bug #1416982 reported by Martin Gerhard Loschwitz
14
This bug affects 3 people
Affects Status Importance Assigned to Milestone
OpenContrail
Fix Committed
Undecided
Unassigned

Bug Description

The problem I have described in http://lists.opencontrail.org/pipermail/dev_lists.opencontrail.org/2015-January/001882.html happens several times a day. This renders opencontrail unusable, as neutron requests time out regularly. What would be a good way to debug this?

TLDR: Contrail will eat 100% cpu time after a while and not respond in time until it gets restarted, which resets the cycle.

Tags: config
tags: added: config
Revision history for this message
Prakash Bailkeri (prakashmb) wrote :

Looked at strace of contrail-api from Martin. Saw GET on virtual-machine-interfaces quite often

Suggested two changes:
1. patch fix from https://review.opencontrail.org/#/c/6786/. Should optimize the list operation
2. heal_instance_info_cache_interval on nova compute to 0(or to higher value than default).

Revision history for this message
Prakash Bailkeri (prakashmb) wrote :
Changed in opencontrail:
status: New → Fix Committed
Roman Rufanov (rrufanov)
tags: added: customer-found support
Revision history for this message
Andrew Woodward (xarses) wrote :

What version is this fix in?

tags: removed: customer-found support
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.