Analyze OSF DB to estimate win by not caching extracted text
Bug #1338271 reported by
Paul Everitt
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
KARL3 |
Fix Released
|
Medium
|
Chris Rossi |
Bug Description
At PyCon, Christian was trying to analyze memory spikes and usage for the ZODB cache. We were/are having trouble getting a stable ZODB cache size on object counts. A number that is steady for weeks suddenly spikes.
Christian noted that we had some objects in cache that were way too big. Investigation showed that they had the old "extracted text" hack we did, where we keep a copy of the extracted content from HTML/Office/PDF etc. to speed up reindexing on evolves etc.
For this task, write a console script that does this same analysis, on karlstaging, and gives us an idea of the scale of the problem. Deliberately vague statement of the work, as you need to use some judgement.
tags: | added: r3.127 |
Changed in karl3: | |
status: | New → In Progress |
Changed in karl3: | |
status: | Fix Committed → Fix Released |
To post a comment you must log in.
Let's see if we can get a decent console script together to collect some facts. I made Christian and Tres nosy on this, they can dump any historical points I left out.