Script to convert pgtextindex to zope.textindex

Bug #771956 reported by Chris Rossi
Affects: KARL3
Status: Fix Released
Importance: High
Assigned to: Chris Rossi

Bug Description

We have a script that converts from the Zope text index implementation in repoze.catalog to the PostgreSQL-based implementation in repoze.pgtextindex. We also need a script that converts in the other direction in case we need to bail out.

Changed in karl3:
assignee: nobody → Chris Rossi (chris-archimedeanco)
milestone: none → m54
importance: Undecided → Medium
Paul Everitt (paul-agendaless) wrote :

Let's make M54 more about its original purpose. If you have time in M54 to work on M55 stuff, so be it.

Changed in karl3:
milestone: m54 → m55
Changed in karl3:
milestone: m55 → m57
milestone: m57 → m56
Changed in karl3:
milestone: m56 → m58
summary: - Need script to convert from repoze.pgtextindex back to
+ Script to convert pgtextindex to zope.textindex
Changed in karl3:
milestone: m58 → m59
tags: added: improved-search-engine
Chris Rossi (chris-archimedeanco) wrote :

My plan for this is to take the 'use_pgtextindex' script and turn it into a 'reindex_text' script. By default, this script would rebuild a text index of the same type already in use. If a flag (--pg or --zope) is given, it would instead build a new text index of the specified type, allowing the user to switch arbitrarily between text index types.
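A rough sketch of how the flag handling described here might look, using only the standard library. The function name `choose_index_type` and the "pg"/"zope" strings are made up for illustration; only the script name and the --pg and --zope flags come from the plan above, and the real script may be structured quite differently.

```python
import argparse


def choose_index_type(argv, current_type):
    """Return the text index type to build ("pg" or "zope").

    current_type is the type already in use; it is the default when
    neither flag is given. All names here are illustrative assumptions.
    """
    parser = argparse.ArgumentParser(
        prog="reindex_text",
        description="Rebuild the text index, optionally switching its type.",
    )
    # The two flags are mutually exclusive: passing both is an error.
    group = parser.add_mutually_exclusive_group()
    group.add_argument("--pg", dest="index_type", action="store_const",
                       const="pg", help="build a pg text index")
    group.add_argument("--zope", dest="index_type", action="store_const",
                       const="zope", help="build a zope text index")
    args = parser.parse_args(argv)
    # No flag given: rebuild an index of the type already in use.
    return args.index_type or current_type
```

With this shape, running with no flags on a site that already uses the zope index would simply rebuild a zope index, while passing --pg would switch it.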

The difficult part, such as it is, has already been written. The script already builds a new text index in parallel with the existing one so as not to interrupt normal site operations. It also indexes documents in batches using several transactions to prevent overconsumption of memory on large sites like OSF. An upshot of this approach is that the script can be interrupted and resumed.
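To illustrate the batching idea, here is a minimal sketch in plain Python, not the actual script. Plain dicts stand in for the document store and the new index, and the `commit` callable stands in for a transaction commit; `reindex_in_batches`, `batches`, and `batch_size` are hypothetical names.

```python
from itertools import islice


def batches(iterable, size):
    """Yield successive lists of at most `size` items."""
    it = iter(iterable)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def reindex_in_batches(documents, new_index, commit, batch_size=500):
    """Fill new_index from documents, committing once per batch.

    documents: mapping of docid -> text (stand-in for site content)
    new_index: mapping built alongside the live index (stand-in)
    commit:    callable run after each batch (stand-in for a
               transaction commit)
    Returns the number of batches committed.
    """
    # Skip docids an earlier, interrupted run already indexed, so the
    # job can pick up where it left off.
    pending = [docid for docid in sorted(documents) if docid not in new_index]
    committed = 0
    for batch in batches(pending, batch_size):
        for docid in batch:
            new_index[docid] = documents[docid]
        commit()  # one transaction per batch keeps memory use bounded
        committed += 1
    return committed
```

Because each batch is committed on its own, an interruption loses at most the batch in progress, and a later run only handles the documents that are still missing from the new index.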

So the only new work is just renaming the script and adding the options to create either a zope text index or a pg text index when doing the reindex. Should be a very quick job.

Paul Everitt (paul-agendaless) wrote :

Shane, I'll hand this one to you for this week. But this needs to get done. If you don't plan to do KARL work this week, please let me know.

Changed in karl3:
assignee: Chris Rossi (chris-archimedeanco) → Shane Hathaway (shane-hathawaymix)
Changed in karl3:
milestone: m59 → m60
Shane Hathaway (shane-hathawaymix) wrote :

I implemented this just the way Chris suggested, but I don't have commit privileges to the karlserve project, so I made a fork and a pull request:

https://github.com/karlproject/karlserve/pull/2

Changed in karl3:
status: New → Fix Committed
Paul Everitt (paul-agendaless) wrote :

Assigning to Chris to do the merge. I'm making this a high priority as it is the one thing that must happen before we can go live.

Changed in karl3:
assignee: Shane Hathaway (shane-hathawaymix) → Chris Rossi (chris-archimedeanco)
importance: Medium → High
status: Fix Committed → In Progress
Changed in karl3:
status: In Progress → Fix Committed
tags: added: r3.65
Changed in karl3:
status: Fix Committed → Fix Released