Comment 12 for bug 1081104

Revision history for this message
Chris Rossi (chris-archimedeanco) wrote :

Theune says what he showed you was LibreOffice, using the CLI. I'll play with it and see how it does. Seems to be a very heavy weight tool, a big process needs to load before it can do anything. So we might be looking at long extract times that would require us to use a queue and a separate thread or process to do the text extraction offline, so we don't slow down user HTTP requests. I