Comment 0 for bug 655307

Revision history for this message
Alexander Belchenko (bialix) wrote :

odt2txt: http://stosberg.net/odt2txt/

From that site:

odt2txt

A simple converter from OpenDocument Text to plain text

odt2txt is a command-line tool which extracts the text out of OpenDocument Texts produced by OpenOffice.org, StarOffice, KOffice and others.

odt2txt can also extract text from some file formats similar to OpenDocument Text, such as OpenOffice.org XML (*.sxw), which was used by OpenOffice.org version 1.x and older StarOffice versions. To a lesser extend, odt2txt may be useful to extract content from OpenDocument spreadsheets (*.ods) and OpenDocument presentations (*.odp).
odt2txt is ...
small
supports multiple output encodings
adopts to your locale
able to substitute common characters which the output charset does not contain with ascii look-a-likes
written in C, has few dependencies
portable (runs on Linux, *BSD, Mac OS X, Windows, Cygwin, Solaris, HP-UX)
licensed under GPL, version 2

It has binary executable for windows.

To use it in diff we need:

* Add option to qconfig to enable/disable diff for OOo documents (odt, sxw, ods, odp -- maybe with checkboxes)
* Add option to qconfig to allow user to specify full path to odt2txt executable
* In qdiff main code check that options and if OOo file found convert it to temp text file; then diff lines of that file for standard diff.