using odt2txt to diff OOo documents

Bug #655307 reported by Alexander Belchenko
This bug affects 1 person
Affects: QBzr
Status: Confirmed
Importance: Wishlist
Assigned to: Unassigned

Bug Description

odt2txt: http://stosberg.net/odt2txt/

From that site:

odt2txt

A simple converter from OpenDocument Text to plain text

odt2txt is a command-line tool which extracts the text out of OpenDocument Texts produced by OpenOffice.org, StarOffice, KOffice and others.

odt2txt can also extract text from some file formats similar to OpenDocument Text, such as OpenOffice.org XML (*.sxw), which was used by OpenOffice.org version 1.x and older StarOffice versions. To a lesser extent, odt2txt may be useful to extract content from OpenDocument spreadsheets (*.ods) and OpenDocument presentations (*.odp).

odt2txt is ...

* small
* supports multiple output encodings
* adapts to your locale
* able to substitute common characters which the output charset does not contain with ASCII look-alikes
* written in C, has few dependencies
* portable (runs on Linux, *BSD, Mac OS X, Windows, Cygwin, Solaris, HP-UX)
* licensed under GPL, version 2

It has a binary executable for Windows.

To use it in qdiff we need to:

* Add an option to qconfig to enable/disable diff for OOo documents (odt, sxw, ods, odp -- maybe with checkboxes)
* Add an option to qconfig to allow the user to specify the full path to the odt2txt executable
* In the qdiff main code, check those options, and if an OOo file is found, convert it to a temp text file; then use the lines of that temp file for the standard diff.
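
The conversion step could look roughly like the sketch below. This is not QBzr code: `run_odt2txt`, `lines_for_diff`, `diff_files` and `OOO_EXTENSIONS` are hypothetical names made up for illustration, and the set of enabled extensions and the executable path stand in for the proposed qconfig options. It assumes odt2txt prints the extracted text to stdout when given a file name, and it reads that output directly instead of going through a temp file. Decoding the output as UTF-8 is also an assumption, since odt2txt follows the locale by default.

```python
import difflib
import os
import subprocess

# Extensions named in this report; which ones are enabled would come from qconfig.
OOO_EXTENSIONS = {".odt", ".sxw", ".ods", ".odp"}


def run_odt2txt(path, exe="odt2txt"):
    """Run odt2txt on `path` and return the text it prints to stdout.

    `exe` stands in for the user-configured full path to the executable.
    """
    result = subprocess.run([exe, path], capture_output=True, check=True)
    # Assumption: treat the output as UTF-8; undecodable bytes are replaced.
    return result.stdout.decode("utf-8", errors="replace")


def lines_for_diff(path, enabled=OOO_EXTENSIONS, convert=run_odt2txt):
    """Return the lines to diff: converted text for enabled OOo files, raw text otherwise."""
    ext = os.path.splitext(path)[1].lower()
    if ext in enabled:
        return convert(path).splitlines(keepends=True)
    with open(path, encoding="utf-8", errors="replace") as f:
        return f.readlines()


def diff_files(old_path, new_path, **kwargs):
    """Unified diff of two files, converting OOo documents to text first."""
    old = lines_for_diff(old_path, **kwargs)
    new = lines_for_diff(new_path, **kwargs)
    return list(difflib.unified_diff(old, new, old_path, new_path))
```

Passing the converter in as `convert` keeps the diff logic independent of whether odt2txt is installed, so a disabled option or a missing executable can be handled in one place.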

Tags: fagus
John C Barstow (jbowtie) wrote :

See also oodiff[1], which comes with instructions for integrating it with various version control systems (unfortunately not Bazaar).

For qdiff we also might want to consider [this script][2] which uses Python scripting to invoke OpenOffice's built-in "compare documents" functionality.

[1]: http://www-verimag.imag.fr/~moy/opendocument/
[2]: http://bitbucket.org/cmcqueen1975/oodiff/src/tip/oodiff.py

Alexander Belchenko (bialix) wrote : Re: [Bug 655307] Re: using odt2txt to diff OOo documents

John C Barstow wrote:
> See also oodiff[1], which comes with instructions for integrating it in
> with various version control systems (unfortunately not bazaar)

I saw oodiff. Actually, I started from that page.

BUT oodiff is just a bash script, which is completely unusable for me as
a plain Windows user.

> For qdiff we also might want to consider [this script][2] which uses
> Python scripting to invoke OpenOffice's built-in "compare documents"
> functionality.
>
> [1]: http://www-verimag.imag.fr/~moy/opendocument/
> [2]: http://bitbucket.org/cmcqueen1975/oodiff/src/tip/oodiff.py
>

--
All the dude wanted was his rug back
