Comment 3 for bug 666392

Fabian Keil (fk-ux) wrote :

For discs like this, I slapped together a Perl script that takes the output of isoinfo -i and classifies each file as either a "master file" or a duplicate.
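
To illustrate the idea, here is a rough Python sketch of such a classification. It is not the Perl script itself. It assumes the isoinfo listing has already been parsed into (name, start sector, size) tuples, and that "obvious" duplicates are files that share a start sector and size. The function name classify and the example file names are made up for illustration.

```python
def classify(entries):
    """Split directory entries into master files and duplicates.

    entries: iterable of (name, start_sector, size) tuples, assumed to
    have been parsed from an ISO directory listing beforehand.
    Returns (masters, duplicates): a list of master file names in the
    order they were seen, and a dict mapping each duplicate name to
    the name of its master file.
    """
    first_seen = {}   # (start_sector, size) -> name of the master file
    duplicates = {}   # duplicate name -> master name
    for name, start, size in entries:
        key = (start, size)
        if key in first_seen:
            # Same extent as an earlier file: treat it as a duplicate.
            duplicates[name] = first_seen[key]
        else:
            # First file seen at this extent becomes the master.
            first_seen[key] = name
    return list(first_seen.values()), duplicates


if __name__ == "__main__":
    # Hypothetical listing: two titles pointing at the same extent.
    listing = [
        ("VTS_01_1.VOB", 300, 1000),
        ("VTS_02_1.VOB", 300, 1000),
        ("VTS_03_1.VOB", 900, 500),
    ]
    masters, dups = classify(listing)
    print("masters:", masters)
    print("duplicates:", dups)
```

Which file of a group counts as the master is arbitrary here (the first one listed); a real tool might prefer a different rule.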

Based on that classification, it can either deduplicate an existing copy by replacing the duplicates with links to the "master files" (as Gregory already suggested above) or generate a shell script that does the same but can be adjusted before it is executed.
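
As a sketch of the second mode, the following Python snippet turns a duplicate-to-master mapping into a shell script that can be reviewed before running. It is only an assumption of what such a generator might look like: the function name dedup_script, the VIDEO_TS default directory, and the choice of hard links (ln -f) rather than symlinks are all illustrative and not taken from the actual script.

```python
import shlex


def dedup_script(duplicates, directory="VIDEO_TS"):
    """Return the text of a shell script that replaces duplicates with links.

    duplicates: dict mapping duplicate file name -> master file name.
    directory:  directory of the existing copy (hypothetical default).
    """
    lines = [
        "#!/bin/sh",
        "set -e",  # stop at the first failing command
        "cd " + shlex.quote(directory),
    ]
    for dup, master in sorted(duplicates.items()):
        # ln -f replaces the existing duplicate with a hard link to the master.
        lines.append("ln -f " + shlex.quote(master) + " " + shlex.quote(dup))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical mapping; print the script so it can be edited first.
    print(dedup_script({"VTS_02_1.VOB": "VTS_01_1.VOB"}), end="")
```

Writing the commands to a script instead of executing them directly keeps the destructive step under manual control, which matters when the classification might be wrong.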

The deduplication logic currently only works for discs where the duplicates are obvious. I suspect it is possible to master DVDs in a way that makes them less obvious, but so far I haven't seen such a disc.

My goal is to eventually rip only the master files. A dvdbackup option to rip only files specified by name (or start sector) would make this possible, and I intend to work on this (or another solution) the next time I want to rip such a disc.

The duplicate detection could be integrated into dvdbackup itself, but I haven't looked into how much work that would be.

The skipping and freezing issues Bryce mentioned are probably symptoms of dvdbackup padding blocks that shouldn't be padded, and are not directly connected to the duplicated files. Using read error mode "b" could help, but the rip might take a while.