remove duplicate images

Bug #1511902 reported by sophia233
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Variety
New
Wishlist
Unassigned

Bug Description

Provide a way to dedupe image across different sources.
E.g. if an image has been downloaded from http://www.wallpapers.net
then when it will cancel fetching the same image from wallhaven.cc

One solution is to store image md5. And if new download image same as pre-existed, delete it. it could be an optional option if use wants to enable it.

sophia233 (steveca2)
tags: added: feature-request
sophia233 (steveca2)
description: updated
Revision history for this message
Iain (igray78756) wrote :

It would also be nice to permanently store these md5 signatures so that if you delete a photo, Variety can use the md5 to cancel fetching those as well.

I find myself deleting photos and then they re-download about a week later.

Revision history for this message
Peter Levi (peterlevi) wrote :

It is not as easy as it sounds. "Same" images are almost never exactly the same, so strict hashes like md5 are always different. Even if the image itself is exactly 100% the same (very very rare in practice), often there are differences in the metadata, again resulting in different file md5. Even Variety itself adds to the image metadata to know where it was downloaded from.

Special image hashes (phash, dhash, etc.) are used for finding images that are "similar", but these come with their own set of problems, e.g. ignoring colors, ignoring differences in "minimal" images with a flat background and just some minor icon in the center, etc. Calibrating these to work properly is a major pain.

Revision history for this message
Advisor (master-of-shadows) wrote :

Hi!

I create tag "duplicate" for this image: https://vrty.org/tag/duplicate

It allow remove/merge image for "hand".

James Lu (jlu5)
Changed in variety:
importance: Undecided → Wishlist
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.