dedup | dupeguru | |
---|---|---|
1 | 154 | |
11 | 4,827 | |
- | - | |
0.0 | 6.7 | |
11 days ago | 2 months ago | |
Python | Python | |
MIT License | GNU General Public License v3.0 only |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dedup
-
[D] Hashing techniques to compare large datasets?
There is actually a whole family of hashing functions called locality sensitive hashing functions (LSH) that have the property that the likelihood of a hash collision is proportional to the similarity of the hashed data values. I’ve used Simhash myself for textual similarity, but LSHs can be used for finding similar images, audio, or other data types.
dupeguru
-
How to use onedrive for culling photos
Dupeguru
- Does anyone know any freeware duplicate file checkers without an upsell similar to awesome duplicate photo finder?
-
DupeGuru: Open-source, cross-platform GUI tool to find duplicate files
Posted link appears be misspelled
This appears to be actual link:
https://dupeguru.voltaicideas.net/
-
PhotoPrism: Browse Your Life in Pictures
I used to use DupeGuru which has some photo-specific dupe detection where you can fuzzy match image dupes based on content: https://dupeguru.voltaicideas.net/
But I switched over to czkawka, which has a better interface for comparing files, and seems to be a bit faster: https://github.com/qarmin/czkawka
Unfortunately, neither of these are integrated into Photoprism, so you still have to do some file management outside the database before importing.
I also haven't used Photoprism extensively yet (I think it's running on one of my boxes, but I haven't gotten around to setting it up), but I did find that it wasn't really built for file-based libraries. It's a little more heavyweight, but my research shows that Nextcloud Memories might be a better choice for me (it's not the first-party Nextcloud photos app, but another one put together by the community): https://apps.nextcloud.com/apps/memories
-
App recommendation for finding duplicate images
If your photos are exact duplicates you can also use the freeware dupeGuru app.
-
I'm amazed how I find anything & why I have so many dupes!
If you want a some other GUI options, AllDup or DupeGuru might be of interest. AllDup has a rather weird interface, though, imo.
-
Johnny Decimal
I used DupeGuru (https://dupeguru.voltaicideas.net/) in the past but I'm not sure it's the best solution for you. Try it, it's open-source.
- DupeGuru: Open-Source, cross-platform GUI software to find duplicate files
-
Diff tool for medias
dupeGuru
-
App to find duplicates across multiple external drives (Mac/Unix)
dupeGuru is a graphical application available for macOS that can scan multiple drives and identify duplicate files based on various criteria like name, size, and content https://dupeguru.voltaicideas.net/
What are some alternatives?
datasketch - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
czkawka - Multi functional app to find duplicates, empty folders, similar images etc.
LSH - Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.
AntiDupl - A program to search similar and defect pictures on the disk
coronavirus-dashboard - Dashboard for tracking Coronavirus (COVID-19) across the UK
video-simili-duplicate-cleaner
snapraid - A backup program for disk arrays. It stores parity information of your data and it recovers from up to six disk failures
rmlint - Extremely fast tool to remove duplicates and other lint from your filesystem
dduper - Fast block-level out-of-band BTRFS deduplication tool.
rsync - An open source utility that provides fast incremental file transfer. It also has useful features for backup and restore operations among many other use cases.
findimagedupes