Metric | czkawka | fclones |
---|---|---|
Mentions | 364 | 17 |
Stars | 20,515 | 1,994 |
Growth | - | - |
Activity | 7.4 | 6.0 |
Last commit | about 2 months ago | 6 months ago |
Language | Rust | Rust |
License | GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
czkawka
-
Ask HN: How do you deduplicate files?
You want content-addressed storage; this works with rolling content hashes that identify common blocks of memory. `rsync` uses that technique to minimize bytes to be transferred. https://github.com/qarmin/czkawka is a GUI app and CLI tool to find identical files in general and similar images in particular.
The task is much simpler if you only want to find bit-identical whole files rather than parts of files. In that case, you can run a tool like `sha1sum` over each file and record the hash digest in a database: identical files will always have the same hash, and non-identical files will, with high probability, have different hashes.
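The whole-file approach described in this comment can be sketched in a few lines of Python. This is only an illustration, not how czkawka or fclones work internally: the function names `sha1_of_file` and `find_duplicates` are hypothetical, and an in-memory dictionary stands in for the database the comment mentions.

```python
import hashlib
import os
from collections import defaultdict


def sha1_of_file(path, chunk_size=1 << 20):
    """Return the SHA-1 hex digest of a file, reading it in chunks to bound memory use."""
    h = hashlib.sha1()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group regular files under `root` by digest; return only groups with 2+ paths."""
    by_digest = defaultdict(list)  # assumption: a dict replaces the "database"
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symlinks so one file is not reported as its own duplicate.
            if os.path.isfile(path) and not os.path.islink(path):
                by_digest[sha1_of_file(path)].append(path)
    return [sorted(paths) for paths in by_digest.values() if len(paths) > 1]
```

Calling `find_duplicates(".")` returns a list of path groups, one per set of files with identical content. Dedicated tools are typically faster because they avoid hashing files that cannot match, for example by comparing file sizes first.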
- Czkawka: Multi functional app to find duplicates, empty folders, similar images
-
Duperemove – Tools for deduping file systems
You might be interested in this app: https://github.com/qarmin/czkawka
- Is there software to compress large but similar files?
- Merge three separate partial libraries from external USB drives
-
Tools to deduplicate files
https://github.com/qarmin/czkawka is by far the best of anything I've tried.
-
fdupes: Identify or Delete Duplicate Files
I've used Czkawka (https://github.com/qarmin/czkawka) because it does Lanczos-based image duplicate detection, which makes it more practical for me.
-
AllDup suddenly taking forever to process/delete selections
Maybe it's a setting you changed or the files themselves; I'm not sure. You can try another program, czkawka, to see if you get better results with it.
-
Is there a file duplicate finder that works with animated jpegxl-gif?
For static images I used https://github.com/qarmin/czkawka and it works well enough, I think. But when I used it on a folder with GIFs and their JXL conversions, it showed nothing. SURELY this could not be user error, rrrright?
-
PhotoPrism: Browse Your Life in Pictures
I used to use DupeGuru which has some photo-specific dupe detection where you can fuzzy match image dupes based on content: https://dupeguru.voltaicideas.net/
But I switched over to czkawka, which has a better interface for comparing files, and seems to be a bit faster: https://github.com/qarmin/czkawka
Unfortunately, neither of these is integrated into PhotoPrism, so you still have to do some file management outside the database before importing.
I also haven't used PhotoPrism extensively yet (I think it's running on one of my boxes, but I haven't gotten around to setting it up), but I did find that it wasn't really built for file-based libraries. It's a little more heavyweight, but my research shows that Nextcloud Memories might be a better choice for me (it's not the first-party Nextcloud photos app, but another one put together by the community): https://apps.nextcloud.com/apps/memories
fclones
- Fclones: Parallel duplicate files finder in Rust with reflink support
- Fclones: Efficient Duplicate File Finder
- fdupes: Identify or Delete Duplicate Files
-
Deduplication at file level
Install fclones (a modern equivalent of fdupes)
- A duplicate file finder with GUI written using gtk4-rs and relm4
-
making duplicate files hard links, probably rdfind
I said "fclone" but meant "fclones" (https://github.com/pkolaczk/fclones); I missed the "s" at the end.
-
Use `dano` to find duplicate media files
While we're recommending stuff, fclones: https://github.com/pkolaczk/fclones
-
Oxidise Your Life
fclones: Efficient Duplicate File Finder.
-
Pip and cargo are not the same
Yes, that would be safer when available (although generally files within library dependencies are not modified I think?). It looks like fclones implements this, is faster and is written in Rust https://github.com/pkolaczk/fclones (the last is the most important point of course /s).
- Caf a rust program which filters out duplicate files
What are some alternatives?
dupeguru - Find duplicate files
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.
rdfind - find duplicate files utility
fdupes - FDUPES is a program for identifying or deleting duplicate files residing within specified directories.
rmlint - Extremely fast tool to remove duplicates and other lint from your filesystem
AntiDupl - A program to search similar and defect pictures on the disk
PhotoPrism - AI-Powered Photos App for the Decentralized Web 🌈💎✨
CompactGUI - Transparently compress active games and programs using Windows 10/11 APIs [Moved to: https://github.com/IridiumIO/CompactGUI]
darktable - darktable is an open source photography workflow application and raw developer
dupd - CLI utility to find duplicate files