diffuzzy
czkawka
 | diffuzzy | czkawka
---|---|---
Mentions | 6 | 361
Stars | 6 | 17,762
Growth | - | -
Activity | 0.0 | 7.7
Last commit | over 2 years ago | 20 days ago
Language | Shell | Rust
License | - | GNU General Public License v3.0 or later
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
diffuzzy
- File sizes are exact down to the byte, Teracopy verification still necessary?
-
The Icculus Microgrant is giving out 250 dollar grants to open source projects, please brag about your project(s) in this thread so I can see them!
diffuzzy - https://github.com/nathanshearer/diffuzzy
-
What automated tasks you created in your workplace that improved your productivity?
diffuzzy - compare multiple paths in sublinear time to verify data integrity. Sublinear comparison is very fast, so you can compare massive multi-petabyte paths quickly without reading or transferring an entire dataset.
- Which tool do you use to find duplicate files?
-
We're pushing 22PB over the internet at work.
I also wrote diffuzzy, which I needed to quickly identify failures in tools that sometimes don't work the way we expect. It can run comparisons in sublinear time without having to scan or transmit the entire dataset. It's great for catching edge cases and performing incremental transfers.
-
I'm giving out microgrants to open source projects for the third year in a row! Brag about your projects here so I can see them, big or small!
diffuzzy - Compare files or paths with an adjustable level of accuracy and speed
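The mentions above describe comparing paths in sublinear time by not reading the whole dataset. A minimal sketch of that general idea follows, assuming a simple sampling scheme: check the file sizes, then compare a fixed number of blocks at sampled offsets. This is not diffuzzy's actual implementation (diffuzzy is a shell tool); the function name `sampled_equal` and its parameters are hypothetical and chosen only for illustration.

```python
import os
import random


def sampled_equal(path_a, path_b, samples=16, block_size=4096, seed=0):
    """Compare two files by size and by a sample of blocks.

    Hypothetical helper, not part of diffuzzy. Returns False as soon as a
    difference is found. True only means no difference was seen in the
    sampled blocks, so it is a probabilistic answer, not a proof of equality.
    """
    size = os.path.getsize(path_a)
    if size != os.path.getsize(path_b):
        return False  # different sizes can never be identical
    if size == 0:
        return True  # two empty files are identical

    rng = random.Random(seed)  # fixed seed so both sides sample the same offsets
    last_start = max(size - block_size, 0)
    # Always check the first and last block, then add random offsets.
    offsets = {0, last_start}
    offsets.update(rng.randint(0, last_start) for _ in range(samples))

    with open(path_a, "rb") as fa, open(path_b, "rb") as fb:
        for offset in sorted(offsets):
            fa.seek(offset)
            fb.seek(offset)
            if fa.read(block_size) != fb.read(block_size):
                return False
    return True
```

Raising `samples` or `block_size` trades speed for accuracy: the work done depends on the number of sampled blocks rather than on the file size, which is what makes this kind of check fast on very large files and also why it can miss a small, localized corruption.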
czkawka
- Is there software to compress large but similar files?
- Merge three separate partial libraries from external USB drives
-
Tools to deduplicate files
https://github.com/qarmin/czkawka is by far the best of anything I've tried
-
fdupes: Identify or Delete Duplicate Files
I've used Czkawka (https://github.com/qarmin/czkawka) because it does Lanczos-based image duplicate detection, which makes it more practical for me.
-
AllDup suddenly taking forever to process/delete selections
Maybe it's a setting you made or the files, not sure. You can try another tool, czkawka, to see if you get better results with it.
-
Is there a file duplicate finder that works with animated jpegxl-gif?
For static images I used https://github.com/qarmin/czkawka and it works well enough, I think. But when I used it on a folder with gifs and their jxl conversions, it showed nothing. SURELY this could not be user error, rrrright?
-
PhotoPrism: Browse Your Life in Pictures
I used to use DupeGuru which has some photo-specific dupe detection where you can fuzzy match image dupes based on content: https://dupeguru.voltaicideas.net/
But I switched over to czkawka, which has a better interface for comparing files, and seems to be a bit faster: https://github.com/qarmin/czkawka
Unfortunately, neither of these are integrated into Photoprism, so you still have to do some file management outside the database before importing.
I also haven't used Photoprism extensively yet (I think it's running on one of my boxes, but I haven't gotten around to setting it up), but I did find that it wasn't really built for file-based libraries. It's a little more heavyweight, but my research shows that Nextcloud Memories might be a better choice for me (it's not the first-party Nextcloud photos app, but another one put together by the community): https://apps.nextcloud.com/apps/memories
-
Please don't post like 20 similar images to the art sites?
Czkawka can do this.
-
I'm amazed how I find anything & why I have so many dupes!
There's always the well-respected tool, Czkawka. Or, if the CLI is your thing, jdupes is a good option.
- I saw a post regarding crate to delete similar files
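Several of the mentions above are about finding exact duplicate files. A common way to do that is to group files by size first and hash only the files that share a size. The sketch below assumes that approach and uses SHA-256; it is not how czkawka, fdupes, or jdupes are implemented, and the function names `file_digest` and `find_duplicates` are hypothetical. It does not cover similar-image detection.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return sorted groups of paths under root with identical content.

    Hypothetical helper: files are grouped by size first, so only files
    that share a size with another file are ever hashed.
    """
    by_size = defaultdict(list)
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            # Skip symlinks and anything that is not a regular file.
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue  # a unique size cannot have a duplicate
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_hash.values() if len(g) > 1)
    return sorted(groups)
```

Calling `find_duplicates("/some/folder")` returns a list of lists, one per set of identical files. The sketch only reports duplicates and never deletes anything, which leaves the decision about what to remove to the user.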
What are some alternatives?
NimForUE - Nim plugin for UE5 with native performance, hot reloading and full interop that sits between C++ and Blueprints. This allows you to do common UE workflows like for example to extend any UE class in Nim and extending it again in Blueprint if you wish so without restarting the editor. The final aim is to be able to do in Nim what you can do in C++
dupeguru - Find duplicate files
mvregex
jdupes - A powerful duplicate file finder and an enhanced fork of 'fdupes'.
PHP-CRUD-API - Single file PHP script that adds a REST API to a SQL database
fdupes - FDUPES is a program for identifying or deleting duplicate files residing within specified directories.
tubesync - Syncs YouTube channels and playlists to a locally hosted media server
AntiDupl - A program to search similar and defect pictures on the disk
php-mercure - Mercure server implemented in plain PHP
PhotoPrism - AI-Powered Photos App for the Decentralized Web 🌈💎✨
datafaker - Generating fake data for the JVM (Java, Kotlin, Groovy) has never been easier!
darktable - darktable is an open source photography workflow application and raw developer