dedupe
hastyhex
dedupe | hastyhex | |
---|---|---|
3 | 3 | |
3 | 85 | |
- | - | |
0.0 | 0.0 | |
over 6 years ago | about 2 years ago | |
Python | C | |
BSD 2-clause "Simplified" License | The Unlicense |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
dedupe
-
fdupes alternatives?
I wrote https://github.com/Gumnos/dedupe which sounds like it might be useful to you. It's faster than several of the alternatives I've found (many run the checksum across the whole of every file, this uses the file-size as a first-line discriminator, and only if the files are the same size does it go to the trouble of checking the checksum of the files). I designed it for creating hard-links in my media collection, but in the --dry-run mode, it should emit the file-names allowing you to pass it to xargs to remove them if it looks copacetic.
-
File Management via CLI
You can use my dedupe.py script with the dry-run flag (-n) to find all the duplicates on your drive. If you run it without the dry-run flag, it will attempt to make hard-links so that each file exists only once on the drive with multiple hard-links to the underlying file. It should be pretty fast, only needing to checksum file-content in the event that files have the same size (several other such deduplication methods work by checksumming every file on the drive which can be slow).
-
What tools / utilities have you written that you use regularly?
a file-deduplication utility that hard-links duplicate files to save space (our family photo gallery gets pics put in multiple albums for various audiences, so I can cut down on a lot of duplication with this)
hastyhex
-
What tools / utilities have you written that you use regularly?
hastyhex: a fast, color hex dump.
-
In 1982, a one-bit hack let me dodge a summer of filling in potholes
Tried his "hastyhex" binary to hex filter which claims to be faster than alternatives. In tests I ran, it was much faster than hexdump, and even faster than xxd, but was not faster than lesser known public domain code, which was about 2x faster.
https://github.com/skeeto/hastyhex
-
A colorized alternative to hexdump
Here's my own color hexdump project: hastyhex. It's oriented around speed, so it's about 25x faster but has fewer features.
What are some alternatives?
ripgrep-all - rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
smenu - smenu started as a lightweight and flexible terminal menu generator, but quickly evolved into a powerful and versatile CLI selection tool for interactive or scripting use.
file-arranger - Simple & capable Directory arranger/cleaner
gitstart - Gitstart automates creating a GitHub repo. The script will create .gitignore, a license.txt, a README.md file and commit with a message. It will create a remote repo and push all the files.
xonsh - :shell: Python-powered, cross-platform, Unix-gazing shell.
dotfiles - My personal dotfiles
tawk - Like awk, but using tcl as the scripting language.
ledger - Double-entry accounting system with a command-line reporting interface
mpd_what - An mpd album art and info getter
nbrowser - 🔗 🌐 : an easy way to open links in browsers, mimic the "Open URL with..." dialog on Android, `nbrowser` help you open links in a browser
tera - Interactive Bash script terminal music radio player. Play your favorite radio station, CRUD your favorite lists, and explore new radio stations from your terminal.