tawk
dedupe
| | tawk | dedupe |
|---|---|---|
| Mentions | 3 | 3 |
| Stars | 9 | 3 |
| Growth | - | - |
| Activity | 0.0 | 0.0 |
| Last commit | over 3 years ago | about 6 years ago |
| Language | Tcl | Python |
| License | MIT License | BSD 2-clause "Simplified" License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tawk
- Understanding AWK
  I wrote my own awk-inspired tool in part to work with non-trivial CSV files like that.
- What tools / utilities have you written that you use regularly?
  tawk, an awk-like program that uses Tcl for the script language and has a CSV parser mode for non-trivial data where just splitting on comma isn't enough to properly handle it.
- Tawk – Awk but in Tcl
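The point in the mentions above about comma-splitting not being enough for non-trivial CSV can be illustrated with a short sketch. It is written in Python with the standard `csv` module rather than in Tcl, and it does not show how tawk itself parses CSV; the sample record and the helper names `naive_fields` and `csv_fields` are made up for illustration.

```python
import csv
import io

# Hypothetical record: the second field contains a comma, the third
# contains escaped double quotes.
RECORD = '1,"Smith, Jane","said ""hi"""'


def naive_fields(record):
    """Split on every comma, ignoring CSV quoting rules."""
    return record.split(",")


def csv_fields(record):
    """Parse one record with a real CSV parser that honours quoting."""
    return next(csv.reader(io.StringIO(record)))


if __name__ == "__main__":
    print(naive_fields(RECORD))  # quoted field is torn apart at its comma
    print(csv_fields(RECORD))    # three fields, quotes resolved
```

The naive split yields four fragments with stray quote characters, while the CSV parser returns the three intended fields.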
dedupe
- fdupes alternatives?
  I wrote https://github.com/Gumnos/dedupe which sounds like it might be useful to you. It's faster than several of the alternatives I've found (many run the checksum across the whole of every file, this uses the file-size as a first-line discriminator, and only if the files are the same size does it go to the trouble of checking the checksum of the files). I designed it for creating hard-links in my media collection, but in the --dry-run mode, it should emit the file-names allowing you to pass it to xargs to remove them if it looks copacetic.
- File Management via CLI
  You can use my dedupe.py script with the dry-run flag (-n) to find all the duplicates on your drive. If you run it without the dry-run flag, it will attempt to make hard-links so that each file exists only once on the drive with multiple hard-links to the underlying file. It should be pretty fast, only needing to checksum file-content in the event that files have the same size (several other such deduplication methods work by checksumming every file on the drive which can be slow).
- What tools / utilities have you written that you use regularly?
  a file-deduplication utility that hard-links duplicate files to save space (our family photo gallery gets pics put in multiple albums for various audiences, so I can cut down on a lot of duplication with this)
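The approach described in the mentions above (group by file size first, checksum only files that share a size, then hard-link the duplicates) might look roughly like the following. This is a minimal sketch and not the code of dedupe itself: the function names, the choice of SHA-256, and the temporary-file suffix are assumptions made for illustration.

```python
import hashlib
import os
from collections import defaultdict


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()  # assumption: the source only says "checksum"
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root):
    """Return sorted groups of paths under root with identical content."""
    by_size = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue  # a unique size cannot have a duplicate: skip the checksum
        by_digest = defaultdict(list)
        for path in paths:
            by_digest[file_digest(path)].append(path)
        groups.extend(sorted(g) for g in by_digest.values() if len(g) > 1)
    return sorted(groups)


def link_duplicates(groups, dry_run=True):
    """Print each redundant path; unless dry_run, replace it with a hard link."""
    for keep, *extras in groups:
        for extra in extras:
            if os.path.samefile(keep, extra):
                continue  # already the same underlying file
            print(extra)
            if not dry_run:
                tmp = extra + ".dedupe-tmp"  # hypothetical temporary name
                os.link(keep, tmp)       # create the new link first
                os.replace(tmp, extra)   # then swap it in atomically
```

Calling `link_duplicates(find_duplicates("some/dir"))` only prints the redundant paths, and passing `dry_run=False` performs the linking. Hard links cannot cross filesystems, so this sketch assumes everything under the root lives on one volume.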
What are some alternatives?
hastyhex - A blazing fast hex dumper
ripgrep-all - rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
EgyBestCLI - A Command-Line Interface Wrapper For EgyBest
file-arranger - Simple & capable Directory arranger/cleaner
nbrowser - 🔗 🌐 : an easy way to open links in browsers, mimic the "Open URL with..." dialog on Android, `nbrowser` help you open links in a browser
xonsh - 🐚 Python-powered, cross-platform, Unix-gazing shell.
ledger - Double-entry accounting system with a command-line reporting interface
mpd_what - An mpd album art and info getter
dark-toggle - A small POSIX compliant shell script that toggles between the dark and light variants of a GTK theme.
td-cli - A todo command line todo manager ✔️
fzf - 🌸 A command-line fuzzy finder