hck
nlpo3
Our great sponsors
hck | nlpo3 | |
---|---|---|
15 | 1 | |
679 | 30 | |
- | - | |
4.6 | 1.6 | |
12 days ago | 5 months ago | |
Rust | Rust | |
The Unlicense | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hck
- An old but good field command for printing tab separated fields from a file to stdou.t
-
What is yay situation?
hck ["hck" in community repo] - a fancier cut with regex field delimiters
-
What are your favorite Rust-powered Linux programs?
Biased because it's my tool, but I do use it every day! hck - which is like cut, but much faster and with a tidier set of features.
-
Tuc – When cut doesn’t cut it
hck - close to drop in replacement for cut that can use a regex delimiter instead of a fixed string
-
Tuc – when cut doesn’t cut it
Nice, especially the format output.
See also:
* hck (https://github.com/sstadick/hck) - close to drop in replacement for cut that can use a regex delimiter instead of a fixed string
* rcut (https://github.com/learnbyexample/regexp-cut) - my own bash+awk script, supports regexp delimiters, field reordering, negative indexing, etc
- csvlens: Command line CSV file viewer
-
Ask HN: Let's Build CheckStyle for Bash?
You might want to check out 'hck' to replace 'cut'.
https://github.com/sstadick/hck
- hck v0.6.6: > 24% performance improvements on common workloads
- Show HN: Hck – a fast and flexible cut-like tool
nlpo3
-
Thai word tokenizers benchmark: nlpo3 vs newmm
Thanathip Suntorntip Gorlph ported Korakot Chaovavanich's Thai word tokenizer - Newmm, written in Python, to Rust called nlpo3. The nlpo3 website claimed that nlpo3 is 2X faster than Newmm. I felt that Nlpo3 must be faster than this claim because in contrast to Python's Regex engine, Rust's regex runs in the linear time since it was constrained not to support looking back/ahead. Moreover, 2X faster is ambiguous.
What are some alternatives?
sd - Intuitive find & replace CLI (sed alternative)
murex - A smarter shell and scripting environment with advanced features designed for usability, safety and productivity (eg smarter DevOps tooling)
pythainlp - Thai Natural Language Processing in Python.
csvlens - Command line csv viewer
regex - An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
UNIC - UNIC: Unicode and Internationalization Crates for Rust
oso - Oso is a batteries-included framework for building authorization in your application.
ripgrep - ripgrep recursively searches directories for a regex pattern while respecting your gitignore
whatlang-rs - Natural language detection library for Rust. Try demo online: https://whatlang.org/
tuc - When cut doesn't cut it
tiktoken-rs - Ready-made tokenizer library for working with GPT and tiktoken