Metric | countwords | huniq
---|---|---
Mentions | 5 | 3
Stars | 4 | 229
Growth | - | -
Activity | 2.6 | 2.7
Last commit | 6 months ago | 3 months ago
Language | Rust | Rust
License | MIT License | -
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
countwords
- Are there benchmark results of current Forth implementations (interpreted & compiled)?
-
Open any file as bytes
See an example: https://github.com/kimono-koans/countwords/blob/master/rust/fast-simple/main.rs
-
I/O is no longer the bottleneck
This is truly 1978 all over again. No flame graphs, no hardware counters, no bottleneck analysis. Using these 'optimizations' for job interviews is questionable at best.
[1] https://benhoyt.com/writings/count-words/
-
Correct name for word matching problem
This might actually be interesting to you: https://benhoyt.com/writings/count-words/
-
Performance comparison: counting words in Python, C/C++, Awk, Rust, and more
In case anyone is interested, I did an optimized but much simpler Rust implementation just today[0], which is faster than the optimized implementation on my machine. No indexing into arrays of bytes, etc., no "code golf" measures.
Looks like idiomatic Rust, which I think is interesting. Shows there is more than one way to skin a cat.
[0]: https://github.com/kimono-koans/countwords/blob/master/rust/...
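For readers unfamiliar with the task these posts discuss, here is a minimal sketch of a word-frequency counter in plain, idiomatic Rust. It is not the implementation linked above; the function name `count_words` and the exact rules (lowercase everything, split on whitespace, print most frequent first) are assumptions chosen for illustration.

```rust
use std::collections::HashMap;
use std::io::{self, BufRead};

/// Hypothetical helper: count lowercased, whitespace-separated words
/// from any buffered reader and return them sorted by frequency.
fn count_words<R: BufRead>(reader: R) -> io::Result<Vec<(String, u64)>> {
    let mut counts: HashMap<String, u64> = HashMap::new();
    for line in reader.lines() {
        let line = line?; // fails on I/O errors or invalid UTF-8
        for word in line.split_whitespace() {
            *counts.entry(word.to_lowercase()).or_insert(0) += 1;
        }
    }
    let mut sorted: Vec<(String, u64)> = counts.into_iter().collect();
    // Most frequent first; ties broken alphabetically so the order is deterministic.
    sorted.sort_by(|a, b| b.1.cmp(&a.1).then_with(|| a.0.cmp(&b.0)));
    Ok(sorted)
}

fn main() -> io::Result<()> {
    let stdin = io::stdin();
    for (word, n) in count_words(stdin.lock())? {
        println!("{word} {n}");
    }
    Ok(())
}
```

This version allocates a new `String` per word and per line, so it favors readability over speed; the optimized variants in the repository presumably avoid some of that work.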
huniq
-
Zet 1.0 is out (compare to uniq and comm)
How does it compare with huniq and runiq?
-
I/O is no longer the bottleneck
`sort | uniq` is really slow for this, as it has to sort the entire input first. I use `huniq` which is way faster for this. I'm sure there are many similar options.
https://github.com/koraa/huniq
-
What’s your favorite shell one liner?
For better speed, check out https://github.com/koraa/huniq
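The comment above about `sort | uniq` having to sort the entire input first can be illustrated with a hash-based filter, which deduplicates in a single pass. The following is a rough sketch of that general idea, not huniq's actual code; the function name `unique_lines` is hypothetical, and huniq's internals may differ.

```rust
use std::collections::HashSet;
use std::io::{self, BufRead, BufWriter, Write};

/// Hypothetical helper: write each distinct line once, in first-seen order.
/// No sorting is needed; membership is checked with a hash set.
fn unique_lines<R: BufRead, W: Write>(reader: R, mut out: W) -> io::Result<()> {
    let mut seen: HashSet<String> = HashSet::new();
    for line in reader.lines() {
        let line = line?;
        if !seen.contains(&line) {
            writeln!(out, "{line}")?;
            seen.insert(line);
        }
    }
    Ok(())
}

fn main() -> io::Result<()> {
    let stdin = io::stdin();
    let stdout = io::stdout();
    // Buffer output so each line does not trigger its own write call.
    let mut out = BufWriter::new(stdout.lock());
    unique_lines(stdin.lock(), &mut out)?;
    out.flush()
}
```

Unlike `sort | uniq`, this keeps lines in the order they first appear, and memory use grows with the number of distinct lines, since the sketch stores each one in full.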
What are some alternatives?
gccontent-benchmark - Benchmarking different languages for a simple bioinformatics task (Counting the GC fraction of DNA in a FASTA file)
fzy - :mag: A simple, fast fuzzy finder for the terminal
countwords - Playing with counting word frequencies (and performance) in various languages.
RAMCloud - **No Longer Maintained** Official RAMCloud repo
robin-hood-hashing - Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20
napkin-math - Techniques and numbers for estimating system's performance from first-principles
share-file-systems - Use a Windows/OSX like GUI in the browser to share files cross OS privately. No cloud, no server, no third party.
fast-sqlite3-inserts - Some bunch of test scripts to generate a SQLite DB with 1B rows in fastest possible way
runiq - An efficient way to filter duplicate lines from input, à la uniq.