countwords vs huniq

countwords

Playing with counting word frequencies (and performance) in various languages. (by kimono-koans)

huniq

Filter out duplicates on the command line. Replacement for `sort | uniq` optimized for speed (10x faster) when sorting is not needed. (by koraa)

CLI Rust Tools

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

countwords		huniq
	Project
5	Mentions	3
4	Stars	229
-	Growth	-
2.6	Activity	2.7
6 months ago	Latest Commit	3 months ago
Rust	Language	Rust
MIT License	License	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

countwords

Posts with mentions or reviews of countwords. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-11-26.

Are there benchmark results of current Forth implementations (interpreted & compiled)?
1 project | /r/Forth | 4 Jun 2023
Open any file as bytes
1 project | /r/rust | 4 Mar 2023

See an example: https://github.com/kimono-koans/countwords/blob/master/rust/fast-simple/main.rs
I/O is no longer the bottleneck
10 projects | news.ycombinator.com | 26 Nov 2022

this is truly 1978 all over again. No flame graphs, no hardware counters no bottleneck analysis. Using these 'optimizations' for job interviews is questionable at best.
[1] https://benhoyt.com/writings/count-words/
Correct name for word matching problem
2 projects | /r/algorithms | 13 Oct 2022

This might actually be interesting to you: https://benhoyt.com/writings/count-words/
Performance comparison: counting words in Python, C/C++, Awk, Rust, and more
12 projects | news.ycombinator.com | 24 Jul 2022

In case anyone is interested, I did an optimized, but much more simple, Rust implementation just today[0], which is faster than the optimized implementation on my machine. No indexing into arrays of bytes, etc., no "code golf" measures.
Looks like idiomatic Rust, which I think is interesting. Shows there is more than one way to skin a cat.
[0]: https://github.com/kimono-koans/countwords/blob/master/rust/...

huniq

Posts with mentions or reviews of huniq. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-18.

Zet 1.0 is out (compare to uniq and comm)
4 projects | /r/rust | 18 Apr 2023

How does it compare with huniq and runiq?
I/O is no longer the bottleneck
10 projects | news.ycombinator.com | 26 Nov 2022

`sort | uniq` is really slow for this, as it has to sort the entire input first. I use `huniq` which is way faster for this. I'm sure there are many similar options.
https://github.com/koraa/huniq
What’s your favorite shell one liner?
4 projects | /r/commandline | 17 Feb 2022

For better speed, check out https://github.com/koraa/huniq

What are some alternatives?

When comparing countwords and huniq you can also consider the following projects:

gccontent-benchmark - Benchmarking different languages for a simple bioinformatics task (Counting the GC fraction of DNA in a FASTA file)

fzy - :mag: A simple, fast fuzzy finder for the terminal

countwords - Playing with counting word frequencies (and performance) in various languages.

RAMCloud - **No Longer Maintained** Official RAMCloud repo

countwords - Playing with counting word frequencies (and performance) in various languages.

repo

robin-hood-hashing - Fast & memory efficient hashtable based on robin hood hashing for C++11/14/17/20

napkin-math - Techniques and numbers for estimating system's performance from first-principles

countwords - Playing with counting word frequencies (and performance) in various languages.

share-file-systems - Use a Windows/OSX like GUI in the browser to share files cross OS privately. No cloud, no server, no third party.

fast-sqlite3-inserts - Some bunch of test scripts to generate a SQLite DB with 1B rows in fastest possible way

runiq - An efficient way to filter duplicate lines from input, à la uniq.

countwords vs gccontent-benchmark huniq vs fzy countwords vs countwords huniq vs RAMCloud countwords vs countwords huniq vs repo countwords vs robin-hood-hashing huniq vs napkin-math countwords vs countwords huniq vs share-file-systems countwords vs fast-sqlite3-inserts huniq vs runiq

Compare countwords vs huniq and see what are their differences.

countwords

huniq

countwords

huniq

What are some alternatives?