pybktree
rchowell
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pybktree
-
Ask HN: What are some 'cool' but obscure data structures you know about?
The BK-Tree, which allows fast querying of "close" matches, such as Hamming distance (number of bits different). http://blog.notdot.net/2007/4/Damn-Cool-Algorithms-Part-1-BK...
I wrote a Python library implementing them a number of years ago: https://github.com/benhoyt/pybktree
-
Find closest match to word in really large list
Alternatively a BK-tree might suit your needs https://github.com/benhoyt/pybktree/blob/master/pybktree.py
rchowell
-
Find closest match to word in really large list
The best approach however is probably to build a large Trie of your data. Look at https://github.com/RCHowell/rchowell.github.io/issues/4
What are some alternatives?
nutree - A Python library for tree data structures with an intuitive, yet powerful API.
multiversion-concurrency-control - Implementation of multiversion concurrency control, Raft, Left Right concurrency Hashmaps and a multi consumer multi producer Ringbuffer, concurrent and parallel load-balanced loops, parallel actors implementation in Main.java, Actor2.java and a parallel interpreter
Folly - An open-source C++ library developed and used at Facebook.
TextDistance - 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
minisketch - Minisketch: an optimized library for BCH-based set reconciliation
ann-benchmarks - Benchmarks of approximate nearest neighbor libraries in Python
pyroscope - Continuous Profiling Platform. Debug performance issues down to a single line of code [Moved to: https://github.com/grafana/pyroscope]
multiversion-concurrency-contro
entt - Gaming meets modern C++ - a fast and reliable entity component system (ECS) and much more
PSI - Private Set Intersection Cardinality protocol based on ECDH and Bloom Filters
CPython - The Python programming language