Top 8 nearest-neighbor Open-Source Projects
- similarity: TensorFlow Similarity is a Python package focused on making similarity learning quick and easy.
- raft: RAFT contains fundamental, widely used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for writing high-performance applications more easily. (by rapidsai)
- NearestNeighbors.jl: High-performance nearest neighbor data structures (KDTree and BallTree) and algorithms for Julia.
- MachineLearningWithPython: Get started with Machine Learning with Python - An introduction with Python programming examples
- Iris: The original lightweight introduction to machine learning in Rubix ML using the famous Iris dataset and the K Nearest Neighbors classifier. (by RubixML)
- CSGO-Pro-Gear-Performance-and-EDA: Modeling professional CS:GO gamers' accuracy performance based on gear and settings, with exploratory data analysis.
Project mention: Using Your Vector Database as a JSON (Or Relational) Datastore | news.ycombinator.com | 2024-04-23
Off the top of my head, pgvector only supports 2 indexes, and both run in memory only. It doesn't support GPU indexing or disk-based indexing, and it doesn't separate queries from insertions.
Also, different people I've talked to struggle with scale past 100K-1M vectors.
You can also have a look yourself from a performance perspective: https://ann-benchmarks.com/
Project mention: Raft: Fundamental widely-used algorithms and primitives for machine learning | news.ycombinator.com | 2024-02-22
nearest-neighbors related posts
- Using Your Vector Database as a JSON (Or Relational) Datastore
- ANN Benchmarks
- Approximate Nearest Neighbors Oh Yeah
- pgvector vs Pinecone: cost and performance
- Vector database is not a separate database category
- How We Made PostgreSQL a Better Vector Database
- Vector Search with OpenAI Embeddings: Lucene Is All You Need
Index
What are some of the best open-source nearest-neighbor projects? This list will help you:
# | Project | Stars |
---|---|---|
1 | ann-benchmarks | 4,588 |
2 | similarity | 994 |
3 | raft | 612 |
4 | NearestNeighbors.jl | 400 |
5 | pgANN | 289 |
6 | MachineLearningWithPython | 144 |
7 | Iris | 30 |
8 | CSGO-Pro-Gear-Performance-and-EDA | 1 |