vector-db-benchmark
citrus
vector-db-benchmark | citrus | |
---|---|---|
6 | 1 | |
227 | 93 | |
7.9% | - | |
9.1 | 7.6 | |
5 days ago | about 1 month ago | |
Python | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vector-db-benchmark
-
RAG is Dead. Long Live RAG!
Qdrant’s benchmark results are strongly in favor of accuracy and efficiency. We recommend that you consider them before deciding that an LLM is enough. Take a look at our open-source benchmark reports and try out the tests yourself.
-
Evaluate Vector Database / Benchmarks?
Qdrant made their own benchmark. It is quite simple and also takes into consideration more options, so it should be better suited for benchmarking for production purposes.
-
Qdrant, Pinecone, Supabase
is noWhen it comes to Supabase, it's using pgvector under the hood, so it would make sense to benchmark it with the other Open Source tools. There is an open PR for that, but it's pretty old: https://github.com/qdrant/vector-db-benchmark/pull/50
-
Building a Vector Database with Rust to Make Use of Vector Embeddings
P.S.: Perhaps you want to add your database to our benchmarks repo?
-
New and Improved Embedding Model for OpenAI
Do we have any idea why lucene vector search underperforms? As of lucene 9.1 (and elastic 8.4), it runs the same sort of filtered/categorical HNSW that qdrant runs (https://lucene.apache.org/core/9_1_0/core/org/apache/lucene/...). Qdrant's benchmarking code (https://github.com/qdrant/vector-db-benchmark/blob/9263ba/en...) does use the new filtered ann query with elastic 8.4, so it appears to be a fair benchmark. Why is lucene/elastic so much slower? Is it a rust vs. java thing? Or some memory management issues?
-
Which vector search engine is the fastest?
There is also an open-source framework for benchmarking https://github.com/qdrant/vector-db-benchmark
citrus
-
Created a smol vector database in my free time. Looking to provide a LangChain integration soon!
It supports all the basic features like creating an index, inserting vectors and searching through them. Here's the GitHub link if anyone's interested in going over it: https://github.com/0xDebabrata/citrus
What are some alternatives?
ann-benchmarks - Benchmarks of approximate nearest neighbor libraries in Python
pgANN - Fast Approximate Nearest Neighbor (ANN) searches with a PostgreSQL database.
vector-search - The definitive guide to using Vector Search to solve your semantic search production workload needs.
vector-search-compilation - A compilation of Vector Search Databases
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
vald - Vald. A Highly Scalable Distributed Vector Search Engine
weaviate-examples - Weaviate vector database – examples
awesome-vector-database - A curated list of awesome works related to high dimensional structure/vector search & database
instant-distance - Fast approximate nearest neighbor searching in Rust, based on HNSW index