benchmark vs vector-db-benchmark
| | benchmark | vector-db-benchmark |
|---|---|---|
| Mentions | 1 | 6 |
| Stars | 7 | 229 |
| Growth | - | 8.7% |
| Activity | 0.0 | 9.1 |
| Latest commit | over 1 year ago | 16 days ago |
| Language | Python | Python |
| License | - | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
benchmark
-
We just got funded for an open-source project to make Metric Learning practical.
Regarding Milvus. Well, there are a few essential differences between our projects:
- Unlike Milvus, we perform filtering during the search in the vector index, which keeps retrieval complexity close to logarithmic - same as in the original HNSW.
- We can support complex types of filterable payloads, like geo-points. It is not a trivial problem to keep the HNSW search graph connected during filtering; we solved it in our custom implementation of the HNSW index.
- Unlike Milvus, we perform a query-planning phase to determine an optimal strategy for executing queries with filters.
- Qdrant uses the Rust programming language, which gives us an advantage in avoiding the stop-the-world issues of languages with garbage collection.

We also have a retrieval speed benchmark - https://github.com/qdrant/benchmark.
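The filter-during-search idea above can be sketched in a few lines. This is a simplified, illustrative model (flat best-first search over one graph layer, not a real multi-layer HNSW): nodes that fail the filter are still traversed, so the graph stays connected, but only filter-passing nodes enter the result set. All names here are hypothetical, not Qdrant's actual implementation.

```python
import heapq
import math

def filtered_greedy_search(graph, vectors, query, entry, k, predicate):
    """Best-first search over an HNSW-style neighbor graph.

    Nodes failing `predicate` are still expanded (keeping the graph
    connected during filtering) but never added to the results.
    """
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    visited = {entry}
    candidates = [(dist(vectors[entry], query), entry)]  # min-heap by distance
    results = []  # max-heap of (-distance, node); filter-passing nodes only

    while candidates:
        d, node = heapq.heappop(candidates)
        # Stop once the closest remaining candidate is worse than the
        # k-th best result found so far.
        if len(results) >= k and d > -results[0][0]:
            break
        if predicate(node):
            heapq.heappush(results, (-d, node))
            if len(results) > k:
                heapq.heappop(results)
        for nb in graph[node]:
            if nb not in visited:
                visited.add(nb)
                heapq.heappush(candidates, (dist(vectors[nb], query), nb))

    return sorted((-nd, n) for nd, n in results)
```

With a post-filter instead, a highly selective predicate could empty the candidate list entirely; filtering during traversal avoids that failure mode.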
vector-db-benchmark
-
RAG is Dead. Long Live RAG!
Qdrant’s benchmark results are strongly in favor of accuracy and efficiency. We recommend that you consider them before deciding that an LLM is enough. Take a look at our open-source benchmark reports and try out the tests yourself.
-
Evaluate Vector Database / Benchmarks?
Qdrant made their own benchmark. It is quite simple and takes more options into consideration, so it should be better suited to benchmarking for production purposes.
-
Qdrant, Pinecone, Supabase
When it comes to Supabase, it's using pgvector under the hood, so it would make sense to benchmark it with the other open-source tools. There is an open PR for that, but it's pretty old: https://github.com/qdrant/vector-db-benchmark/pull/50
-
Building a Vector Database with Rust to Make Use of Vector Embeddings
P.S.: Perhaps you want to add your database to our benchmarks repo?
-
New and Improved Embedding Model for OpenAI
Do we have any idea why lucene vector search underperforms? As of lucene 9.1 (and elastic 8.4), it runs the same sort of filtered/categorical HNSW that qdrant runs (https://lucene.apache.org/core/9_1_0/core/org/apache/lucene/...). Qdrant's benchmarking code (https://github.com/qdrant/vector-db-benchmark/blob/9263ba/en...) does use the new filtered ann query with elastic 8.4, so it appears to be a fair benchmark. Why is lucene/elastic so much slower? Is it a rust vs. java thing? Or some memory management issues?
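The "filtered ann query" referred to above is the `knn` search option that Elasticsearch exposes from 8.x, where a `filter` clause is applied during HNSW graph traversal rather than as a post-filter. A sketch of such a request body, with assumed index field names and a placeholder query vector:

```python
# Field name "embedding" and the term filter are illustrative assumptions,
# not taken from the benchmark's actual configuration.
filtered_knn_query = {
    "knn": {
        "field": "embedding",             # dense_vector field (assumed name)
        "query_vector": [0.1, 0.2, 0.3],  # placeholder query embedding
        "k": 10,
        "num_candidates": 100,            # HNSW candidate pool size
        "filter": {                       # evaluated during graph traversal,
            "term": {"category": "news"}  # not as a post-filter
        },
    }
}
```

Raising `num_candidates` trades latency for recall, so benchmarks normally sweep it to compare engines at comparable accuracy.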
-
Which vector search engine is the fastest?
There is also an open-source framework for benchmarking https://github.com/qdrant/vector-db-benchmark
What are some alternatives?
fashion-mnist - A MNIST-like fashion product database. Benchmark :point_down:
citrus - (distributed) vector database
towhee - Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
ann-benchmarks - Benchmarks of approximate nearest neighbor libraries in Python
vector-search - The definitive guide to using Vector Search to solve your semantic search production workload needs.
qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
weaviate-examples - Weaviate vector database – examples
instant-distance - Fast approximate nearest neighbor searching in Rust, based on HNSW index
hora - 🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀.