VectorDBBench
lucene-grep
VectorDBBench | lucene-grep | |
---|---|---|
16 | 9 | |
408 | 188 | |
10.0% | - | |
8.5 | 5.2 | |
6 days ago | 8 months ago | |
Python | Clojure | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
VectorDBBench
- FLaNK-AIM Weekly 06 May 2024
- GPU index supports in Vector Database benchmark latest version
- Benchmarking Tool for Vector DBs
-
Vespa.ai is spinning out of Yahoo as a separate company
We conducted benchmark tests on Elastic's queries per second (QPS) performance using datasets of 500,000 and 1 million vectors. Result was Zilliz is 13x and 22x faster, per number of vectors respectively. https://zilliz.com/blog/elasticsearch-cloud-vs-zilliz
Feel free to explore our open-source benchmarking tool, which allows you to examine our methodology and even compare it with your vector database. https://github.com/zilliztech/VectorDBBench
- Vector Database benchmark with 1536/768 dim data
-
Vector Dataset benchmark with 1536/768 dim data
"
the link is: https://github.com/zilliztech/VectorDBBench/issues/200#issue...
-
Comparison of Vector Databases
Interesting graphic, bland and unvoiced conclusion
You're also missing a lot of details. For example, Milvus and Zilliz are actually a little different, check this out for more details: https://github.com/zilliztech/VectorDBBench (of course run it on your own stuff, don't blindly trust companies just because their product is open source)
Also if you want to throw some more comparisons in their checkout elastic search
- VectorDB benchmark for both cloud and open source
- Cloud Vector Database Benchmark Result
- FLaNK Stack Weekly for 20 June 2023
lucene-grep
- FLaNK Stack Weekly for 20 June 2023
-
Using Java's Project Loom to build more reliable distributed systems
- Graal native images are real. These boast a far lower startup overhead and much lower steady state memory usage for simpler applications.
Probably my counterexample of choice is this: https://github.com/dainiusjocas/lucene-grep - it uses Lucene, probably the best search library (core of Elasticsearch, Solr, most websites), which is notoriously not simple code to implement grep-like functionality. In simple cases, they demonstrate a 30ms whole process runtime with no more than 32MB of RAM used (which looks suspiciously like a default).
The JVM is fast becoming a bit like Postgres... one of those 'second best at everything' pieces of tech.
- lucene-grep - grep-like utility based on Lucene Monitor compiled with GraalVM native-image
-
Lmgrep: Lucene-based grep-like utility
Here goes: https://github.com/dainiusjocas/lucene-grep/issues/84
I realize some relatively obscure Finnish stemmer and Lucene with GraalVM aren't exactly a common use case. I did some testing and provided my use case. I certainly have much English language content to search with using lucene-grep. So, thank you for making it!
- Lmgrep
What are some alternatives?
jsoncrack.com - ✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs.
ArchiveBox - 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
FinGPT - FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
ali-dbhub - 已迁移新仓库,此版本将不再维护
chroma - the AI-native open-source embedding database
babashka - Native, fast starting Clojure interpreter for scripting
ann-benchmarks - Benchmarks of approximate nearest neighbor libraries in Python
BlockHound - Java agent to detect blocking calls from non-blocking threads.
motorhead - 🧠 Motorhead is a memory and information retrieval server for LLMs.
beagle - A smart, reliable, and highly customizable debug menu library for Android apps that supports screen recording, network activity logging, and many other useful features.
vectara-answer - LLM-powered Conversational AI experience using Vectara
coyote - Coyote is a library and tool for testing concurrent C# code and deterministically reproducing bugs.