vectordb
lancedb-study
vectordb | lancedb-study | |
---|---|---|
6 | 1 | |
552 | 11 | |
5.1% | - | |
7.6 | 8.9 | |
2 days ago | 5 months ago | |
Python | Python | |
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
vectordb
-
VectorDB: Vector Database Built by Kagi Search
We needed a low latency, on premise solution that we can run on edge nodes (so lightweight) with sane defaults that anyone in the team can whim in a sec.
Result is this and we constantly benchmark performance of different embeddings to ensure best defaults.
[1] https://github.com/kagisearch/vectordb#embeddings-performanc...
-
Embeddings: What they are and why they matter
If you are looking for lightweight, low- latency, fully local, end-to-end solution (chunking, embedding, storage and vector search), try vectordb [1]
Just spent a day updating it with latest benchmarks for text embedding models.
[1] https://github.com/kagisearch/vectordb
lancedb-study
-
VectorDB: Vector Database Built by Kagi Search
I thought the API here was quite neat. It's fairly simple to implement a lancedb backend for it instead of sklearn/faiss/mrpt as the source code is really simple.
This repo is basically just a nice api and the needed chunking and batching logic. Using lancedb, you'd still have to write that, as exemplified here: https://github.com/prrao87/lancedb-study/blob/main/lancedb/i...
What are some alternatives?
langroid - Harness LLMs with Multi-Agent Programming
txtai - 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
onnxruntime - ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Wallabag - wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
deeplake - Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
telekinesis - Control Objects and Functions Remotely
marqo - Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
cog - Micro Graph Database for Python Applications
supabase - The open source Firebase alternative.
DBoW2 - Enhanced hierarchical bag-of-word library for C++