qdrant
hnsqlite
qdrant | hnsqlite | |
---|---|---|
141 | 6 | |
17,943 | 143 | |
3.4% | 1.4% | |
9.9 | 5.5 | |
5 days ago | 10 months ago | |
Rust | Python | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
qdrant
-
Hindi-Language AI Chatbot for Enterprises Using Qdrant, MLFlow, and LangChain
Great. Now that we have the embeddings, we need to store them in a vector database. We will be using Qdrant for this purpose. Qdrant is an open-source vector database that allows you to store and query high-dimensional vectors. The easiest way to get started with the Qdrant database is using the docker.
-
Boost Your Code's Efficiency: Introducing Semantic Cache with Qdrant
I took Qdrant for this project. The reason was that Qdrant stands for high-performance vector search, the best choice against use cases like finding similar function calls based on semantic similarity. Qdrant is not only powerful but also scalable to support a variety of advanced search features that are greatly useful to nuanced caching mechanisms like ours.
-
Ask HN: Has Anyone Trained a personal LLM using their personal notes?
I'm currently looking to implement locally, using QDrant [1] for instance.
I'm just playing around, but it makes sense to have a runnable example for our users at work too :) [2].
[1]. https://qdrant.tech/
-
Show HN: A fast HNSW implementation in Rust
Also compare with qdrant's Rust implementation; they tout their performance. https://github.com/qdrant/qdrant/tree/master/lib/segment/src...
-
pgvecto.rs alternatives - qdrant and Weaviate
3 projects | 13 Mar 2024
-
Open-source Rust-based RAG
There are much better known examples, such as https://qdrant.tech/ and https://github.com/lancedb/lancedb
-
Qdrant 1.8.0 - Major Performance Enhancements
For more information, see our release notes. Qdrant is an open source project. We welcome your contributions; raise issues, or contribute via pull requests!
-
Perform Image-Driven Reverse Image Search on E-Commerce Sites with ImageBind and Qdrant
Initialize the Qdrant Client with in-memory storage. The collection name will be “imagebind_data” and we will be using cosine distance.
-
7 Vector Databases Every Developer Should Know!
Qdrant is an open-source vector search engine optimized for performance and flexibility. It supports both exact and approximate nearest neighbor search, providing a balance between accuracy and speed for various AI and ML applications.
- Ask HN: Who is hiring? (February 2024)
hnsqlite
-
LangChain: The Missing Manual
For anyone thinking about applications of langchain and pinecone but who are looking for something more turn-key check out https://jiggy.ai
The core is actually open source as well, allowing you to take your data back out via sqlite and hnswlib (https://github.com/jiggy-ai/hnsqlite)
-
I built an open source website that lets you upload large files, such as in-depth novels or academic papers, and ask ChatGPT questions based on your specific knowledge base. So far, I've tested it with long books like the Odyssey and random research papers that I like, and it works shockingly well.
We are built on open core https://github.com/jiggy-ai. Our open source hnsqlite is light weight, easy to use. And best of all, we make it easy for you to get your data out of JiggyBase. You can download a sqlite file that contains your document text data, metadata, embedding vectors, and embedding index. This can be used directly in the open source hnsqlite package.
-
What Is a Vector Database
After working through several projects that utilized local hnswlib and different databases for text and vector persistence, I integrated open source hnswlib with sqlite to create an embedded vector search engine that can easily scale up to millions of embeddings. For self-hosted situations of under 10M embeddings and less than insane throughput I think this combo is hard to beat.
https://github.com/jiggy-ai/hnsqlite
- Show HN: Hnsqlite: hnswlib and SQLite integrated for text embedding search
-
Faiss: A library for efficient similarity search
Thanks Leobg!
For anyone else: you pass it directly in metadata see https://github.com/jiggy-ai/hnsqlite/blob/main/test/test_col...
What are some alternatives?
Milvus - A cloud-native vector database, storage for next generation AI applications
langchainrb - Build LLM-powered applications in Ruby
Weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
NeMo-Guardrails - NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
faiss - A library for efficient similarity search and clustering of dense vectors.
guidance - A guidance language for controlling large language models. [Moved to: https://github.com/guidance-ai/guidance]
pgvector - Open-source vector similarity search for Postgres
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Elasticsearch - Free and Open, Distributed, RESTful Search Engine
GPT4Memory
towhee - Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
raft - RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.