SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 similarity-search Open-Source Projects
-
Typesense
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
-
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
Weaviate
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
-
hora
🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀 .
-
usearch
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
similarity
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
-
SimSIMD
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
-
arcadedb
ArcadeDB Multi-Model Database, one DBMS that supports SQL, Cypher, Gremlin, HTTP/JSON, MongoDB and Redis. ArcadeDB is a conceptual fork of OrientDB, the first Multi-Model DBMS. ArcadeDB supports Vector Embeddings.
-
elastiknn
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
-
generalized-kmeans-clustering
Spark library for generalized K-Means clustering. Supports general Bregman divergences. Suitable for clustering probabilistic data, time series data, high dimensional data, and very large data.
-
Stocks-Pattern-Analyzer
This tool should help discover different patterns based on similarity measures in historical (financial) data
-
oasysdb
An embedded vector database designed to run on edge devices. Lightweight and fast with HNSW indexing algorithm.
-
awesome-vector-database
A curated list of awesome works related to high dimensional structure/vector search & database
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
There are actually plenty of non-ES products that are way easier to integrate and tune (and get better results with less effort).
- Typesense (https://github.com/typesense/typesense)
- Algolia
- Google Programmable Search Engine (https://programmablesearchengine.google.com/about/)
Project mention: Ask HN: Has Anyone Trained a personal LLM using their personal notes? | news.ycombinator.com | 2024-04-03I'm currently looking to implement locally, using QDrant [1] for instance.
I'm just playing around, but it makes sense to have a runnable example for our users at work too :) [2].
[1]. https://qdrant.tech/
Project mention: pgvecto.rs alternatives - qdrant and Weaviate | libhunt.com/r/pgvecto.rs | 2024-03-13
Project mention: Ask HN: What are the drawbacks of caching LLM responses? | news.ycombinator.com | 2024-03-15Just found this: https://github.com/zilliztech/GPTCache which seems to address this idea/issue.
Project mention: Building a Vector Database with Rust to Make Use of Vector Embeddings | /r/rust | 2023-06-01We have been playing around with Hora as a replacement for the Rust-CV implementation as we want PQ as well. I'll check out instanct-distance, looks very interesting!
Project mention: USearch SQLite Extensions for Vector and Text Search | news.ycombinator.com | 2024-02-22
Project mention: Show HN: SimSIMD vs. SciPy: How AVX-512 and SVE make SIMD cleaner and ML faster | news.ycombinator.com | 2023-10-07
Would this[1] library help you? It's a Rust vector similarity search engine, written to be compiled to Wasm. I discovered it through articles like these[2].
[1] https://github.com/tantaraio/voy
Project mention: ArcadeDB: Multi-Model Database Supporting Graphs, KV, Documents, TS, and Vectors | news.ycombinator.com | 2024-01-04
Project mention: Show HN: OasysDB, Storing vectors for RAG in Rust simplified | news.ycombinator.com | 2024-02-27
Project mention: Show HN: SimSIMD vs. SciPy: How AVX-512 and SVE make SIMD cleaner and ML faster | news.ycombinator.com | 2023-10-07
Project mention: Created a smol vector database in my free time. Looking to provide a LangChain integration soon! | /r/LangChain | 2023-05-06It supports all the basic features like creating an index, inserting vectors and searching through them. Here's the GitHub link if anyone's interested in going over it: https://github.com/0xDebabrata/citrus
similarity-search related posts
- 7 Vector Databases Every Developer Should Know!
- Ask HN: Semantic Vector Searching in WASM?
- Voy – A WASM vector similarity search written in Rust
- Show HN: Danswer – open-source question answering across all your docs
- I've changed my mind about Code Interpretor
- scenery - a self-hosted photogallery with built-in image search engine
- FANN: Vector Search in 200 Lines of Rust
-
A note from our sponsor - SaaSHub
www.saashub.com | 19 Apr 2024
Index
What are some of the best open-source similarity-search projects? This list will help you:
Project | Stars | |
---|---|---|
1 | Typesense | 17,796 |
2 | qdrant | 17,718 |
3 | Weaviate | 9,436 |
4 | GPTCache | 6,387 |
5 | gerev | 2,596 |
6 | hora | 2,552 |
7 | usearch | 1,611 |
8 | vald | 1,453 |
9 | awesome-vector-search | 1,257 |
10 | similarity | 994 |
11 | SimSIMD | 707 |
12 | voy | 696 |
13 | quaterion | 619 |
14 | arcadedb | 438 |
15 | awesome-metric-learning | 433 |
16 | elastiknn | 352 |
17 | tinyvector | 335 |
18 | generalized-kmeans-clustering | 295 |
19 | Stocks-Pattern-Analyzer | 214 |
20 | oasysdb | 202 |
21 | postgres-word2vec | 140 |
22 | awesome-vector-database | 127 |
23 | citrus | 92 |