awesome-vector-search
sample-apps
awesome-vector-search | sample-apps | |
---|---|---|
20 | 3 | |
1,275 | 282 | |
2.5% | 1.1% | |
6.1 | 9.5 | |
22 days ago | 11 days ago | |
Jupyter Notebook | ||
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-vector-search
- Show HN: SimSIMD vs. SciPy: How AVX-512 and SVE make SIMD cleaner and ML faster
-
Reality check on good embedding model (and this idea in general)
Probably. But there are a number of free open source ones. For example, I've got a document that I'm doing embedding-keys for that has about 8000 sentences. Here's a list of some [ https://github.com/currentslab/awesome-vector-search ]
-
Rye, meet GPT3 ... and vice versa :)
note: search for vector databases not written in Go but with Go clients, in case there is anything more local/lightweight: https://github.com/currentslab/awesome-vector-search
-
Vector database built for scalable similarity search
https://github.com/currentslab/awesome-vector-search
I was surprised to see Elastic actually has ok support for some of this stuff, though it appears slower for most of the tasks.
-
[P] My co-founder and I quit our engineering jobs at AWS to build “Tensor Search”. Here is why.
Supporting sequence of vectors does seems like a fresh air to the vector search service. I have added marqo to the list of awesome vector search (disclosure: I am the maintainer of the list) to increase your exposure.
-
What are vector search engines?
If you want a proper curated list of various libraries and standalone services of vector search engines, refer to this awesome GitHub repository by Currents API.
- List of vector search libraries
- List of curated vector search libraries
- A GitHub repository that collects awesome vector search framework/engine, library, cloud service, and research papers
- Find anything fast with Google's vector search technology
sample-apps
-
[P] I'm building a Neural Search Plugin for Elastic/Opensearch
See this blog post https://blog.vespa.ai/pretrained-transformer-language-models-for-search-part-1/ and the open source sample app it describes: https://github.com/vespa-engine/sample-apps/tree/master/msmarco-ranking
-
Find anything fast with Google's vector search technology
>
Vespa.ai supports combining dense vector search with keyword search and ranking, see https://docs.google.com/presentation/d/1vWKhSvFH-4MFcs4aNa9C...
There is also a Vespa sample application (open source, Apache 2) demonstrating multiple different retrieval and ranking strategies over at https://github.com/vespa-engine/sample-apps/blob/master/msma...
-
What Are Some Open Source NLP Framework Pipelines For QA Task
Look up Vespa.ai. https://github.com/vespa-engine/sample-apps/tree/master/dense-passage-retrieval-with-ann
What are some alternatives?
pgvector - Open-source vector similarity search for Postgres
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
hnswlib - Header-only C++/python library for fast approximate nearest neighbors
Milvus - A cloud-native vector database, storage for next generation AI applications
Weaviate - Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
biggraph-wikidata-search-with-weaviate - Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine
featureform - The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
Elasticsearch - Free and Open, Distributed, RESTful Search Engine