awesome-vector-search
sqlite-vss
awesome-vector-search | sqlite-vss | |
---|---|---|
20 | 17 | |
1,275 | 1,455 | |
2.5% | - | |
6.1 | 8.0 | |
23 days ago | about 2 months ago | |
C++ | ||
MIT License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-vector-search
- Show HN: SimSIMD vs. SciPy: How AVX-512 and SVE make SIMD cleaner and ML faster
-
Reality check on good embedding model (and this idea in general)
Probably. But there are a number of free open source ones. For example, I've got a document that I'm doing embedding-keys for that has about 8000 sentences. Here's a list of some [ https://github.com/currentslab/awesome-vector-search ]
-
Rye, meet GPT3 ... and vice versa :)
note: search for vector databases not written in Go but with Go clients, in case there is anything more local/lightweight: https://github.com/currentslab/awesome-vector-search
-
Vector database built for scalable similarity search
https://github.com/currentslab/awesome-vector-search
I was surprised to see Elastic actually has ok support for some of this stuff, though it appears slower for most of the tasks.
-
[P] My co-founder and I quit our engineering jobs at AWS to build “Tensor Search”. Here is why.
Supporting sequence of vectors does seems like a fresh air to the vector search service. I have added marqo to the list of awesome vector search (disclosure: I am the maintainer of the list) to increase your exposure.
-
What are vector search engines?
If you want a proper curated list of various libraries and standalone services of vector search engines, refer to this awesome GitHub repository by Currents API.
- List of vector search libraries
- List of curated vector search libraries
- A GitHub repository that collects awesome vector search framework/engine, library, cloud service, and research papers
- Find anything fast with Google's vector search technology
sqlite-vss
-
I'm writing a new vector search SQLite Extension
I guess this is an answer to the GitHub issue I opened against SQLite-vss a couple of months ago?
https://github.com/asg017/sqlite-vss/issues/124
-
Embeddings are a good starting point for the AI curious app developer
Perhaps sqlite-vss? It adds vector searches to sqlite.
https://github.com/asg017/sqlite-vss
-
How to Enhance Content with Semantify
Utilizing sqlite-vss to store and query vector embeddings managed by a local SQLite database, Semantify conducts fast, precise vector searches within these embeddings to find and recommend relevant content, ensuring readers are presented with articles that truly match their interests.
-
SQLite vs. Chroma: A Comparative Analysis for Managing Vector Embeddings
Whether you’re navigating through well-known options like SQLite, enriched with the sqlite-vss extension, or exploring other avenues like Chroma, an open-source vector database, selecting the right tool is paramount. This article compares these two choices, guiding you through the pros and cons of each, helping you choose the right tool for storing and querying vector embeddings for your project.
-
Vector database is not a separate database category
Here is a SQLite extension that uses Faiss under the hood.
https://github.com/asg017/sqlite-vss
Not associated with the project, just love SQLite and find it very useful.
- SQLite-Vss: A SQLite Extension for Vector Search
-
Introduction to Vector Search and Embeddings
Vector Databases: As your data grows, efficiently searching through millions of vectors can become a challenge. Specialized vector databases like FAISS, Annoy, or Elasticsearch's vector search capabilities can be explored to manage and search through large-scale vector data. Your sentence is grammatically correct. In addition, databases like SQLite and PostgreSQL have extensions, such as sqlite-vss and pgvector, that can be used to store and query vector embeddings, respectively.
-
The Problem with LangChain
I had a go at one of those a few months ago: https://datasette.io/plugins/datasette-faiss
Alex Garcia built a better one here as a SQLite Rust extension: https://github.com/asg017/sqlite-vss
-
Every request, every microsecond: scalable machine learning at Cloudflare
Since the problem domain is that of anomaly detection from constructed request feature embeddings, I wonder if an ANN-search methodology using an embedded database (such as https://github.com/asg017/sqlite-vss or similar) was explored.
-
Disrupting the AI Scene with Open Source and Open Innovation
As I searched for "sqlite vector plugin" I didn't find any results, before a couple of weeks ago. Two weeks ago I found Alex' SQLite VSS plugin for SQLite. The library was an amazing piece of engineering from an "idea perspective". However, as I started playing around with it, I realised it was ipso facto like "Titanic". Beautiful and amazing, but destined to leak water and sink to the bottom of the ocean because of what we software engineers refers to as "memory leaks".
What are some alternatives?
pgvector - Open-source vector similarity search for Postgres
semantic-kernel - Integrate cutting-edge LLM technology quickly and easily into your apps
annoy - Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
chroma - the AI-native open-source embedding database
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
pgvector-go - pgvector support for Go
Milvus - A cloud-native vector database, storage for next generation AI applications
milvus-lite - A lightweight version of Milvus wrapped with Python.
hnswlib - Header-only C++/python library for fast approximate nearest neighbors
typesense-instantsearch-semantic-search-demo - A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js
featureform - The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
txtai - 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows