tinyvector
retake
tinyvector | retake | |
---|---|---|
1 | 4 | |
342 | 757 | |
- | - | |
6.1 | 10.0 | |
5 months ago | 8 months ago | |
Rust | Rust | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tinyvector
retake
-
Show HN: Retake – Open-Source Hybrid Search for Postgres
https://github.com/getretake/retake/pull/198 is a refreshing change given the recent rug pulls, so thank you for that
-
We created an open-source semantic search Python package on top of Postgres
We found it difficult to do well with standard vector databases and so we ended up making a nice open-source package to layer semantic search on top of Postgres with just a few lines of code. It supports Python backends right now, always stays in sync with Postgres via Kafka, doubles as a vector store, and can be deployed anywhere.
- Show HN: Open-Source Infrastructure for Vector Data Streams
What are some alternatives?
embedditor - ⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
bionicgpt - BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality [Moved to: https://github.com/bionic-gpt/bionic-gpt]
paradedb - Postgres for Search and Analytics
nfcompose - Build REST APIs/Integrations in minutes instead of hours - NF Compose is a (data) integration platform that allows developers to define REST APIs in seconds instead of hours. Generated REST APIs are backed by postgres and support automatic consumer webhook notifications on data changes out of the box.
vectorflow - VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
pgsync - Postgres to Elasticsearch/OpenSearch sync
prism - Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.
qdrant - Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
mara-pipelines - A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
code-indexer-loop - Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuously and efficiently updated.