pgsync
retake
pgsync | retake | |
---|---|---|
1 | 4 | |
1,055 | 757 | |
- | - | |
7.5 | 10.0 | |
18 days ago | 8 months ago | |
Python | Rust | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pgsync
-
Improving Postgres Text Search Speed 7x on Millions of Records
PGSync might be useful for those who don't mind also running Elasticsearch
https://github.com/toluaina/pgsync
retake
-
Show HN: Retake – Open-Source Hybrid Search for Postgres
https://github.com/getretake/retake/pull/198 is a refreshing change given the recent rug pulls, so thank you for that
-
We created an open-source semantic search Python package on top of Postgres
We found it difficult to do well with standard vector databases and so we ended up making a nice open-source package to layer semantic search on top of Postgres with just a few lines of code. It supports Python backends right now, always stays in sync with Postgres via Kafka, doubles as a vector store, and can be deployed anywhere.
- Show HN: Open-Source Infrastructure for Vector Data Streams
What are some alternatives?
cheatsheets - My Cheatsheet Repository
bionicgpt - BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality [Moved to: https://github.com/bionic-gpt/bionic-gpt]
dbd - dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
nfcompose - Build REST APIs/Integrations in minutes instead of hours - NF Compose is a (data) integration platform that allows developers to define REST APIs in seconds instead of hours. Generated REST APIs are backed by postgres and support automatic consumer webhook notifications on data changes out of the box.
usaspending-api - Server application to serve U.S. federal spending data via a RESTful API
embedditor - ⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
django-multiple-schemas - Sample project that describes how you can handle schema within your Django application.
vectorflow - VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
zeek2es - A Python application to filter and transfer Zeek logs to Elastic/OpenSearch+Humio. This app can also output pure JSON logs to stdout for further processing!
tinyvector - A tiny embedding database in pure Rust.
demo-opensearch-python - This repository contains code example in how to write search queries with OpenSearch Python client
prism - Prism is the easiest way to develop, orchestrate, and execute data pipelines in Python.