pgvecto.rs
immich
pgvecto.rs | immich | |
---|---|---|
17 | 291 | |
1,429 | 32,855 | |
14.3% | 11.8% | |
9.3 | 10.0 | |
1 day ago | 4 days ago | |
Rust | TypeScript | |
Apache License 2.0 | GNU Affero General Public License v3.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
pgvecto.rs
-
My binary vector search is better than your FP32 vectors
To evaluate the performance metrics in comparison to the original vector approach, we conducted benchmarking using the dbpedia-entities-openai3-text-embedding-3-large-3072-1M dataset. The benchmark was performed on a Google Cloud virtual machine (VM) with specifications of n2-standard-8, which includes 8 virtual CPUs and 32GB of memory. We used pgvecto.rs v0.2.1 as the vector database.
-
pgvecto.rs 0.2: Unifying Relational Queries and Vector Search in PostgreSQL
Please check out our documentation for more details. We encourage you to try out pgvecto.rs, benchmark it against your workloads, and contribute your indexing innovations. Join our Discord community to connect with the developers and other users working to improve pgvecto.rs!
-
pgvecto.rs alternatives - qdrant and Weaviate
3 projects | 13 Mar 2024
-
Milvus VS pgvecto.rs - a user suggested alternative
2 projects | 13 Mar 2024
-
You Shouldn't Invest in Vector Databases?
It's kind of a tradeoff. Performance is just one factor when choosing the vector database. In pgvecto.rs https://github.com/tensorchord/pgvecto.rs, we store the index separately from PostgreSQL's internal storage, unlike pgvector's approach. This enable us to get multi-threaded indexing, async indexing without blocking the insertion, and faster search speed comparing to pgvector.
I don't see any fundamental reason why the index in Postgres would be slower than a specialized vector database. The query pattern of the vector database is simply a point query using an index, similar to other queries in an OLTP system.
The only limitation I see is scalability. It's not easy to make PostgreSQL distributed, but solutions like Citus exist, making it still possible.
(I'm the author of pgvecto.rs)
-
How We Made PostgreSQL a Better Vector Database
Hi, we've solved the problem you mentioned! Please take a look on our open source postgres vector extension https://github.com/tensorchord/pgvecto.rs.
Our index building process is significantly faster than pgvector on hnsw because we can utilize all the cores, whereas pgvector can only use one core. And for the filter support, we do support pre-filtering, which will guarantee enough results no matter the condition is.
-
First Postgres Vector Extension with Filtering Support
Hi,
In our previous post titled “Do we really need a specialized vector database?” on HN (https://news.ycombinator.com/item?id=37097004) we discussed the importance of using a Postgres-based solution for vector search. However, we acknowledged that existing Postgres vector extensions lack support for metadata filtering.
We are excited to announce that we have now addressed this limitation. We are proud to be the first (https://github.com/tensorchord/pgvecto.rs) to enable conditional filtering directly on HNSW indexes within Postgres. This breakthrough allows for efficient and effective metadata filtering in combination with vector search, eliminating the tradeoff previously associated with using Postgres for this purpose.
We invite you to explore our updated offering and experience the benefits of seamless metadata filtering within a Postgres-based vector search system.
-
A Summary of LLMOps
Yeah, I think in many cases you just need a vector search lib, instead of a DB.
And in some other cases, you may want postgres vector extension e.g. https://github.com/tensorchord/pgvecto.rs instead of a specialized vector db.
-
An early look at HNSW performance with pgvector
Seems that pgvector has a viable competitor extension: https://github.com/tensorchord/pgvecto.rs
-
20x Faster as the Beginning: Introducing pgvecto.rs extension written in Rust
We are thrilled to announce the release of https://github.com/tensorchord/pgvecto.rs, a powerful Postgres extension for vector similarity search written in Rust. Its HNSW algorithm is 20x faster than pgvector at 90% recall. But speed is just the start - pgvecto.rs is architected to add new algorithms easily. We've made it an extensible architecture for contributors to implement the new indexes quickly, and we look forward to the open-source community driving pgvecto.rs to new heights!
immich
- Immich: Self-hosted photo and video management solution
-
Happy 20th Anniversary, Gmail. I'm Sorry I'm Leaving You
It really is hard to leave Gmail when all of your data has been conveniently stored therein. This is one of Google's retention strategies and it is indeed brilliant.
That said, there's a vast number of self-hosted alternatives like Stalwart Mail (email) [1], Immich (images) [2], NextCloud (Google Docs) [3], etc.
[1] https://stalwa.rt
[2] https://immich.app
[3] https://nextcloud.com/
-
I accidentally built a meme search engine
Last year we added CLIP-based image search to https://immich.app/ and even though I have a pretty good understanding of how it works, it still blows my mind damn near every day. It's the closest thing to magic I've ever seen.
- immich SSO with Authentik
-
Show HN: Memories, FOSS Google Photos alternative built for high performance
I’m a big fan of https://immich.app/ and I use it every day for thousands of assets
-
pgvecto.rs 0.2: Unifying Relational Queries and Vector Search in PostgreSQL
Real-world applications often require complex queries that go beyond simple Approximate Nearest Neighbor (ANN) search. To explore a practical example of such applications, let's take a closer look at immich, a self-hosted photo and video backup solution that highlights the importance of advanced vector and traditional relational queries.
- Home Lab Guide
-
Ask HN: What Underrated Open Source Project Deserves More Recognition?
I discovered these 3 amazing projects recently:
Cryptpad, essentially google docs/sheets/forms e2e encrypted. It does include collaboration. https://github.com/cryptpad/cryptpad
Immich, google photos self hostable, with share options https://github.com/immich-app/immich
Nginxproxymanager manages certificates and proxies to self hosted stuff through nginx https://github.com/NginxProxyManager/nginx-proxy-manager
Great self hosting stuff!
-
Ente: Open-Source, E2E Encrypted, Google Photos Alternative
I realize it's very hard, but can we maybe reconsider opening the encryption-at-rest feature request? https://github.com/immich-app/immich/issues/450
Maybe we can give temporary access to processing steps in the pipeline, then have Immich forget the keys after it does the processing?
What are some alternatives?
pgvector - Open-source vector similarity search for Postgres
PhotoPrism - AI-Powered Photos App for the Decentralized Web 🌈💎✨
modelz-llm - OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
Piwigo - Manage your photos with Piwigo, a full featured open source photo gallery application for the web. Star us on Github! More than 200 plugins and themes available. Join us and contribute!
pgvecto.rs-bench
Nextcloud - ☁️ Nextcloud server, a safe home for all your data
Awesome-LLMOps - An awesome & curated list of best LLMOps tools for developers
librephotos - A self-hosted open source photo management service. This is the repository of the backend.
faiss-rs - Rust language bindings for Faiss
photoview - Photo gallery for self-hosted personal servers
DocumentGPT - DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search using vector databases. This tool provides a seamless interface for interacting with your research document, exploring search results, and engaging in a conversation with an AI chatbot.
PiGallery 2 - A fast directory-first photo gallery website, with rich UI, optimized for running on low resource servers (especially on raspberry pi)