Our great sponsors
-
qdrant
Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
txtai
š” All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
A better option would be using a dedicated open-source vector database like Qdrant, it is more efficient, scalable, and has a convenient API. https://github.com/qdrant/qdrant (disclaimer: I'm part of the team).
I've recently stumbled upon smaller projects, like FREDDY (https://github.com/guenthermi/postgres-word2vec), a Postgres extension that looks interesting. The ability to write ad-hoc similarity queries in SQL seems like it might be valuable in some circumstances. I'm not sure about performance or storage efficacy.
There are tons of other ways to store vector data, one was just recently released - https://github.com/google/tensorstore
I have project, txtai that supports vector/semantic/similarity search. It pairs an ANN index with a relational database to support SQL queries.