You Shouldn't Invest in Vector Databases?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
getstream.io
featured
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com
featured
  1. txtai

    💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

    You can try txtai (https://github.com/neuml/txtai) with a Faiss backend.

    This Faiss wiki article might help (https://github.com/facebookresearch/faiss/wiki/Indexing-1G-v...).

    For example, a partial Faiss configuration with 4-bit PQ quantization and only using 5% of the data to train an IVF index is shown below.

    faiss={"components": "IVF,PQ384x4fs", "sample": 0.05}

  2. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  3. faiss

    A library for efficient similarity search and clustering of dense vectors.

    You can try txtai (https://github.com/neuml/txtai) with a Faiss backend.

    This Faiss wiki article might help (https://github.com/facebookresearch/faiss/wiki/Indexing-1G-v...).

    For example, a partial Faiss configuration with 4-bit PQ quantization and only using 5% of the data to train an IVF index is shown below.

    faiss={"components": "IVF,PQ384x4fs", "sample": 0.05}

  4. pgvecto.rs

    Scalable, Low-latency and Hybrid-enabled Vector Search in Postgres. Revolutionize Vector Search, not Database.

    It's kind of a tradeoff. Performance is just one factor when choosing the vector database. In pgvecto.rs https://github.com/tensorchord/pgvecto.rs, we store the index separately from PostgreSQL's internal storage, unlike pgvector's approach. This enable us to get multi-threaded indexing, async indexing without blocking the insertion, and faster search speed comparing to pgvector.

    I don't see any fundamental reason why the index in Postgres would be slower than a specialized vector database. The query pattern of the vector database is simply a point query using an index, similar to other queries in an OLTP system.

    The only limitation I see is scalability. It's not easy to make PostgreSQL distributed, but solutions like Citus exist, making it still possible.

    (I'm the author of pgvecto.rs)

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • AI for Scientific Search

    1 project | news.ycombinator.com | 3 Jul 2025
  • What's Happening in Developer Tools? (OpenAI Just Bought Windsurf for $3B)

    7 projects | dev.to | 7 May 2025
  • Chunking your data for RAG

    13 projects | dev.to | 11 Feb 2025
  • Analyzing LinkedIn Company Posts with Graphs and Agents

    1 project | dev.to | 12 Jan 2025
  • Show HN: I made a website to semantically search ArXiv papers

    7 projects | news.ycombinator.com | 24 Dec 2024