Comparison of Vector Databases

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  • VectorDBBench

    A Benchmark Tool for VectorDB

  • Interesting graphic, bland and unvoiced conclusion

    You're also missing a lot of details. For example, Milvus and Zilliz are actually a little different; see https://github.com/zilliztech/VectorDBBench for more details. (Of course, run it on your own stuff; don't blindly trust companies just because their product is open source.)

    Also, if you want to throw some more comparisons in there, check out Elasticsearch.

  • ann-benchmarks

    Benchmarks of approximate nearest neighbor libraries in Python

  • motorhead

    🧠 Motorhead is a memory and information retrieval server for LLMs.

  • Metal [1] is another one on my radar. Their API looks super simple.

    Disclosures: None

    [1] https://getmetal.io

  • vectara-answer

    LLM-powered Conversational AI experience using Vectara

  • With Vectara (full disclosure: I work there; https://vectara.com), we provide a simple API for building applications with Grounded Generation (aka retrieval-augmented generation). The embeddings model, the vector store, the retrieval engine, and all the other functionality are implemented by the Vectara platform, so you don't have to choose which vector DB or which embeddings model to use. That keeps things simple and lets you focus on developing your application.

  • txtai

    💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

  • txtai (https://github.com/neuml/txtai) is another option to consider. It has vector search with SQL, topic modeling and LLM prompt-driven search (retrieval augmented generation).

    Disclaimer: I am the author of txtai

  • chroma

    the AI-native open-source embedding database

  • Chroma can help here: https://github.com/chroma-core/chroma
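
One commenter above suggests running benchmarks on your own data rather than trusting published numbers. A minimal sketch of that idea follows, assuming you already have the IDs returned by whichever vector database you are evaluating. It computes exact nearest neighbors by brute force and measures recall@k against them. The function names (cosine_similarity, exact_top_k, recall_at_k) are hypothetical helpers written for this illustration, not part of any project listed here, and the code assumes non-zero vectors.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def exact_top_k(query, vectors, k):
    """Brute-force ground truth: indices of the k vectors most similar to query."""
    ranked = sorted(
        range(len(vectors)),
        key=lambda i: cosine_similarity(query, vectors[i]),
        reverse=True,
    )
    return ranked[:k]


def recall_at_k(approx_ids, exact_ids):
    """Fraction of the exact top-k results that the approximate search also returned."""
    if not exact_ids:
        return 0.0
    return len(set(approx_ids) & set(exact_ids)) / len(exact_ids)


if __name__ == "__main__":
    # Toy data standing in for your own embeddings.
    vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    query = [1.0, 0.1]

    truth = exact_top_k(query, vectors, k=2)
    # Placeholder for the IDs a real vector database returned for the same query.
    returned_by_db = [0, 1]
    print("exact top-k:", truth)
    print("recall@2:", recall_at_k(returned_by_db, truth))
```

Brute force is only practical on a sample of your data, but a sample is usually enough to check whether a database's approximate results match the quality its benchmarks claim.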

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
