DiskANN

Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search (by microsoft)

DiskANN Alternatives

Similar projects and alternatives to DiskANN

  1. prometheus

    The Prometheus monitoring system and time series database.

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. Redis

    For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.

  4. MongoDB

    309 DiskANN VS MongoDB

    The MongoDB Database

  5. ApacheKafka

    A curated re-sources list for awesome Apache Kafka

  6. Milvus

    131 DiskANN VS Milvus

    Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

  7. RabbitMQ

    119 DiskANN VS RabbitMQ

    Open source RabbitMQ: core server and tier 1 (built-in) plugins

  8. pgvector

    Open-source vector similarity search for Postgres

  9. Stream

    Stream - Scalable APIs for Chat, Feeds, Moderation, & Video. Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.

    Stream logo
  10. TimescaleDB

    A time-series database for high-performance real-time analytics packaged as a Postgres extension

  11. datafusion

    Apache DataFusion SQL Query Engine

  12. Neo4j

    60 DiskANN VS Neo4j

    Graphs for Everyone

  13. tantivy

    57 DiskANN VS tantivy

    Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust

  14. RocksDB

    51 DiskANN VS RocksDB

    A library that provides an embeddable, persistent key-value store for fast storage.

  15. nodejs-pubsub

    27 DiskANN VS nodejs-pubsub

    Node.js client for Google Cloud Pub/Sub: Ingest event streams from anywhere, at any scale, for simple, reliable, real-time stream analytics.

  16. pg-boss

    22 DiskANN VS pg-boss

    Queueing jobs in Postgres from Node.js like a boss

  17. pgai

    19 DiskANN VS pgai

    A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL

  18. pgvectorscale

    A complement to pgvector for high performance, cost efficient vector search on large workloads.

  19. pg_jsonschema

    PostgreSQL extension providing JSON Schema validation

  20. pinecone

    Peer-to-peer overlay routing for the Matrix ecosystem

  21. pgANN

    2 DiskANN VS pgANN

    Discontinued Fast Approximate Nearest Neighbor (ANN) searches with a PostgreSQL database.

  22. pgaidocs

    Discontinued [GET https://api.github.com/repos/timescale/pgaidocs: 404 - Not Found // See: https://docs.github.com/rest/repos/repos#get-a-repository]

  23. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better DiskANN alternative or higher similarity.

DiskANN discussion

Log in or Post with

DiskANN reviews and mentions

Posts with mentions or reviews of DiskANN. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-05-28.
  • Unpacking DiskANN: My Technical Journey Through Billion-Scale Vector Search
    1 project | dev.to | 3 Jul 2025
    What happens when vector datasets exceed what RAM can handle? This question drove my investigation into DiskANN – an SSD-optimized approach for massive-scale similarity search. Unlike traditional methods like HNSW that hit scalability ceilings around 100M vectors, DiskANN achieves billion-scale indexing by strategically leveraging disk storage. I’ll share how it balances latency, recall, and cost through architectural innovations.
  • Pushing Billion-Scale Vector Search Beyond RAM Limits with DiskANN
    1 project | dev.to | 23 Jun 2025
    The Memory Wall Problem Most vector indexes prioritize RAM for low latency. HNSW, for example, achieves 95% recall at <5ms for 100M vectors but requires ~500GB RAM. At 1B vectors, RAM costs exceed $10k/month on cloud instances—prohibitively expensive for many teams. DiskANN flips this model:
  • PostgreSQL Maximalism
    46 projects | dev.to | 28 May 2025
    Learns from Microsoft's DiskANN: "Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search"
  • Join us for the Open Source AI Challenge with pgai and Ollama: $3,000 in Prizes!
    6 projects | dev.to | 30 Oct 2024
    A new index type called StreamingDiskANN, inspired by the DiskANN algorithm, based on research from Microsoft.
  • DiskANN Implementation in Rust + Easy NN Search
    2 projects | /r/rust | 20 May 2023
    Hi! I have been noodling away at a re-implementation of the original C++ DiskANN project as well as packaging the latest advances in embedding generation. The rough repo is here and will remain licensed as Apache-2.0!
  • Pinecone raises $100M Series B
    6 projects | news.ycombinator.com | 27 Apr 2023
    Spot on. There is zero moat and the self-hosted alternatives are rapidly improving (if not better) than Pinecone. There are good open-source contributions coming from bigcorp beyond Meta too, e.g., DiskANN (https://github.com/microsoft/DiskANN).
  • SSD-Based Vector Indices
    1 project | news.ycombinator.com | 22 May 2022
  • A note from our sponsor - Stream
    getstream.io | 14 Jul 2025
    Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure. Learn more →

Stats

Basic DiskANN repo stats
7
1,398
5.0
7 days ago

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that C++ is
the 7th most popular programming language
based on number of references?