Are there more practical tools for KNN searches and storing documents/embeddings?

This page summarizes the projects mentioned and recommended in the original post on reddit.com/r/LanguageTechnology.

  • nmslib

    Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.

    I also needed to build a similar system and used nmslib, so it may be worth checking out: https://github.com/nmslib/nmslib

  • TorchPQ

    Efficient implementations of Product Quantization and its variants using PyTorch and CUDA

    If you're using a GPU for search, you can give TorchPQ a try. It's an ANN search library implemented with PyTorch and CUDA. With TorchPQ, you can freely add and remove vectors by ID, and you can save and load them just like a regular PyTorch model. TorchPQ is still in development, and I'd love to hear any suggestions or feedback.
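
For readers unfamiliar with product quantization, the idea behind a PQ index with add/remove by ID can be sketched in plain Python. This is a CPU-only toy under stated assumptions, not TorchPQ's API: the names `ToyPQIndex`, `kmeans`, and `sq_dist` are hypothetical, the farthest-first centroid initialization is chosen only to keep the demo deterministic, and the synthetic data is made up for illustration.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, iters=10):
    """Lloyd's k-means with farthest-first initialization (an assumption for determinism)."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        farthest = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(list(farthest))
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: sq_dist(p, centroids[c]))
            buckets[nearest].append(p)
        for c, bucket in enumerate(buckets):
            if bucket:  # keep the old centroid if a cluster ends up empty
                centroids[c] = [sum(col) / len(bucket) for col in zip(*bucket)]
    return centroids


class ToyPQIndex:
    """Hypothetical toy index: stores only short PQ codes, keyed by vector ID."""

    def __init__(self, dim, n_subvectors, n_codes):
        assert dim % n_subvectors == 0, "dim must divide evenly into subvectors"
        self.sub_dim = dim // n_subvectors
        self.n_subvectors = n_subvectors
        self.n_codes = n_codes
        self.codebooks = []  # one list of centroids per subspace
        self.codes = {}      # vector ID -> tuple of centroid indices

    def _split(self, v):
        return [v[m * self.sub_dim:(m + 1) * self.sub_dim] for m in range(self.n_subvectors)]

    def train(self, vectors):
        # Learn an independent codebook for each subspace.
        self.codebooks = [
            kmeans([self._split(v)[m] for v in vectors], self.n_codes)
            for m in range(self.n_subvectors)
        ]

    def _encode(self, v):
        return tuple(
            min(range(self.n_codes), key=lambda c: sq_dist(sub, book[c]))
            for sub, book in zip(self._split(v), self.codebooks)
        )

    def add(self, ids, vectors):
        for i, v in zip(ids, vectors):
            self.codes[i] = self._encode(v)

    def remove(self, ids):
        for i in ids:
            self.codes.pop(i, None)

    def search(self, query, k):
        # Asymmetric distance: precompute query-to-centroid distances per subspace,
        # then score each stored code with table lookups only.
        tables = [
            [sq_dist(sub, centroid) for centroid in book]
            for sub, book in zip(self._split(query), self.codebooks)
        ]
        scored = sorted(
            (sum(tables[m][c] for m, c in enumerate(code)), i)
            for i, code in self.codes.items()
        )
        return scored[:k]


# Synthetic demo data: 20 vectors near the origin, 20 near (1, 1, 1, 1).
rng = random.Random(42)
near_zero = [[rng.gauss(0.0, 0.05) for _ in range(4)] for _ in range(20)]
near_one = [[rng.gauss(1.0, 0.05) for _ in range(4)] for _ in range(20)]
data = near_zero + near_one

index = ToyPQIndex(dim=4, n_subvectors=2, n_codes=2)
index.train(data)
index.add(range(40), data)
index.remove([0])  # vectors can be dropped by ID without retraining
results = index.search([1.0, 1.0, 1.0, 1.0], k=5)
result_ids = [i for _, i in results]
```

Because only the codes are stored, returned distances are approximations, and vectors that share a code are indistinguishable. A real library adds finer codebooks, coarse partitioning, and GPU kernels on top of this basic scheme.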
