Python nearest-neighbor-search

Open-source Python projects categorized as nearest-neighbor-search

Top 8 Python nearest-neighbor-search Projects

  1. lancedb

    Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less.

    Project mention: Meilisearch – search engine API bringing AI-powered hybrid search | news.ycombinator.com | 2025-04-14

    Try LanceDB: https://github.com/lancedb/lancedb

    It's based on the DataFusion engine, has vector indexing and BM25 indexing, and has Python and Rust bindings.

  2. docarray

    Represent, send, store and search multimodal data

  3. similarity

    TensorFlow Similarity is a Python package focused on making similarity learning quick and easy.

  4. pynndescent

    A Python nearest neighbor descent for approximate nearest neighbors

    Project mention: Spann: Highly-Efficient Billion-Scale Approximate Nearest Neighbor Search | news.ycombinator.com | 2024-11-02

    If you just want in-memory then PyNNDescent (https://github.com/lmcinnes/pynndescent) can work pretty well. It should install easily with pip, works well at the scales you mention, and supports a large number of metrics, including cosine.

  5. quaterion

    Blazing fast framework for fine-tuning similarity learning models

  6. citrus

    (distributed) vector database (by 0xDebabrata)

  7. lshashing

    Python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data

    Project mention: Show HN: Locality-Sensitive Hashing in Python for Faster Nearest Neighbor Search | news.ycombinator.com | 2024-08-15

  8. tinyknn

    A tiny approximate K-Nearest Neighbour library in Python based on Fast Product Quantization and IVF

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).
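
Several of the projects listed above describe themselves in terms of approximate techniques such as locality-sensitive hashing (LSH). As a rough illustration of that idea only, and not the API or implementation of any listed project, here is a minimal random-hyperplane LSH sketch in plain Python. All function names and parameters are hypothetical and chosen for this example.

```python
import math
import random


def make_hyperplanes(n_planes, dim, seed=0):
    """Draw random hyperplane normals from a standard Gaussian (hypothetical helper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_planes)]


def signature(vec, planes):
    """One bit per hyperplane: which side of the plane the vector falls on."""
    return tuple(
        1 if sum(p * v for p, v in zip(plane, vec)) >= 0 else 0
        for plane in planes
    )


def build_index(vectors, planes):
    """Group vector indices into buckets keyed by their bit signature."""
    buckets = {}
    for idx, vec in enumerate(vectors):
        buckets.setdefault(signature(vec, planes), []).append(idx)
    return buckets


def cosine(a, b):
    """Cosine similarity, returning 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def query(q, vectors, planes, buckets, k=1):
    """Rank only the vectors that share the query's bucket, by cosine similarity."""
    candidates = buckets.get(signature(q, planes), [])
    ranked = sorted(candidates, key=lambda i: cosine(q, vectors[i]), reverse=True)
    return ranked[:k]


# Toy data: four 3-dimensional vectors, hashed with four random hyperplanes.
vectors = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [-1.0, -1.0, 0.0]]
planes = make_hyperplanes(n_planes=4, dim=3)
buckets = build_index(vectors, planes)

# The query is a scaled copy of vectors[0], so it hashes to the same bucket.
nearest = query([2.0, 0.0, 0.0], vectors, planes, buckets)
```

Because only one bucket is searched, a true neighbor that falls on the other side of any hyperplane is missed. Practical LSH libraries usually mitigate this with multiple hash tables, which this sketch leaves out for brevity.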

Python nearest-neighbor-search related posts

  • Document Loading, Parsing, and Cleaning in AI Applications

    9 projects | dev.to | 24 Apr 2025
  • Spann: Highly-Efficient Billion-Scale Approximate Nearest Neighbor Search

    2 projects | news.ycombinator.com | 2 Nov 2024
  • vec2pg: Migrate to pgvector from Pinecone and Qdrant

    5 projects | dev.to | 3 Sep 2024
  • beginner guide to fully local RAG on entry-level machines

    5 projects | dev.to | 2 Jun 2024
  • FLaNK 04 March 2024

    26 projects | dev.to | 4 Mar 2024
  • 7 Vector Databases Every Developer Should Know!

    4 projects | dev.to | 8 Feb 2024
  • Qdrant, the Vector Search Database, raised $28M in a Series A round

    8 projects | news.ycombinator.com | 23 Jan 2024

Index

What are some of the best open-source nearest-neighbor-search projects in Python? This list will help you:

#  Project      Stars
1  lancedb      6,576
2  docarray     3,069
3  similarity   1,019
4  pynndescent    925
5  quaterion      656
6  citrus         104
7  lshashing       19
8  tinyknn         16
