usearch

Fast Open-Source Search & Clustering engine Γ— for Vectors & πŸ”œ Strings Γ— in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram πŸ” (by unum-cloud)

Usearch Alternatives

Similar projects and alternatives to usearch

  • marqo

    114 usearch VS marqo

    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

  • pgvector

    Open-source vector similarity search for Postgres

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • ann-benchmarks

    Benchmarks of approximate nearest neighbor libraries in Python

  • chroma

    32 usearch VS chroma

    the AI-native open-source embedding database

  • rust-memchr

    Optimized string search routines for Rust.

  • ustore

    Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang πŸ—„οΈ

  • StringZilla

    Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc πŸ¦–

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • rebar

    A biased barometer for gauging the relative speed of some regex engines on a curated set of tasks.

  • SimSIMD

    Up to 200x Faster Inner Products and Vector Similarity β€” for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE πŸ“

  • kuzu

    Embeddable property graph database management system built for query speed and scalability. Implements Cypher.

  • uform

    8 usearch VS uform

    Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and πŸ”œ video, up to 5x faster than OpenAI CLIP and LLaVA πŸ–ΌοΈ & πŸ–‹οΈ

  • pgrx

    13 usearch VS pgrx

    Build Postgres Extensions with Rust!

  • voy

    πŸ•ΈοΈπŸ¦€ A WASM vector similarity search written in Rust

  • faiss

    A library for efficient similarity search and clustering of dense vectors.

  • lantern

    PostgreSQL vector database extension for building AI applications

  • gollum

    Production grade LLM-ops in Golang (by stillmatic)

  • qdrant-client

    Python client for Qdrant vector search engine

  • jvector

    JVector: the most advanced embedded vector search engine

  • fann

    Approx nearest neighbor search in Rust (by fennel-ai)

  • chromem-go

    Embeddable vector database for Go with Chroma-like interface and zero third-party dependencies. In-memory with optional persistence.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better usearch alternative or higher similarity.

usearch reviews and mentions

Posts with mentions or reviews of usearch. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-12.
  • USearch SQLite Extensions for Vector and Text Search
    1 project | news.ycombinator.com | 22 Feb 2024
  • Ask HN: What is the state of art approximate k-NN search algorithm today?
    1 project | news.ycombinator.com | 17 Jan 2024
    Another worth mentioning in this thread is usearch, though not a separate algorithm, based on HNSW with a bunch of optimizations https://github.com/unum-cloud/usearch
  • Vector Databases: A Technical Primer [pdf]
    7 projects | news.ycombinator.com | 12 Jan 2024
    I've used usearch successfully for a small project: https://github.com/unum-cloud/usearch/
  • 90x Faster Than Pgvector – Lantern's HNSW Index Creation Time
    7 projects | news.ycombinator.com | 2 Jan 2024
  • Python, C, Assembly – Faster Cosine Similarity
    5 projects | news.ycombinator.com | 18 Dec 2023
    The hardest (still missing) part of efficient cosine computation distance computation is picking a good epsilon for the `sqrt` calculation and avoiding "division by zero" problems.

    We have an open issue about it in USearch and a related one in SimSIMD itself, so if you have any suggestions, please share your insights - they would impact millions of devices using the library (directly on servers and mobile, and through projects like ClickHouse and some of the Google repos): https://github.com/unum-cloud/usearch/issues/320

  • Show HN: I scraped 25M Shopify products to build a search engine
    4 projects | news.ycombinator.com | 13 Dec 2023
    As you scale, you may benefit from these two projects I maintain, and the Big Tech uses :)

    https://github.com/unum-cloud/usearch - for faster search

    https://github.com/unum-cloud/uform - for cheaper multi-lingual multi-modal embeddings

  • [P] unum-cloud/usearch: Fastest Open-Source Similarity Search engine for Vectors in Python, JavaScript, C++, C, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram πŸ”
    1 project | /r/MachineLearning | 28 Nov 2023
  • USearch: SIMD-accelerated Vector Search Structure for 10 Programming Languages
    1 project | /r/programming | 11 Sep 2023
  • Stringzilla: Fastest string sort, search, split, and shuffle using SIMD
    9 projects | news.ycombinator.com | 29 Aug 2023
    > It doesn't appear to query CPUID

    Yes, I'm actually looking for a good way to do it for other projects as well. I've looked into a couple more libs, and here is the best I've come up with so far: https://github.com/unum-cloud/usearch/blob/f942b6f334b31716f...

    > Your substring routines have multiplicative worst case

    Yes, that is true. It's a very simple stupid trick, just happens to work well for me :)

    > It seems quite likely that your confirmation step

    We have a different library internally at Unum, that avoids this shortcoming. It has a few thousand lines of C++ templates with SIMD intrinsics... and it's definitely more efficient, but the margins aren't always high. So I kept the pure C version with inlined functions as minimal and simple as possible.

    > It would actually be possible to hook Stringzilla up to `memchr`'s benchmark suite if you were interested. :-)

    Yes, that would be a fun thing to do! I haven't had time to look into `memchr` yet, but would expect great perf from your lib as well. For me the State of the Art is Intel HyperScan. Probably the most advanced SIMD library overall, not just for strings. I was very impressed with their perf ~5 years ago. But the repo is 200 K LOC... So get ready to invest a weekend :)

    That said, I'm a bit slammed with work right now, including open-source. Hoping to ship a new major release in UCall this week, and a minor one in USearch :)

  • Unum: Vector Search engine in a single file
    8 projects | news.ycombinator.com | 31 Jul 2023
    We don't use BLAS. Why? BLAS helps with matrix-matrix multiplications, if you feel lazy and don't want to write the matrix tiling code manually.

    They bring essentially nothing of value in vector-vector operations, as compilers can properly auto-vectorize simple dot products... Moreover, they generally only target single and double precision, while we often prefer half or quarter precision. All in all, meaningless dependency.

    What do we use? I wrote a tiny package called SimSIMD. It's idea is to utilize less common SIMD instructions, especially in mixed-typed computations, that are hard for compilers to optimize. It was also a fun exercise to evaluate the performance of new SVE instruction on recent Arm CPUs, like the Graviton 3. You can find the code, the benchmarks, and the results in the repo: https://github.com/ashvardanian/simsimd

    Still, even without SimSIMD, USearch seems to be one of the faster implementations of vector search. You can find the benchmarks in the first table here: https://github.com/unum-cloud/usearch#memory-efficiency-down...

  • A note from our sponsor - SaaSHub
    www.saashub.com | 28 Apr 2024
    SaaSHub helps you find the best software and product alternatives Learn more β†’

Stats

Basic usearch repo stats
20
1,629
9.8
5 days ago

unum-cloud/usearch is an open source project licensed under Apache License 2.0 which is an OSI approved license.

The primary programming language of usearch is C++.


Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com