fast_vector_similarity vs rust-ndarray

fast_vector_similarity

The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors. (by Dicklesworthstone)

Suggest topics

Source Code

pypi.org

Suggest alternative

Edit details

rust-ndarray

ndarray: an N-dimensional array with array views, multidimensional slicing, and efficient operations (by rust-ndarray)

Data processing Rust Numerics Scientific Computing rust-sci

Source Code

docs.rs

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

fast_vector_similarity		rust-ndarray
	Project
7	Mentions	20
323	Stars	3,349
-	Growth	2.8%
7.2	Activity	8.2
9 months ago	Latest Commit	5 days ago
Rust	Language	Rust
-	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

fast_vector_similarity

Posts with mentions or reviews of fast_vector_similarity. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-07.

SentenceTransformers: Python framework for sentence, text and image embeddings
2 projects | news.ycombinator.com | 7 Apr 2024

Yes, check out my library for vector similarity that has various other measures which are more discriminative:
https://github.com/Dicklesworthstone/fast_vector_similarity
pip install fast_vector_similarity
Show HN: Neum AI – Open-source large-scale RAG framework
3 projects | news.ycombinator.com | 21 Nov 2023

Got it. I'd encourage you to expose more of that functionality at the level of your application if possible. I think there is a lot of potential in using more than just cosine similarity, especially when there are lots of candidates and you really want to sharpen up the top few recommendations to the best ones. You might find this open-source library I made recently useful for that:
https://github.com/Dicklesworthstone/fast_vector_similarity
I've had good results from starting with cosine similarity (using FAISS) and then "enriching" the top results from that with more sophisticated measures of similarity from my library to get the final ranking.
Some Reasons to Avoid Cython
5 projects | news.ycombinator.com | 22 Sep 2023

You can see how I did something similar in my library here:
https://github.com/Dicklesworthstone/fast_vector_similarity/...
Basically you use ndarray instead of numpy, try to vectorize anything you can, and for the for loops that can’t be vectorized, you can use rayon to do them in parallel.
FLaNK Stack Weekly 28 August 2023
27 projects | dev.to | 28 Aug 2023
Fast Vector Similarity Library, Useful for Working With Llama2 Embedding Vectors
1 project | /r/LocalLLaMA | 25 Aug 2023
Show HN: Fast Vector Similarity Using Rust and Python
8 projects | news.ycombinator.com | 23 Aug 2023

Yeah, like the other commenter said, everything is in this file here:
https://github.com/Dicklesworthstone/fast_vector_similarity/...
If you also make your project using Rust and Maturin, you can literally just copy and paste that into your project because it's totally generic, and if the repo is public, GitHub will just run it all for you for free.
The only thing is you need to create an account on PyPi (pip) and add 2-Factor Auth so you can generate an API key. Then you go into the repo settings and go to secrets, and create a Github Actions secret with the name PYPI_API_TOKEN and make the value your PyPi token. That's it! It will not only compile all the wheels for you but even upload the project to PyPi for you using the settings found in your pyproject.toml file, like this:
https://github.com/Dicklesworthstone/fast_vector_similarity/...

rust-ndarray

Posts with mentions or reviews of rust-ndarray. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-22.

Some Reasons to Avoid Cython
5 projects | news.ycombinator.com | 22 Sep 2023

I would love some examples of how to do non-trivial data interop between Rust and Python. My experience is that PyO3/Maturin is excellent when converting between simple datatypes but conversions get difficult when there are non-standard types, e.g. Python Numpy arrays or Rust ndarrays or whatever other custom thing.
Polars seems to have a good model where it uses the Arrow in memory format, which has implementations in Python and Rust, and makes a lot of the ndarray stuff easier. However, if the Rust libraries are not written with Arrow first, they become quite hard to work with. For example, there are many libraries written with https://github.com/rust-ndarray/ndarray, which is challenging to interop with Numpy.
(I am not an expert at all, please correct me if my characterizations are wrong!)
Helper crate for working with image data of varying type?
1 project | /r/rust | 29 May 2023

Thanks for sharing. I read this issue on why ndarray does not have a dynamically typed array: https://github.com/rust-ndarray/ndarray/issues/651
What is the most efficient way to study Rust for scientific computing applications?
1 project | /r/rust | 23 May 2023

You can get involved with the ndarray project
faer 0.8.0 release
6 projects | /r/rust | 21 Apr 2023

Sadly Ndarray does look a little abandoned to me: https://github.com/rust-ndarray/ndarray
Status and Future of ndarray?
2 projects | /r/rust | 3 Apr 2023

The date of the last commit of [ndarray](https://github.com/rust-ndarray/ndarray) lies 6 month in the past while many recent issues are open and untouched.
How does explicit unrolling differ from iterating through elements one-by-one? (ndarray example)
1 project | /r/rust | 13 Jan 2023

While looking through ndarrays src, I came across a set of functions that explicitly unroll 8 variables on each iteration of a loop, with the comment eightfold unrolled so that floating point can be vectorized (even with strict floating point accuracy semantics). I don't understand why floats would be affected by unrolling, and in general I'm confused as to how explicit unrolling differs from iterating through each element one by one. I assumed this would be a scenario where the compiler would optimize best anyway, which seems to be confirmed (at least in the context of using iter() rather than for) here. Could anyone give a little context into what this, or any explicit unrolling achieves?
Announcing Burn: New Deep Learning framework with CPU & GPU support using the newly stabilized GAT feature
7 projects | /r/rust | 6 Nov 2022

Burn is different: it is built around the Backend trait which encapsulates tensor primitives. Even the reverse mode automatic differentiation is just a backend that wraps another one using the decorator pattern. The goal is to make it very easy to create optimized backends and support different devices and use cases. For now, there are only 3 backends: NdArray (https://github.com/rust-ndarray/ndarray) for a pure rust solution, Tch (https://github.com/LaurentMazare/tch-rs) for an easy access to CUDA and cuDNN optimized operations and the ADBackendDecorator making any backend differentiable. I am now refactoring the internal backend API to make it as easy as possible to plug in new ones.
Pure rust implementation for deep learning models
3 projects | /r/rust | 9 Oct 2022

Looks like it's an open request
The Illustrated Stable Diffusion
3 projects | news.ycombinator.com | 4 Oct 2022

https://github.com/rust-ndarray/ndarray/issues/281
Answer: you can’t with this crate. I implemented a dynamic n-dim solution myself but it uses views of integer indices that get copied to a new array, which have indexes to another flattened array in order to avoid duplication of possibly massive amounts of n-dimensional data; using the crate alone, copying all the array data would be unavoidable.
Ultimately I’ve had to make my own axis shifting and windowing mechanisms. But the crate is still a useful lib and continuing effort.
While I don’t mind getting into the weeds, these kinds of side efforts can really impact context focus so it’s just something to be aware of.
Any efficient way of splitting vector?
2 projects | /r/rust | 12 Sep 2022

In principle you're trying to convert between columnar and row-based data layouts, something that happens fairly often in data science. I bet there's some hyper-efficient SIMD magic that could be invoked for these slicing operations (and maybe the iterator solution does exactly that). Might be worth taking a look at how the relevant Rust libraries like ndarray do it.

What are some alternatives?

When comparing fast_vector_similarity and rust-ndarray you can also consider the following projects:

simsimd

nalgebra - Linear algebra library for Rust.

swiss_army_llama - A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Rust-CUDA - Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

np-sims - numpy ufuncs for vector similarity

image - Encoding and decoding images in Rust

QTVR - Tools for QTVR 1 files

neuronika - Tensors and dynamic neural networks in pure Rust.

llama_embeddings_fastap

utah - Dataframe structure and operations in Rust

DoctorGPT - 💻📚💡 DoctorGPT provides advanced LLM prompting for PDFs and webpages.

linfa - A Rust machine learning framework.

Compare fast_vector_similarity vs rust-ndarray and see what are their differences.

fast_vector_similarity

rust-ndarray

fast_vector_similarity

rust-ndarray

What are some alternatives?