rust-bert VS rust-numpy

Compare rust-bert vs rust-numpy and see what their differences are.

                    rust-bert              rust-numpy
Mentions            7                      10
Stars               2,427                  1,019
Growth              -                      2.1%
Activity            6.8                    8.0
Latest commit       about 2 months ago     17 days ago
Language            Rust                   Rust
License             Apache License 2.0     BSD 2-clause "Simplified" License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

rust-bert

Posts with mentions or reviews of rust-bert. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-07.
  • How to leverage the state-of-the-art NLP models in Rust
    3 projects | /r/infinilabs | 7 Jun 2023
    brew install libtorch
    brew link libtorch
    brew ls --verbose libtorch | grep dylib
    export LIBTORCH=$(brew --cellar pytorch)/$(brew info --json pytorch | jq -r '.[0].installed[0].version')
    export LD_LIBRARY_PATH=${LIBTORCH}/lib:$LD_LIBRARY_PATH
    git clone https://github.com/guillaume-be/rust-bert.git
    cd rust-bert
    ORT_STRATEGY=system cargo run --example sentence_embeddings
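
    For context, a minimal sketch of roughly what that sentence_embeddings example boils down to, assuming a 2023-era rust-bert release and the anyhow crate for error handling (exact pipeline types can shift between versions):

        use rust_bert::pipelines::sentence_embeddings::{
            SentenceEmbeddingsBuilder, SentenceEmbeddingsModelType,
        };

        fn main() -> anyhow::Result<()> {
            // Downloads the model weights on first use; needs the libtorch setup above.
            let model = SentenceEmbeddingsBuilder::remote(SentenceEmbeddingsModelType::AllMiniLmL12V2)
                .create_model()?;

            let sentences = ["This is an example sentence", "Each sentence is converted"];
            let embeddings = model.encode(&sentences)?;
            println!("{} vectors of dimension {}", embeddings.len(), embeddings[0].len());
            Ok(())
        }
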
  • Transformers.js
    9 projects | news.ycombinator.com | 16 Mar 2023
    I'd like to use this transformer model in Rust (because it's on the backend, because I can do the data munging there and it will be faster, and for other reasons). It looks like a good model! But it doesn't compile on Apple Silicon because of weird linking issues that aren't apparent - https://github.com/guillaume-be/rust-bert/issues/338. I've spent a large part of today and yesterday attempting to find out why. The only other library that I've found for doing this kind of thing programmatically (particularly sentiment analysis) is this (https://github.com/JohnSnowLabs/spark-nlp). Some of the models look a little older, which is OK, but it does mean that I'd have to do this in another language.

    Does anyone know of any sentiment analysis software that can be tuned (other than VADER - I'm looking for more along the lines of a transformer model like BERT) but is pretrained and can be used in Rust or Python? Otherwise I'll probably end up using spark-nlp and having to spin up another process.

    Thanks.
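
    On the sentiment-analysis question: rust-bert does ship a sentiment pipeline (by default a DistilBERT model fine-tuned on SST-2). A minimal sketch, assuming a 2023-era release, a working libtorch install, and anyhow for error handling:

        use rust_bert::pipelines::sentiment::SentimentModel;

        fn main() -> anyhow::Result<()> {
            // The default config downloads a DistilBERT SST-2 checkpoint on first run.
            let model = SentimentModel::new(Default::default())?;

            let input = ["The linking story on Apple Silicon is rough, but the library itself works well."];
            let sentiments = model.predict(&input);
            // Each result carries a Positive/Negative polarity plus a confidence score.
            println!("{:?}", sentiments);
            Ok(())
        }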

  • Running large language models like ChatGPT on a single GPU
    7 projects | news.ycombinator.com | 20 Feb 2023
    Give this a look: https://github.com/guillaume-be/rust-bert

    If you have Pytorch configured correctly, this should "just work" for a lot of the smaller models. It won't be a 1:1 ChatGPT replacement, but you can build some pretty cool stuff with it.

    > it's basically Python or bust in this space

    More or less, but that doesn't have to be a bad thing. If you're on Apple Silicon, you have plenty of performance headroom to deploy Python code for this. I've gotten this library to work on systems with as little as 2gb of memory, so outside of ultra-low-end use cases, you should be fine.
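
    For a sense of what "just work" looks like, a minimal text-generation sketch with rust-bert's pipeline API, assuming a 2023-era release (the default config pulls a small GPT-2 checkpoint; newer versions return a Result from generate that needs an extra ?):

        use rust_bert::pipelines::text_generation::TextGenerationModel;

        fn main() -> anyhow::Result<()> {
            // Default configuration downloads a small GPT-2 model; requires libtorch.
            let model = TextGenerationModel::new(Default::default())?;

            let output = model.generate(&["Rust is a good fit for ML inference because"], None);
            for sentence in output {
                println!("{sentence}");
            }
            Ok(())
        }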

  • Self-hosted Whisper-based voice recognition server for open Android phones
    2 projects | news.ycombinator.com | 13 Feb 2023
    I suspect something similar is possible with ChatGPT. Using the GPT-neo-125m model I've been able to get some really convincing (if lackluster) answers on 4 core ARM hardware and less than 2gb of memory. With enough sampling, you can get legible paragraph-length responses out in less than 10 seconds; that's pretty good for an offline program in my book.

    I'm using rust-bert to serve it over a Discord bot, similar to one of their examples[0]. It's running on Oracle VCPUs right now, but with dedicated hardware and ML acceleration I can imagine the field moving really quickly.

    [0] https://github.com/guillaume-be/rust-bert/blob/master/exampl...

  • Ask HN: What AI developer tools do you wish you'd discovered sooner?
    2 projects | news.ycombinator.com | 12 Feb 2023
    Maybe a little played-out, but I've been having a blast with the rust-bert library this weekend: https://github.com/guillaume-be/rust-bert

    With a little finagling, you can get the GPT-Neo-1.3b model running on those free Oracle ARM VMs you can provision. I'm impressed, especially with the performance of the smallest model, which uses less than a gig of memory.

  • Ask HN: Has anyone made a toy that integrates ChatGPT with voice into a toy?
    2 projects | news.ycombinator.com | 9 Feb 2023
    Nope, but it's probably possible on a smaller, hobbyist scale. I've been playing with a few GPT libraries this week (namely rust-bert[0]) and I've been really impressed with local generation results on my crappy 2-core netbook. I can get 2 sentences to generate in ~5 seconds, which is pretty good in my book.

    Armed with a Pi-style SBC and your AI library of choice, I bet you could get pretty far implementing some stuff. Bonus points if you use Whisper for speech-to-text, and double brownie points if you can get an AI voice to read the generation back.

    [0] https://github.com/guillaume-be/rust-bert/tree/master/exampl...

  • [D] Is Rust stable/mature enough to be used for production ML? Is making Rust-based python wrappers a good choice for performance heavy uses and internal ML dependencies in 2021?
    8 projects | /r/MachineLearning | 30 Dec 2021
    If you are using BERT models and some miscellaneous other related stuff then you should check out the rust-bert and Bert Sentence repos https://github.com/guillaume-be/rust-bert

rust-numpy

Posts with mentions or reviews of rust-numpy. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-12-27.
  • Numba: A High Performance Python Compiler
    11 projects | news.ycombinator.com | 27 Dec 2022
    On the contrary, it can use and interface with numpy quite easily: https://github.com/PyO3/rust-numpy
  • Carefully exploring Rust as a Python developer
    9 projects | news.ycombinator.com | 13 Nov 2022
  • Hmm
    13 projects | /r/ProgrammerHumor | 11 Aug 2022
    Once I figured out the right tools, it was easy. It's just "maturin new". It automatically converts Python floats and strings. NumPy arrays come through as a special PyArray type that you need to unwrap, but that's just one built-in function. Using pyo3, maturin and numpy, https://github.com/PyO3/rust-numpy it's fairly easy.
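
    To make the "unwrap the PyArray" step concrete, here is a minimal sketch of a pyo3 + rust-numpy extension function. It assumes the pre-Bound pyo3 API (the 0.19/0.20 era; later releases wrap these types in Bound<'py, ...>), and the function and module names (double, my_ext) are made up for illustration:

        use numpy::{IntoPyArray, PyArray1, PyReadonlyArray1};
        use pyo3::prelude::*;

        /// Takes a 1-D float64 NumPy array, doubles every element, returns a new array.
        #[pyfunction]
        fn double<'py>(py: Python<'py>, x: PyReadonlyArray1<'py, f64>) -> &'py PyArray1<f64> {
            let view = x.as_array();                  // zero-copy view as ndarray::ArrayView1<f64>
            view.mapv(|v| v * 2.0).into_pyarray(py)   // compute into a new array, hand it back to Python
        }

        #[pymodule]
        fn my_ext(_py: Python<'_>, m: &PyModule) -> PyResult<()> {
            m.add_function(wrap_pyfunction!(double, m)?)?;
            Ok(())
        }

    Build it with "maturin develop" and you can call my_ext.double(np.arange(3.0)) from Python.
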
  • Man, I love this language.
    9 projects | /r/rust | 18 Feb 2022
    If I'm understanding this documentation correctly then you may be able to pass the numpy array directly with func(df['col'].to_numpy()), which may save some conversion.
  • [D] Is Rust stable/mature enough to be used for production ML? Is making Rust-based python wrappers a good choice for performance heavy uses and internal ML dependencies in 2021?
    8 projects | /r/MachineLearning | 30 Dec 2021
    Otherwise, though, Rust is an excellent choice. The many advantages of Rust (great package manager, memory safety, modern language features, ...) are already well documented so I won't repeat them here. Specifically for writing Python libraries, check out PyO3, maturin, and rust-numpy, which allow for seamless integration with the Python scientific computing ecosystem. Dockerizing/packaging is a non-issue, with the aforementioned libraries you can easily publish Rust libraries as pip packages or compile them from source as part of your docker build. We have several successful production deployments of Rust code at OpenAI, and I have personally found it to be a joy to work with.
  • Writing Rust libraries for the Python scientific computing ecosystem
    12 projects | /r/rust | 19 Dec 2021
    Integration with numpy uses the rust-numpy crate: there's an example of a method that accepts numpy arrays as arguments, and an example of a method that returns a numpy array to Python (this performs a copy; there ought to be a way to avoid it, but the current implementation has been plenty fast for my use case so far).
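
    On the copy mentioned above: in rust-numpy the distinction is between ToPyArray::to_pyarray, which copies into a fresh NumPy allocation, and IntoPyArray::into_pyarray, which consumes the owned ndarray and hands over its buffer (avoiding the copy for standard-layout arrays). A hedged sketch, again against the pre-Bound pyo3 API, with an illustrative function name:

        use numpy::ndarray::Array1;
        use numpy::{IntoPyArray, PyArray1};
        use pyo3::prelude::*;

        /// Builds a vector in Rust and returns it to Python as a NumPy array
        /// without copying: `into_pyarray` transfers ownership of the buffer.
        #[pyfunction]
        fn evenly_spaced<'py>(py: Python<'py>, n: usize) -> &'py PyArray1<f64> {
            let v: Array1<f64> = Array1::linspace(0.0, 1.0, n);
            v.into_pyarray(py) // `v.to_pyarray(py)` would copy instead
        }
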
  • Feasibility of Using a Python Image Super Resolution Library in My Rust App
    3 projects | /r/rust | 19 Apr 2021
    This example may be helpful.
  • Julia is the better language for extending Python
    13 projects | news.ycombinator.com | 19 Apr 2021
    Given that it's via PyO3, you could even pass the numpy arrays using https://github.com/PyO3/rust-numpy and get ndarrays on the other side.

    Same no-copy, slightly more user-friendly approach.

    Further criticism of the actual approach - even if we didn't do zero copy, there's no preallocation for the vector despite the size being known upfront, and nested vectors are very slow by default.

    So you could speed up the entire thing by passing it to ndarray, and then running a single call to sum over the 2D array you'd find at the other end. (https://docs.rs/ndarray/0.15.1/ndarray/struct.ArrayBase.html...)
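
    A sketch of that suggestion, under the same assumptions as the earlier snippets (pre-Bound pyo3 API, hypothetical function name): borrow the 2-D array as an ndarray view, no copy and no nested vectors, and reduce it with one sum() call.

        use numpy::PyReadonlyArray2;
        use pyo3::prelude::*;

        /// Sums a 2-D float64 NumPy array by borrowing it as an ndarray view:
        /// no copy, no nested Python lists, one reduction on the Rust side.
        #[pyfunction]
        fn sum_2d<'py>(_py: Python<'py>, x: PyReadonlyArray2<'py, f64>) -> f64 {
            x.as_array().sum()
        }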

  • Parsing PDF Documents in Rust
    1 project | /r/rust | 31 Jan 2021
    I believe converting between pandas Series (e.g. columns) and numpy ndarrays can be pretty cheap, right? Once they're in that format, you can use rust to work directly on the numpy memory buffer with rust-numpy. Otherwise, feather is a format designed for IPC of columnar data; pyarrow is in pandas (might be an optional dependency) and may be pretty quick for that, and rust has an arrow implementation too.
  • PyO3: Rust Bindings for the Python Interpreter
    18 projects | news.ycombinator.com | 29 Jan 2021
    https://github.com/PyO3/rust-numpy

What are some alternatives?

When comparing rust-bert and rust-numpy you can also consider the following projects:

Dlib - A toolkit for making real world machine learning and data analysis applications in C++

RustPython - A Python Interpreter written in Rust

speak - Talk with your machine in this minimalistic Rust crate!

julia - The Julia Programming Language

FlexGen - Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput generation. [Moved to: https://github.com/FMInference/FlexGen]

polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust

are-we-learning-yet - How ready is Rust for Machine Learning?

rayon - Rayon: A data parallelism library for Rust

ggml - Tensor library for machine learning

image-super-resolution - 🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

lightseq - LightSeq: A High Performance Library for Sequence Processing and Generation

PyO3 - Rust bindings for the Python interpreter