Top 23 Rust Machine Learning Projects
-
qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Project mention: The Backend Shift: Leveraging Open Source Powerhouses for Faster, Leaner Apps | dev.to | 2025-05-10
Weaviate (weaviate.io) / Qdrant (qdrant.tech):
-
burn
Burn is a next generation Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.
I intend to grow this into an open-source project because deep inside, this is ideally how I would like ComfyUI to be. There's still a long journey ahead for building all the custom nodes, which is especially challenging given that the majority of code for AI workflows is written in Python. However, with my hands-on experience with Candle and Burn libraries, I may be able to get pretty close!
-
Project mention: Postgres Learns to RAG: Wikipedia Q&A using Llama 3.1 inside the database | news.ycombinator.com | 2024-09-24
GitHub: https://github.com/postgresml/postgresml
Looking forward to your feedback and any questions about the technical details.
-
lance
Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, and PyTorch with more integrations coming.
Project mention: ⚡🦀 Deploy a blazing-fast & Lightweight LLM app with Rust-Rig-LanceDB | dev.to | 2024-11-22
Lance is an open-source columnar data format designed for performant ML workloads.
-
Project mention: Rust Linfa: The Rising Star of Machine Learning in Systems Programming | dev.to | 2024-12-23
Linfa is a modular approach to machine learning in Rust, offering a collection of statistical learning algorithms and tools. Unlike monolithic frameworks, Linfa follows Rust's philosophy of small, focused crates that can be composed together.
-
tensorzero
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
Project mention: Ask HN: Freelancer? Seeking freelancer? (April 2025) | news.ycombinator.com | 2025-04-01
SEEKING FREELANCER
TensorZero | https://github.com/tensorzero/tensorzero | Staff Front-end / Design Engineer | Remote or Onsite (NYC) | Full-time or Part-time
TensorZero creates a feedback loop for optimizing LLM applications — turning production data into smarter, faster, and cheaper models.
We're looking for a contract / freelance Staff Front-end / Design Engineer with the following skillset:
‣ Must have: expert in TypeScript, React, and web fundamentals
‣ Nice to have: familiar with LLMs, experience with Vite / React Router V7 (RemixJS) / Tailwind
What we offer:
‣ Vast majority of your work → open source
‣ Flexible arrangement: remote or onsite (NYC), full-time or part-time
‣ Small and entirely technical team: former Rust compiler maintainer, ML researchers with 1000's of citations, decacorn CPO
‣ Engagement expected to last a few months
‣ Compensation in line with staff+ experience
Also hiring full-time employees: https://news.ycombinator.com/item?id=43569646
Apply: [email protected]
-
rust-bert
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
-
Project mention: Lessons Learned from Scaling to Multi-Terabyte Datasets | news.ycombinator.com | 2024-06-20
https://github.com/Eventual-Inc/Daft is also great at these types of workloads since it’s both distributed and vectorized!
-
hora
🚀 efficient approximate nearest neighbor search algorithm collections library written in Rust 🦀 .
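For context on what "approximate nearest neighbor search" is approximating, here is a minimal sketch of the exact, brute-force baseline in plain Rust. It does not use hora's or Qdrant's API; the function names `squared_distance` and `brute_force_knn` are hypothetical and chosen only for illustration. It assumes all vectors have the same length and uses squared Euclidean distance.

```rust
/// Squared Euclidean distance between two equal-length vectors.
/// (Assumption: callers pass vectors of the same dimension.)
fn squared_distance(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| (x - y) * (x - y)).sum()
}

/// Exact k-nearest-neighbour search by scanning every stored vector.
/// Returns the indices of the `k` closest points, nearest first.
fn brute_force_knn(points: &[Vec<f32>], query: &[f32], k: usize) -> Vec<usize> {
    // Score every point against the query: O(n * d) work per query.
    let mut scored: Vec<(usize, f32)> = points
        .iter()
        .enumerate()
        .map(|(i, p)| (i, squared_distance(p, query)))
        .collect();
    // total_cmp is a total order on f32, so a NaN cannot panic the sort.
    scored.sort_by(|a, b| a.1.total_cmp(&b.1));
    scored.into_iter().take(k).map(|(i, _)| i).collect()
}
```

Calling `brute_force_knn(&points, &query, 2)` returns the indices of the two stored vectors closest to `query`. The full scan costs time proportional to the number of stored vectors on every query, which is the cost that approximate indexes aim to avoid by trading some recall for speed.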
-
spiceai
A portable accelerated data query and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.
Project mention: Show HN: TextQuery – Query CSV, JSON, XLSX Files with SQL | news.ycombinator.com | 2025-05-05
-
Hey Rustaceans and quant enthusiasts,
I’d like to introduce my library, stochastic-rs: https://github.com/dancixx/stochastic-rs, which focuses on the simulation and analysis of stochastic processes. This project started as part of my PhD work, and it covers topics like Malliavin calculus, rough fractional stochastic processes, and high-performance data generation for AI training in market finance. While the initial focus was around market finance applications, my long-term goal is to build a more general quant library in Rust.
There is already an excellent library, RustQuant: https://github.com/avhz/RustQuant. You’ll notice some overlap in the quant-focused areas, but stochastic-rs introduces some features that haven’t been covered there (or only partially). So I would say this is not “yet another quant library”; it just has a different focus.
This is at an early stage, but I’m always eager for feedback, whether it’s on potential improvements, use cases, or contributions! Feel free to check it out and let me know what you think.
-
extractous
Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.
Project mention: Ask HN: What is the best method for turning a scanned book as a PDF into text? | news.ycombinator.com | 2025-02-16
Seeing blind recommendations for AI slop is very disappointing for HN.
For OP, there is a library written in Rust that can do exactly what you need, with very high accuracy and performance.
https://github.com/yobix-ai/extractous
You would need the OCR dependencies to get it to work on scanned books [2]
[1] https://github.com/yobix-ai/extractous
[2] https://github.com/yobix-ai/extractous?tab=readme-ov-file#-s...
-
Project mention: Running Durable Workflows in Postgres Using DBOS | news.ycombinator.com | 2024-12-10
Great points. Besides performance, centralized coordination and distributed dataplane is better for operability of schedulers as well. Some examples - Being able to roll out new features in the scheduler, tracing scheduling behavior and decisions, deploying configuration changes.
Even with a centralized scheduler it should be possible to create a DevEx that makes use of decorators to author workflows easily.
We are doing that with Indexify (https://github.com/tensorlakeai/indexify) for authoring data-intensive workflows to process unstructured data (documents, videos, etc.). It’s like Spark but uses Python instead of Scala/SQL/UDFs.
-
Project mention: Show HN: MusicGPT – An Open Source App for Generating Music with Local LLMs | news.ycombinator.com | 2024-05-23
Rust Machine Learning discussion
Rust Machine Learning related posts
-
Conduit: A UI-less node-based system
-
Burn v0.17: Deep Learning in Rust gets new back ends and improved kernel fusion
-
Ask HN: Who is hiring? (April 2025)
-
Feature Factory: A Feature Engineering Library for Rust (Built on Apache DataFusion) 🦀
-
LangChain: From Chains to Threads
-
Show HN: Spice.ai OSS 1.0 – data query and AI-inference engine built in Rust
-
Show HN: Pure Rust Implementation of GPT
-
Index
What are some of the best open-source Machine Learning projects in Rust? This list will help you:
# | Project | Stars |
---|---|---|
1 | qdrant | 23,471 |
2 | burn | 11,066 |
3 | postgresml | 6,269 |
4 | leaf | 5,553 |
5 | rust | 5,335 |
6 | tch-rs | 4,767 |
7 | lance | 4,591 |
8 | linfa | 4,123 |
9 | tensorzero | 4,065 |
10 | rust-bert | 2,836 |
11 | Daft | 2,826 |
12 | hora | 2,638 |
13 | spiceai | 2,400 |
14 | dfdx | 1,808 |
15 | ocrs | 1,502 |
16 | RustQuant | 1,376 |
17 | ort | 1,303 |
18 | juice | 1,119 |
19 | extractous | 1,097 |
20 | indexify | 994 |
21 | MusicGPT | 991 |
22 | motorhead | 878 |
23 | femtoGPT | 871 |