Awesome-Rust-MachineLearning
datafusion
| | Awesome-Rust-MachineLearning | datafusion |
|---|---|---|
| Mentions | 5 | 71 |
| Stars | 2,244 | 8,850 |
| Growth | 0.0% | 2.3% |
| Activity | 0.0 | 10.0 |
| Latest commit | over 2 years ago | 2 days ago |
| Language | JavaScript | Rust |
| License | MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Awesome-Rust-MachineLearning
-
Performance critical ML: How viable is Rust as an alternative to C++
There’s an awesome list on GitHub for a bunch of ML Rust stuff (not mine); I'm not sure how up to date it is: https://github.com/vaaaaanquish/Awesome-Rust-MachineLearning
-
Machine Learning Inference Server in Rust?
I am looking for something like [Triton Inference Server](https://github.com/triton-inference-server/server) or [TFX Serving](https://www.tensorflow.org/tfx/guide/serving), but in Rust. I came across [Orkhon](https://github.com/vertexclique/orkhon), which seems to be dormant, and a bunch of examples from the [Awesome-Rust-MachineLearning](https://github.com/vaaaaanquish/Awesome-Rust-MachineLearning) list.
-
Any role that Rust could have in the Data world (Big Data, Data Science, Machine learning, etc.)?
There's also https://github.com/vaaaaanquish/Awesome-Rust-MachineLearning
- I wanted to share my experience of Rust as a deep learning researcher
-
Awesome Rewrite It In Rust - A curated list of replacements for existing software written in Rust
I saw Awesome-Rust-MachineLearning. It has some replacements for Python tools.
datafusion
- Show HN: DAGraph – local-first reactive graph from arithmetic to OLAP SQL
-
Ask HN: What Are You Working On? (May 2026)
I am working on DAGraph (https://dagraph.com)
DAGraph is a local-first reactive DAG for analytical SQL (OLAP), running entirely in the browser (there is also a native version in the works).
Some tech details: written in Rust, targeting WASM (and native). The SQL engine is Apache DataFusion[1]. The UI uses Egui[2]. Workspace data is persisted in browser using OPFS[3] via OpenDAL[4]. The graph is functional and handles dependencies for you (via parsing the SQL).
Building this to be accessible for beginners while remaining powerful for advanced users. Still very early, lots more features to add, but now usable.
Hope you find it interesting!
[1] https://datafusion.apache.org
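The idea of deriving graph dependencies by parsing SQL can be illustrated with a small sketch. This is not DAGraph's implementation: the node names and queries are hypothetical, table references are found with a simple regular expression rather than a real SQL parser, and ordering uses Python's standard `graphlib` (Python 3.9+).

```python
import re
from graphlib import TopologicalSorter

# Hypothetical nodes (name -> SQL text), made up for illustration only.
NODES = {
    "orders_clean": "SELECT * FROM raw_orders WHERE amount > 0",
    "daily_totals": "SELECT day, SUM(amount) AS total FROM orders_clean GROUP BY day",
    "report": "SELECT d.day, d.total, c.region "
              "FROM daily_totals d JOIN customers c ON c.day = d.day",
}

# Assumption: a table reference is the identifier right after FROM or JOIN.
# A real engine would walk the parsed SQL syntax tree instead.
TABLE_REF = re.compile(r"\b(?:FROM|JOIN)\s+([A-Za-z_][A-Za-z0-9_]*)", re.IGNORECASE)


def dependencies(sql, known):
    """Return the referenced tables that are themselves nodes in the graph."""
    return {name for name in TABLE_REF.findall(sql) if name in known}


def evaluation_order(nodes):
    """Order nodes so that each one is evaluated after the nodes it reads from."""
    graph = {name: dependencies(sql, nodes) for name, sql in nodes.items()}
    return list(TopologicalSorter(graph).static_order())
```

Calling `evaluation_order(NODES)` places `orders_clean` before `daily_totals` and `daily_totals` before `report`. External tables such as `raw_orders` and `customers` are ignored because they are not nodes in the graph.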
-
Show HN: I built an LSM storage engine from scratch in Rust
Yes! This is a great idea. Thanks for mentioning it! GlueSQL could be another cool target too (https://github.com/gluesql/gluesql). I think there's a fun exploration in taking a storage engine and seeing if it's compatible with different SQL layers — GlueSQL as a simpler starting point, DataFusion as the more complete option (https://github.com/apache/datafusion). Plugging into a real SQL engine seems like a great strategy for uncovering bugs, fixing correctness issues, and discovering what use cases the storage layer actually needs to support. It's also interesting how certain SQL engines and storage engines tend to align well with each other — the right pairing can unlock a lot. Definitely a direction I want to explore.
- Apache DataFusion SQL Query Engine
-
The Equality Delete Problem in Apache Iceberg
Large numbers of Equality Delete files can cause significant read amplification, as query engines must scan both data files and delete files and perform value matching. RisingWave internally implements a schedulable compaction service (see https://github.com/nimtable/iceberg-compaction, an engine based on the DataFusion project) that periodically removes equality delete files and compacts small files, reducing read amplification and storage fragmentation while preserving data freshness.
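A minimal sketch of why equality deletes amplify reads, and what compaction buys. It assumes in-memory lists of dicts stand in for data files and delete files, and it ignores details a real Iceberg reader handles (sequence numbers, partition scoping). The function names are hypothetical and are not taken from iceberg-compaction.

```python
def read_with_equality_deletes(data_files, delete_files, key_columns):
    """Merge-on-read: return (live_rows, probes).

    `probes` counts how many delete-file lookups were needed, a rough
    stand-in for the extra work caused by equality delete files.
    """
    # One set of deleted keys per delete file.
    delete_sets = [
        {tuple(entry[c] for c in key_columns) for entry in delete_file}
        for delete_file in delete_files
    ]
    live, probes = [], 0
    for data_file in data_files:
        for row in data_file:
            key = tuple(row[c] for c in key_columns)
            deleted = False
            for keys in delete_sets:
                probes += 1  # every row is matched against delete files
                if key in keys:
                    deleted = True
                    break
            if not deleted:
                live.append(row)
    return live, probes


def compact(data_files, delete_files, key_columns):
    """Rewrite live rows into one data file and drop all delete files."""
    live, _ = read_with_equality_deletes(data_files, delete_files, key_columns)
    return [live], []
```

With two delete files, every surviving row is checked against both. After `compact`, the same read returns the same rows with no delete-file lookups at all.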
-
Embedding User-Defined Indexes in Apache Parquet
We are actively working on supporting extension types. The mechanism is likely to be the Arrow extension type mechanism (a logical annotation on top of existing Arrow types https://arrow.apache.org/docs/format/Columnar.html#format-me...)
I expect this to be used to support Variant (https://github.com/apache/datafusion/issues/16116) as well as geometry types.
-
From Go to Rust: Supercharging Our ClickHouse UDFs with Alloy
While our Go-based implementation has served us well, we've been exploring whether Rust—with its rapidly maturing Ethereum ecosystem—could take us even further. The potential benefits are compelling: better performance, enhanced safety, and improved portability that could make it easier to bring these UDFs to other analytical engines like DataFusion or DuckDB.
-
PostgreSQL Maximalism
Based on Apache DataFusion.
- Apache Datafusion – An extensible query engine in Rust that uses Apache Arrow
-
Ask HN: Who wants to be hired? (February 2025)
Remote: Yes
Willing to relocate: Yes
Technologies: Rust, Nodejs, Javascript, Typescript, Golang
Résumé/CV: https://drive.google.com/drive/folders/1ecTn700lcmt8cqlnBTtm...
Email: jatin6972@gmail.com
Github: https://github.com/jatin510
Info: Hi, I'm Jagdish Parihar! A Backend Engineer with 4+ years of experience building scalable systems and microservices using Rust, Node.js, and Golang. I've contributed to open-source projects like Apache DataFusion and thrive on solving complex backend challenges.
I'm exploring opportunities at database-focused startups and looking for a way in as an engineer who works on databases. Currently, I am contributing to open source and looking for part-time or full-time work with databases.
Datafusion contributions: https://github.com/apache/datafusion/pulls?q=is%3Apr+author%...
Datafusion comet contributions: https://github.com/apache/datafusion-comet/pulls?q=is%3Apr+a...
Let’s connect!
What are some alternatives?
linfa - A Rust machine learning framework.
DuckDB - DuckDB is an analytical in-process SQL database management system
neuronika - Tensors and dynamic neural networks in pure Rust.
polars - Extremely fast Query Engine for DataFrames, written in Rust
r4cppp - Rust for C++ programmers
arrow2 - Transmute-free Rust library to work with the Arrow format