bacon
datafusion
Metric | bacon | datafusion
---|---|---
Mentions | 2 | 66
Stars | 190 | 7,434
Growth | 1.6% | 2.3%
Activity | 4.0 | 10.0
Last commit | about 2 months ago | 7 days ago
Language | Rust | Rust
License | MIT License | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
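The growth figure can be read as a simple percentage change in star count between two monthly snapshots. The sketch below is one plausible way to compute it, not the site's actual formula. The function name `month_over_month_growth` and the star counts in the usage note are hypothetical.

```rust
/// Month-over-month growth in percent, given the star count one month ago
/// and the star count today. This is an assumed definition for illustration.
/// Returns `None` when the previous count is zero, because the percentage
/// change is undefined in that case.
fn month_over_month_growth(previous: u64, current: u64) -> Option<f64> {
    if previous == 0 {
        return None;
    }
    // Convert before subtracting so a shrinking count yields a negative value
    // instead of an unsigned underflow.
    let delta = current as f64 - previous as f64;
    Some(delta / previous as f64 * 100.0)
}
```

With hypothetical counts, `month_over_month_growth(1000, 1023)` gives roughly 2.3, and a project that lost stars gets a negative value.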
bacon
- Any role that Rust could have in the Data world (Big Data, Data Science, Machine learning, etc.)?
- Scientific Computing in Rust
See the GitHub repo here https://github.com/aftix/bacon
datafusion
- Embedding User-Defined Indexes in Apache Parquet
We are actively working on supporting extension types. The mechanism will likely be the Arrow extension type mechanism (a logical annotation on top of existing Arrow types https://arrow.apache.org/docs/format/Columnar.html#format-me...)
I expect this to be used to support Variant https://github.com/apache/datafusion/issues/16116
As well as geometry types
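To illustrate what "a logical annotation on top of existing Arrow types" means: in the Arrow columnar format, an extension type keeps an ordinary storage type and is identified by two reserved field-metadata keys, `ARROW:extension:name` and `ARROW:extension:metadata`. The sketch below models that idea with the standard library only. The `Field` struct is a hypothetical, simplified stand-in and not the `arrow` crate's API, and the extension name `example.geometry` is made up for illustration.

```rust
use std::collections::HashMap;

// Metadata keys reserved by the Arrow columnar format for extension types.
const EXTENSION_NAME_KEY: &str = "ARROW:extension:name";
const EXTENSION_METADATA_KEY: &str = "ARROW:extension:metadata";

/// Hypothetical, simplified model of an Arrow field: a name, a physical
/// storage type, and free-form key/value metadata.
#[derive(Debug, Clone, PartialEq)]
struct Field {
    name: String,
    storage_type: String,
    metadata: HashMap<String, String>,
}

impl Field {
    fn new(name: &str, storage_type: &str) -> Self {
        Field {
            name: name.to_string(),
            storage_type: storage_type.to_string(),
            metadata: HashMap::new(),
        }
    }

    /// Annotate the field as an extension type. The storage type is left
    /// untouched; only the metadata changes.
    fn with_extension(mut self, ext_name: &str, ext_metadata: &str) -> Self {
        self.metadata
            .insert(EXTENSION_NAME_KEY.to_string(), ext_name.to_string());
        self.metadata
            .insert(EXTENSION_METADATA_KEY.to_string(), ext_metadata.to_string());
        self
    }

    /// The extension name, if this field carries the annotation.
    fn extension_name(&self) -> Option<&str> {
        self.metadata.get(EXTENSION_NAME_KEY).map(String::as_str)
    }
}
```

A reader that does not recognize the extension name can still process the column as its plain storage type, which is why the annotation is described as logical.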
- From Go to Rust: Supercharging Our ClickHouse UDFs with Alloy
While our Go-based implementation has served us well, we've been exploring whether Rust—with its rapidly maturing Ethereum ecosystem—could take us even further. The potential benefits are compelling: better performance, enhanced safety, and improved portability that could make it easier to bring these UDFs to other analytical engines like DataFusion or DuckDB.
- PostgreSQL Maximalism
Based on Apache DataFusion.
- Apache Datafusion – An extensible query engine in Rust that uses Apache Arrow
- Ask HN: Who wants to be hired? (February 2025)
Remote: Yes
Willing to relocate: Yes
Technologies: Rust, Nodejs, Javascript, Typescript, Golang
Résumé/CV: https://drive.google.com/drive/folders/1ecTn700lcmt8cqlnBTtm...
Email: [email protected]
Github: https://github.com/jatin510
Info: Hi, I'm Jagdish Parihar! A Backend Engineer with 4+ years of experience building scalable systems and microservices using Rust, Node.js, and Golang. I've contributed to open-source projects like Apache DataFusion and thrive on solving complex backend challenges.
I'm exploring opportunities to work at database-focused startups and am looking for a way into database engineering. I currently contribute to open source and am looking for part-time or full-time work with databases.
Datafusion contributions: https://github.com/apache/datafusion/pulls?q=is%3Apr+author%...
Datafusion comet contributions: https://github.com/apache/datafusion-comet/pulls?q=is%3Apr+a...
Let’s connect!
- Apache DataFusion
- How to build a new Harlequin adapter with Poetry
Harlequin is a lightweight TUI client with extensive support for SQL databases. It is a versatile tool for data exploration and analysis workflows. Harlequin provides an interactive SQL editor with features like autocomplete, syntax highlighting, and query history. It also has a results viewer that can display large result sets. However, Harlequin did not previously have a DataFusion adapter. Thankfully, it was really easy to add one.
- Why you should keep an eye on Apache DataFusion and its community.
In case you don't know what Apache DataFusion is, here's the high-level blurb.
- Make Rust Object Oriented with the dual-trait pattern
I've invented 😎 this dual-trait pattern for the purposes of the logical planner, as seen in this merged PR. The problem was that the nodes in the plan (filter, select, etc.) had to support at the same time:
- Pg_lakehouse: A DuckDB Alternative in Postgres
What are some alternatives?
Apache Arrow - Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
DuckDB - DuckDB is an analytical in-process SQL database management system
rink-rs - Unit conversion tool and library written in Rust
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
statrs - Statistical computation library for Rust
ClickHouse - ClickHouse® is a real-time analytics database management system