Rust Dataframe

Open-source Rust projects categorized as Dataframe

Top 9 Rust Dataframe Projects

  • polars

    Fast multi-threaded, hybrid-out-of-core DataFrame library in Rust | Python | Node.js

    Project mention: Benchmarking for Pandas and Polars Using CSV and Parquet File | reddit.com/r/Python | 2023-05-15

    I have updated this issue https://github.com/pola-rs/polars/issues/8533, please kindly help to solve it. I have also sent similar issues to Pandas https://github.com/pandas-dev/pandas/issues/53249

  • arrow-datafusion

    Apache Arrow DataFusion SQL Query Engine

    Project mention: DuckDB 0.8.0 | news.ycombinator.com | 2023-05-17

    DuckDB is a great piece of software if you are

    If you are looking for a query engine implemented in a safe language (Rust) I definitely suggest checking out DataFusion. It is comparable to DuckDB in performance, has all the standard built in SQL functionality, and is extensible in pretty much all areas (query language, data formats, catalogs, user defined functions, etc)

    https://arrow.apache.org/datafusion/

    Disclaimer I am a maintainer of DataFusion

  • InfluxDB

    Access the most powerful time series database as a service. Ingest, store, & analyze all types of time series data in a fully-managed, purpose-built database. Keep data forever with low-cost storage and superior data compression.

  • tv

    📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.

    Project mention: Are there any TUI apps you recommend outside of ncdu / nnn / htop / vim / bat / fd / tig / duf? | reddit.com/r/commandline | 2022-10-12

    I work with data a lot so I use the sqlite cli. I also made tv (self-promotion) to view csvs.

  • connector-x

    Fastest library to load data from DB to DataFrames in Rust and Python

    Project mention: I used multiprocessing and multithreading at the same time to drop the execution time of my code from 155+ seconds to just over 2+ seconds | reddit.com/r/Python | 2023-05-29

    There's packages like connector-x and polars that do a lot of what you're mentioning out of the box. I used these two to massively speed up an SQLalchemy + Pandas based ETL in the past as well.

  • arrow-ballista

    Apache Arrow Ballista Distributed Query Engine

    Project mention: Evolution and Trends of Data Engineering 2022/23 | reddit.com/r/dataengineering | 2023-05-19

    Ballista (Arrow-Rust), which is largely inspired by Apache Spark, there are some interesting differences.

  • Peroxide

    Rust numeric library with R, MATLAB & Python syntax

    Project mention: Hey Rustaceans! Got a question? Ask here! (39/2022)! | reddit.com/r/rust | 2022-09-26

    Rust’s standard library is relatively small by design and doesn’t contain any tools for numeric integration. However, you can probably find a crate on crates.io that does what you need. A quick search suggests Peroxide.

  • myval

    Lightweight Apache Arrow data frame for Rust

    Project mention: myval - lightweight Apache Arrow data frame for Rust | reddit.com/r/rust | 2023-04-21
  • SonarLint

    Clean code begins in your IDE with SonarLint. Up your coding game and discover issues early. SonarLint is a free plugin that helps you find & fix bugs and security issues from the moment you start writing code. Install from your favorite IDE marketplace today.

  • dply-rs

    A dataframe manipulation tool inspired by dplyr and powered by polars.

    Project mention: A dplyr interpreter powered by Polars | reddit.com/r/rust | 2023-05-10

    I have added documentation for all supported functions here.

  • ux-dataflow

    UX-Dataflow is a streaming capable data multiplexer that allows you to aggregate data and then process it using a Chain of Responsibility design pattern.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-05-29.

Rust Dataframe related posts

Index

What are some of the best open-source Dataframe projects in Rust? This list will help you:

Project Stars
1 polars 17,362
2 arrow-datafusion 3,701
3 tv 1,893
4 connector-x 1,332
5 arrow-ballista 785
6 Peroxide 381
7 myval 55
8 dply-rs 14
9 ux-dataflow 8
ONLYOFFICE Docs — document collaboration in your environment
Powerful document editing and collaboration in your app or environment. Ultimate security, API and 30+ ready connectors, SaaS or on-premises
www.onlyoffice.com