Introducing tidypolars - a Python data frame package for R tidyverse users

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

polars

144 26,043 10.0 Rust

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

tidypolars uses the polars package as a backend, which might be the fastest data frame manipulation library out there. (Faster even than R's data.table, which has been the king of speed for many years.)

db-benchmark

91 319 0.0 R

reproducible benchmark of database-like ops

I think having a basic understanding of pandas, given how broadly it's used, is beneficial. That being said, polars seems to be matching or beating data.table in performance, so I think it'd be very worth it to take it up. Wes McKinney, creator of pandas, has been quite vocal about architecture flaws of pandas -- which is why he's been working on the Arrow project. polars is based on Arrow, so in principle it's kinda like pandas 2.0 (adopting the changes that Wes proposed).

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Apache Arrow

75 13,480 10.0 C++

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

I think having a basic understanding of pandas, given how broadly it's used, is beneficial. That being said, polars seems to be matching or beating data.table in performance, so I think it'd be very worth it to take it up. Wes McKinney, creator of pandas, has been quite vocal about architecture flaws of pandas -- which is why he's been working on the Arrow project. polars is based on Arrow, so in principle it's kinda like pandas 2.0 (adopting the changes that Wes proposed).

tidypolars

7 308 8.0 Python

Tidy interface to polars
extendr

2 399 8.4 Rust

R extension library for rust designed to be familiar to R users.
tidytable

26 435 8.3 R

Tidy interface to 'data.table'

What's cool about this (and /u/GoodAboutHood's other package tidytable) is that they adopt the widely used Tidyverse syntax for high-performance packages without sacrificing speed (and, in my opinion of dtplyr, making it too complicated).

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Why Python's Integer Division Floors (2010)
1 project | news.ycombinator.com | 28 Feb 2024
Polars 0.20 Released
1 project | news.ycombinator.com | 16 Dec 2023
Polars: Dataframes powered by a multithreaded query engine, written in Rust
1 project | news.ycombinator.com | 7 Dec 2023
Polars 0.34 is released. (A query engine focussing on DataFrame front ends)
1 project | /r/u_Dazzling_Finger_8120 | 26 Oct 2023
Polars 0.34 is released. (A query engine focussing on DataFrame front ends)
1 project | /r/rust | 26 Oct 2023

Introducing tidypolars - a Python data frame package for R tidyverse users

This page summarizes the projects mentioned and recommended in the original post on /r/rstats
Rust Arrow dataframe-library Dataframe R
Post date: 10 Nov 2021

polars

db-benchmark

InfluxDB

Apache Arrow

tidypolars

extendr

tidytable

Related posts

Introducing tidypolars - a Python data frame package for R tidyverse users

This page summarizes the projects mentioned and recommended in the original post on /r/rstats Rust Arrow dataframe-library Dataframe R Post date: 10 Nov 2021

polars

db-benchmark

InfluxDB

Apache Arrow

tidypolars

extendr

tidytable

Related posts

This page summarizes the projects mentioned and recommended in the original post on /r/rstats
Rust Arrow dataframe-library Dataframe R
Post date: 10 Nov 2021