tidytable
Tidy interface to 'data.table' (by markfairbanks)
tidypolars
Tidy interface to polars (by markfairbanks)
tidytable | tidypolars | |
---|---|---|
26 | 7 | |
455 | 350 | |
1.1% | 0.9% | |
7.9 | 8.9 | |
about 1 month ago | 3 months ago | |
R | Python | |
GNU General Public License v3.0 or later | MIT License |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tidytable
Posts with mentions or reviews of tidytable.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-04-09.
- Tidyverse 2.0.0
-
fuzzyjoin - "Error in which(m) : argument to 'which' is not logical"
If you need speed, you should consider using dtplyr (or tidytable), or even dbplyr with duckdb.
-
tidytable v0.10.0 is now on CRAN - use tidyverse-like syntax with data.table speed
What do you think of this instead?
-
Offering several functions to create the same object in my package
Here's an example - I use this in a package I've built called tidytable. Here is the as_tidytable() function I use that uses method dispatch.
-
Dplyr performance issues (Late 2022)
If you're having performance issues with dplyr you can also try out tidytable
-
R Dialects Broke Me
I’d say tidytable is a better option these days as it supports more functions. Although I think dtplyr has improved on this front recently, but still lags. The author of tidytable contributes to dtplyr as well.
-
Why is mlr3 so under-marketed?
I know you said it 'feels much faster' which isn't exactly a data oriented comparison, but tidymodels performs very well. You can see one of the dplyr functions as step_* in tidymodels, for example mutate vs. step_mutate under recipes library. The author of tidytable, which uses data.table, had some revisions due to this conversation, just as an example.
-
Why is {dplyr} so huge, and are there any alternatives or a {dplyr} 'lite' that I can use for the basic mutate, group_by, summarize, etc?
Tidytable is what you might be looking for: https://markfairbanks.github.io/tidytable/, this will require a bit of refactoring (e.g group-bys happen as arguments in summarise/mutate). You'll get data.table like speed in a very compact & complete package.
-
Programming with R {dplyr}
People can also use tidytable and keep the same workflow they're already used to 😄
- tidytable v0.8.1 is on CRAN - it also comes with a new logo! Need data.table speed with tidyverse syntax? Check out tidytable.
tidypolars
Posts with mentions or reviews of tidypolars.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-01-07.
-
Modern Polars: a side-by-side comparison with Pandas
I recommend trying tidypolars
-
Modern Polars: an extensive side-by-side comparison of Polars and Pandas
There’s a tidypolars package that appears to be well-maintained https://github.com/markfairbanks/tidypolars
-
R Tidyverse / dplyr is life changing!
tidypolars is one I’ve seen. Still very new, but it’s built on top of polars (which is a little more like dplyr to begin with), so it’s much faster than pandas.
-
Introducing tidypolars - a Python data frame package with syntax familiar to R tidyverse users
If I misunderstood your question - feel free to open a discussion with a small code example and we can talk through how you can do it in tidypolars.
- Introducing tidypolars - a Python data frame package for R tidyverse users
-
Tidyverse appreciation thread
Try out tidypolars. It's really close to tidyverse syntax and it's a lot faster than pandas as well
What are some alternatives?
When comparing tidytable and tidypolars you can also consider the following projects:
dtplyr - Data table backend for dplyr
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Tidier.jl - Meta-package for data analysis in Julia, modeled after the R tidyverse.
root - The official repository for ROOT: analyzing, storing and visualizing big data, scientifically
extendr - R extension library for rust designed to be familiar to R users.
box - Write reusable, composable and modular R code
db-benchmark - reproducible benchmark of database-like ops
tidyr - Tidy Messy Data
Apache Arrow - Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
modin - Modin: Scale your Pandas workflows by changing a single line of code