poorman
tidytable
Our great sponsors
poorman | tidytable | |
---|---|---|
2 | 26 | |
328 | 435 | |
- | - | |
5.7 | 8.3 | |
2 months ago | 8 days ago | |
R | R | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
poorman
-
Why is {dplyr} so huge, and are there any alternatives or a {dplyr} 'lite' that I can use for the basic mutate, group_by, summarize, etc?
You might find the poorman package interesting: https://github.com/nathaneastwood/poorman
-
Just how widely accepted is tidyr/dplyr these days?
It's true that their packages are heavy on dependencies, and if that is a concern, you have alternatives: - poorman: no dependencies, same syntax as dplyr, but only includes basic verbs. - datawizard: low dependencies, slightly different syntax, has base-R implementations of most of dplyr / tidyr functions, plus some other goodies likes scaling, mean-centering, rank transforming, ... - And of course, data.table: 0 dependencies, ultra-fast (everything is written in optimized C under the hood), can manipulate much bigger data than the Tidyverse, and can do everything the tidyverse can when it comes to data wrangling (however, sometimes the tidyverse has convenience functions that make some operations shorter than with data.table). The downside is that data.table's syntax requires more efforts to learn / is less intuitive to read for neophytes.
tidytable
- Tidyverse 2.0.0
-
fuzzyjoin - "Error in which(m) : argument to 'which' is not logical"
If you need speed, you should consider using dtplyr (or tidytable), or even dbplyr with duckdb.
-
tidytable v0.10.0 is now on CRAN - use tidyverse-like syntax with data.table speed
And the Github link if you want to dive into the code.
Other than that you can check the news for other changes!
What do you think of this instead?
-
R Dialects Broke Me
I’d say tidytable is a better option these days as it supports more functions. Although I think dtplyr has improved on this front recently, but still lags. The author of tidytable contributes to dtplyr as well.
-
Why is {dplyr} so huge, and are there any alternatives or a {dplyr} 'lite' that I can use for the basic mutate, group_by, summarize, etc?
Tidytable is what you might be looking for: https://markfairbanks.github.io/tidytable/, this will require a bit of refactoring (e.g group-bys happen as arguments in summarise/mutate). You'll get data.table like speed in a very compact & complete package.
- tidytable v0.8.1 is on CRAN - it also comes with a new logo! Need data.table speed with tidyverse syntax? Check out tidytable.
-
tidytable v0.7.0 is now on CRAN! Use tidyverse-like syntax with data.table speed.
Github page - (if you want to browse the source code)
What are some alternatives?
dtplyr - Data table backend for dplyr
tidypolars - Tidy interface to polars
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
tidyr - Tidy Messy Data
Tidier.jl - Meta-package for data analysis in Julia, modeled after the R tidyverse.
root - The official repository for ROOT: analyzing, storing and visualizing big data, scientifically
extendr - R extension library for rust designed to be familiar to R users.
db-benchmark - reproducible benchmark of database-like ops
re2 - R interface to Google re2 (C++) regular expression engine
lintr - Static Code Analysis for R
box - Write reusable, composable and modular R code