tidytable
AlgebraOfGraphics.jl
| | tidytable | AlgebraOfGraphics.jl |
|---|---|---|
| | 26 | 4 |
| Stars | 435 | 393 |
| Growth | - | 1.3% |
| Activity | 8.2 | 5.0 |
| | 27 days ago | 6 days ago |
| Language | R | Julia |
| License | GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
tidytable
- Tidyverse 2.0.0
- fuzzyjoin - "Error in which(m) : argument to 'which' is not logical"
  If you need speed, you should consider using dtplyr (or tidytable), or even dbplyr with duckdb.
- tidytable v0.10.0 is now on CRAN - use tidyverse-like syntax with data.table speed
  What do you think of this instead?
- Offering several functions to create the same object in my package
  Here's an example - I use this in a package I've built called tidytable. Here is the as_tidytable() function I use that uses method dispatch.
- Dplyr performance issues (Late 2022)
  If you're having performance issues with dplyr you can also try out tidytable
- R Dialects Broke Me
  I’d say tidytable is a better option these days as it supports more functions. Although I think dtplyr has improved on this front recently, but still lags. The author of tidytable contributes to dtplyr as well.
- Why is mlr3 so under-marketed?
  I know you said it 'feels much faster' which isn't exactly a data oriented comparison, but tidymodels performs very well. You can see one of the dplyr functions as step_* in tidymodels, for example mutate vs. step_mutate under recipes library. The author of tidytable, which uses data.table, had some revisions due to this conversation, just as an example.
- Why is {dplyr} so huge, and are there any alternatives or a {dplyr} 'lite' that I can use for the basic mutate, group_by, summarize, etc?
  Tidytable is what you might be looking for: https://markfairbanks.github.io/tidytable/, this will require a bit of refactoring (e.g group-bys happen as arguments in summarise/mutate). You'll get data.table like speed in a very compact & complete package.
- Programming with R {dplyr}
  People can also use tidytable and keep the same workflow they're already used to 😄
- tidytable v0.8.1 is on CRAN - it also comes with a new logo! Need data.table speed with tidyverse syntax? Check out tidytable.
AlgebraOfGraphics.jl
- Makie, a modern and fast plotting library for Julia
- Tidyverse 2.0.0
  This illustrates the point perfectly. Julia is attempting this and has a beachhead with Dataframes.jl. Confusingly though, Tidier.jl isn't really analogous to R's Tidyverse. It's more like one of a handful of meta-packages around Dataframes.jl.
  Then there are Grammar of Graphics (ggplot was Tidyverse's first star) style plotting libraries that Julia has been building. I'm probably most excited about Algebra of Graphics (https://github.com/MakieOrg/AlgebraOfGraphics.jl/) as part of the Makie Plots ecosystem. It does still feel a bit like Julia community can't decide between following Matplotlib or R's Grid/Ggplot approach.
  The seeds of a Tidyverse for Julia are there, but it'll take some time to achieve the consistency and maturity of the original Tidyverse.
- What Julia plotting library do you use/think will be the standard going forward?
  Did you maybe overlook something, in https://github.com/JuliaPlots/AlgebraOfGraphics.jl or other package? I looked up "grid" and it seems to have something. I realize R, and ggplot2, were considered best by many (and Gadfly.jl similar, AoG seems to be its replacement?), but I didn't realize it had extensions (that you clarify below). At least you can call R, and thus use its plotting (and I assume its extensions too, can you confirm or deny?). For some reasons you got downvoted, so might you be ignorant of new developments in Julia (also Makie, to me it seemed excellent and I thought Julia caught up with plotting, and also had more options than other languages), or the others, or people simply very opinionated about plotting? It's about features, also speed/latency/TTFP, which is getting better.
- Julia Update: Adoption Keeps Climbing; Is It a Python Challenger?
  Julia has plenty of plotting solutions that are better for stats than matplotlib:
  https://github.com/JuliaPlots/AlgebraOfGraphics.jl
What are some alternatives?
dtplyr - Data table backend for dplyr
Genie.jl - 🧞The highly productive Julia web framework
tidypolars - Tidy interface to polars
StatsPlots.jl - Statistical plotting recipes for Plots.jl
polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Chain.jl - A Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
VegaLite.jl - Julia bindings to Vega-Lite
tidyr - Tidy Messy Data
RCall.jl - Call R from Julia
root - The official repository for ROOT: analyzing, storing and visualizing big data, scientifically
Revise.jl - Automatically update function definitions in a running Julia session