Top 4 R reproducible-research Projects

  • GitHub repo drake

    An R-focused pipeline toolkit for reproducibility and high-performance computing (by ropensci)

    Project mention: Your impression of {targets}? (r package) | reddit.com/r/Rlanguage | 2021-05-02

    The targets package is the official successor to drake and has the same primary author (Will Landau). He has explained why he created targets; his reasons include stronger guardrails for users and better UX.

  • GitHub repo targets

    Function-oriented Make-like declarative workflows for R

    Project mention: How do you manage, distribute and schedule jobs written in R? | reddit.com/r/dataengineering | 2021-10-07

    That said, you might want to check out the ‘targets’ package, which provides a DSL for specifying complex workflow descriptions in R. When repeatedly running the same jobs on changing data, this package helps ensure that only necessary work is performed (suitable intermediate results are reused), and scripts are run reproducibly. This might help with scheduling.

  • GitHub repo lumberjack

    Track changes in data with ease (by markvanderloo)

    Project mention: [P] Datasets should behave like Git repositories | reddit.com/r/MachineLearning | 2021-01-19

    There is an R project... https://github.com/markvanderloo/lumberjack/blob/master/pkg/vignettes/jss4008.pdf

  • GitHub repo groundhog

    Reproducible R Scripts Via Version-Specific CRAN-Package Storing and Loading (by CredibilityLab)

    Project mention: Groundhog: Addressing the Threat That R Poses to Reproducible Research | news.ycombinator.com | 2021-01-06

    Is it not on GitHub at https://github.com/CredibilityLab/groundhog ?

