Top 8 R Machine Learning Projects
👷♂️ A simple package for extracting useful features from character objects 👷♀️
Project mention: Using dictionaries to check text in R, language processing | reddit.com/r/rstats | 2021-08-18
TextFeatures can be used to create a summary table of occurrences of text features like number of unique words, etc. Don't know if it does nouns or verbs.
Parallelizable Bayesian Optimization in R
Project mention: [D] Selecting Hyperparameters Using Bayesian Optimization | reddit.com/r/statistics | 2021-03-03
Disclaimer: I am the maintainer of ParBayesianOptimization. That readme has a pretty good walkthrough of how Bayesian optimization works.
Recommended learners for mlr3
Project mention: Machine learning algorithms in R | reddit.com/r/rstats | 2021-08-03
miceRanger: Fast Imputation with Random Forests in R
Project mention: Multiple imputation packages in R | reddit.com/r/Rlanguage | 2021-09-14
I developed miceRanger because the mice package uses a really slow implementation of random forests. It has a bunch of plotting capabilities and can impute new datasets without re-training the models used in the mice procedure.
:package: Non-parametric Causal Effects of Feasible Interventions Based on Modified Treatment Policies :crystal_ball:
Project mention: [Q] Should G-methods, IPTW always be used over traditional regression? | reddit.com/r/statistics | 2021-09-12
The tlverse/sl3 super learner library is much better integrated and a lot more powerful (a bit more complicated in the beginning, but once you understand it, it's great). LMTP has a separate branch that uses sl3: https://github.com/nt-williams/lmtp/tree/sl3-devel. To specify formulas in sl3, you just do Lrnr_glmnet$new(formula = ~ 1 + W + A + A*W), but make sure to download the "dev" version: devtools::install_github("tlverse/sl3", ref = "devel").
🎯 💯 Targeted Learning and Variable Importance for the Causal Effect of an Optimal Individualized Treatment Intervention
Project mention: [D] Is there a such thing as "Prespective Statistical Models"? | reddit.com/r/statistics | 2021-09-21
This package and the references therein allow for nonparametric estimation and inference for the optimal dynamic treatment: https://github.com/tlverse/tmle3mopttx.
Interpretable and model-robust causal inference for heterogeneous treatment effects using generalized linear working models with targeted machine-learning
Project mention: [Q] Sensitivity of (Causal) Inference to Nonlinear Functional Form | reddit.com/r/statistics | 2021-09-28
Why not both? https://tlverse.org/causalglm/ (Will replace this with a more informative comment when I have free time later today)
An analysis and visualization R package that works with publicly available soccer data
Project mention: ggshakeR - R’s first all-in-one data analysis and visualization package on open soccer data | reddit.com/r/rstats | 2021-10-18
What are some of the best open-source Machine Learning projects in R? This list will help you: