Top 23 R Open-Source Projects
Apache Spark - A unified analytics engine for large-scale data processingProject mention: is anyone want to join maintaining spark java framework? | reddit.com/r/java | 2022-06-21
Wow, this has nothing to do with Apache Spark (https://spark.apache.org/), the wildly popular JVM based data processing framework.
GraalVM: Run Programs Faster Anywhere :rocket:Project mention: Truffle Framework - How to achieve variable scoping with native compilation? | reddit.com/r/graalvm | 2022-06-22
I am certain if you scan some of the other language implementations here: https://github.com/oracle/graal/blob/master/truffle/docs/Languages.md You will find more examples.
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
This is an older example, i found on github here
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.Project mention: Can anyone suggest me some good resources on time series analysis and forecasting? | reddit.com/r/datascience | 2022-06-24
Try Facebook's Prophet library.
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.Project mention: Search YouTube from the terminal written in python | reddit.com/r/Python | 2022-02-28
Microsoft lightGBM. https://github.com/microsoft/LightGBM
List of Data Science Cheatsheets to rule the worldProject mention: ⚙️ Data Science Cheat Sheets: A collection of cheat sheets for #DataScience and problem solving. h/t @Sauain | reddit.com/r/policerewired | 2021-10-01
mal - Make a LispProject mention: Resources to build an interpreter or PL in Haskell? | reddit.com/r/ProgrammingLanguages | 2022-06-24
Developer Ecosystem Survey 2022. Take part in the Developer Ecosystem Survey 2022 by JetBrains and get a chance to win a Macbook, a Nvidia graphics card, or other prizes. We’ll create an infographic full of stats, and you’ll get personalized results so you can compare yourself with other developers.
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.Project mention: Writing the fastest GBDT libary in Rust | dev.to | 2022-01-11
Here are our benchmarks on training time comparing Tangram's Gradient Boosted Decision Tree Library to LightGBM, XGBoost, CatBoost, and sklearn.
🔥 Hugo website builder, Hugo themes & Hugo CMS. No code, build with widgets! 创建在线课程，学术简历或初创网站。Project mention: wowchemy-hugo-themes VS ough-hugo - a user suggested alternative | libhunt.com/r/wowchemy-hugo-themes | 2022-04-19
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
:rocket: Build and manage real-life data science projects with ease!Project mention: AWS Summit 2022 Australia and New Zealand - Day 2, AI/ML Edition | dev.to | 2022-05-20
As a result of their new DS framework (based on a Metaflow - a DS framework built at Netflix and AWS SageMaker Pipelines), they were able to free up their DS resources so that Software Developers were now trained and equipped to tackle their normal DS projects, at a ratio of 70% DS/ML work was now completed by developers. This leaves the 30% meatier and more difficult problems for the Data Scientists to tackle.
An implementation of the Grammar of Graphics in R
A curated list of awesome R packages, frameworks and software.Project mention: Python vs Matlab vs R | reddit.com/r/GradSchool | 2022-02-12
📚 Parameterize, execute, and analyze notebooksProject mention: html reports using python | reddit.com/r/learnpython | 2022-04-01
papermill - similar to nbconvert with parametrization, and intergration for cloud storages
dplyr: A grammar of data manipulationProject mention: Have R changed a lot in the past 10 years? | reddit.com/r/rprogramming | 2022-05-31
Just start by memorising a five Tidy verbs (mutate.(), select.(), filter.(), arrange.(), summarise()) covers 50% of EDA. Use .by = XYZ within these Tidy verbs for tidytable, and check some examples + cheatsheets here: https://dplyr.tidyverse.org (the website recommends dtplyr but tidytable is just better/more mature/more saturated/etc.).
R for data science: a bookProject mention: Factor(1) | reddit.com/r/rprogramming | 2022-06-26
Hi everyone, I was reading R for Data Science by Hadley Wickham and I came across this question:
Realtime Web Apps and Dashboards for Python and R (by h2oai)Project mention: PyScript | news.ycombinator.com | 2022-06-22
🛠 All-in-one web-based IDE specialized for machine learning and data science.Project mention: Dynamically spin up VM (based on specific HTTPS request) and stop it once session is over? | reddit.com/r/devops | 2022-06-02
It will be a web based IDE dev kit (like Jupyter Hub, or JupyterLab) if you are familiar with them)
Dynamic Documents for RProject mention: Obsidian Compatibility with R | reddit.com/r/ObsidianMD | 2022-05-31
I would start here: https://rmarkdown.rstudio.com. Personally, I would use this instead of Obsidian if I also wanted working R code.
Multi-language suite for high-performance solvers of differential equations and scientific machine learning (SciML) componentsProject mention: When is julia getting proper precompilation? | reddit.com/r/Julia | 2021-12-10
It's not faith, and it's not all from Julia itself. https://github.com/SciML/DifferentialEquations.jl/issues/785 should reduce compile times of what OP mentioned for example.
Data Science Repo and blog for John Hopkins Coursera Courses. Please let me know if you have any questions.Project mention: datasciencecoursera: NEW Courses - star count:2053.0 | reddit.com/r/algoprojects | 2022-06-25
Carefully curated resource links for data science in one placeProject mention: ⚙️ Data Science Collected Resources: A trove of carefully curated resources and links (on the topics of software, platforms, language, techniques, etc.) related to #DataScience, all in one place. h/t @Sauain | reddit.com/r/policerewired | 2021-09-21
R related posts
PRQL 0.2 — a modern language for transforming data — a simple, powerful, pipelined SQL replacement. Now ready to use!
4 projects | reddit.com/r/programming | 27 Jun 2022
Standalone R script to executable
2 projects | reddit.com/r/Rlanguage | 27 Jun 2022
1 project | reddit.com/r/rprogramming | 26 Jun 2022
datasciencecoursera: NEW Courses - star count:2053.0
1 project | reddit.com/r/algoprojects | 25 Jun 2022
datasciencecoursera: NEW Courses - star count:2053.0
1 project | reddit.com/r/algoprojects | 24 Jun 2022
what are the prerequisite of learning R ?
1 project | reddit.com/r/Rlanguage | 24 Jun 2022
Can anyone suggest me some good resources on time series analysis and forecasting?
1 project | reddit.com/r/datascience | 24 Jun 2022
What are some of the best open-source R projects? This list will help you:
Are you hiring? Post a new remote job listing for free.