Top 23 R Open-Source Projects
-
Project mention: is anyone want to join maintaining spark java framework? | reddit.com/r/java | 2022-06-21
Wow, this has nothing to do with Apache Spark (https://spark.apache.org/), the wildly popular JVM based data processing framework.
-
Project mention: Truffle Framework - How to achieve variable scoping with native compilation? | reddit.com/r/graalvm | 2022-06-22
I am certain if you scan some of the other language implementations here: https://github.com/oracle/graal/blob/master/truffle/docs/Languages.md You will find more examples.
-
Scout APM
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
-
This is an older example, i found on github here
-
Prophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Project mention: Can anyone suggest me some good resources on time series analysis and forecasting? | reddit.com/r/datascience | 2022-06-24Try Facebook's Prophet library.
-
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
Project mention: Search YouTube from the terminal written in python | reddit.com/r/Python | 2022-02-28Microsoft lightGBM. https://github.com/microsoft/LightGBM
-
Project mention: ⚙️ Data Science Cheat Sheets: A collection of cheat sheets for #DataScience and problem solving. h/t @Sauain | reddit.com/r/policerewired | 2021-10-01
-
Project mention: Resources to build an interpreter or PL in Haskell? | reddit.com/r/ProgrammingLanguages | 2022-06-24
-
JetBrains
Developer Ecosystem Survey 2022. Take part in the Developer Ecosystem Survey 2022 by JetBrains and get a chance to win a Macbook, a Nvidia graphics card, or other prizes. We’ll create an infographic full of stats, and you’ll get personalized results so you can compare yourself with other developers.
-
catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Here are our benchmarks on training time comparing Tangram's Gradient Boosted Decision Tree Library to LightGBM, XGBoost, CatBoost, and sklearn.
-
wowchemy-hugo-themes
🔥 Hugo website builder, Hugo themes & Hugo CMS. No code, build with widgets! 创建在线课程,学术简历或初创网站。
Project mention: wowchemy-hugo-themes VS ough-hugo - a user suggested alternative | libhunt.com/r/wowchemy-hugo-themes | 2022-04-19 -
H2O
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
-
Project mention: AWS Summit 2022 Australia and New Zealand - Day 2, AI/ML Edition | dev.to | 2022-05-20
As a result of their new DS framework (based on a Metaflow - a DS framework built at Netflix and AWS SageMaker Pipelines), they were able to free up their DS resources so that Software Developers were now trained and equipped to tackle their normal DS projects, at a ratio of 70% DS/ML work was now completed by developers. This leaves the 30% meatier and more difficult problems for the Data Scientists to tackle.
-
-
-
papermill - similar to nbconvert with parametrization, and intergration for cloud storages
-
Project mention: Have R changed a lot in the past 10 years? | reddit.com/r/rprogramming | 2022-05-31
Just start by memorising a five Tidy verbs (mutate.(), select.(), filter.(), arrange.(), summarise()) covers 50% of EDA. Use .by = XYZ within these Tidy verbs for tidytable, and check some examples + cheatsheets here: https://dplyr.tidyverse.org (the website recommends dtplyr but tidytable is just better/more mature/more saturated/etc.).
-
Hi everyone, I was reading R for Data Science by Hadley Wickham and I came across this question:
-
-
Project mention: Dynamically spin up VM (based on specific HTTPS request) and stop it once session is over? | reddit.com/r/devops | 2022-06-02
It will be a web based IDE dev kit (like Jupyter Hub, or JupyterLab) if you are familiar with them)
-
I would start here: https://rmarkdown.rstudio.com. Personally, I would use this instead of Obsidian if I also wanted working R code.
-
DifferentialEquations.jl
Multi-language suite for high-performance solvers of differential equations and scientific machine learning (SciML) components
It's not faith, and it's not all from Julia itself. https://github.com/SciML/DifferentialEquations.jl/issues/785 should reduce compile times of what OP mentioned for example.
-
m2cgen
Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies
-
datasciencecoursera
Data Science Repo and blog for John Hopkins Coursera Courses. Please let me know if you have any questions.
Project mention: datasciencecoursera: NEW Courses - star count:2053.0 | reddit.com/r/algoprojects | 2022-06-25 -
Project mention: ⚙️ Data Science Collected Resources: A trove of carefully curated resources and links (on the topics of software, platforms, language, techniques, etc.) related to #DataScience, all in one place. h/t @Sauain | reddit.com/r/policerewired | 2021-09-21
R related posts
- PRQL 0.2 — a modern language for transforming data — a simple, powerful, pipelined SQL replacement. Now ready to use!
- Standalone R script to executable
- Factor(1)
- datasciencecoursera: NEW Courses - star count:2053.0
- datasciencecoursera: NEW Courses - star count:2053.0
- what are the prerequisite of learning R ?
- Can anyone suggest me some good resources on time series analysis and forecasting?
Index
What are some of the best open-source R projects? This list will help you:
Project | Stars | |
---|---|---|
1 | Apache Spark | 33,221 |
2 | Graal | 17,164 |
3 | dash | 16,700 |
4 | Prophet | 14,576 |
5 | LightGBM | 13,926 |
6 | ds-cheatsheets | 10,649 |
7 | mal | 8,704 |
8 | catboost | 6,597 |
9 | wowchemy-hugo-themes | 6,527 |
10 | H2O | 5,856 |
11 | metaflow | 5,735 |
12 | ggplot2 | 5,465 |
13 | awesome-R | 5,070 |
14 | papermill | 4,743 |
15 | dplyr | 4,128 |
16 | r4ds | 3,536 |
17 | wave | 2,968 |
18 | ML-Workspace | 2,601 |
19 | rmarkdown | 2,471 |
20 | DifferentialEquations.jl | 2,244 |
21 | m2cgen | 2,121 |
22 | datasciencecoursera | 2,058 |
23 | Data-science-best-resources | 1,931 |
Are you hiring? Post a new remote job listing for free.