Open-source projects categorized as R

Top 23 R Open-Source Projects

  • Apache Spark

    Apache Spark - A unified analytics engine for large-scale data processing

    Project mention: is anyone want to join maintaining spark java framework? | reddit.com/r/java | 2022-06-21

    Wow, this has nothing to do with Apache Spark (https://spark.apache.org/), the wildly popular JVM based data processing framework.

  • Graal

    GraalVM: Run Programs Faster Anywhere 🚀

    Project mention: Truffle Framework - How to achieve variable scoping with native compilation? | reddit.com/r/graalvm | 2022-06-22

    I am certain if you scan some of the other language implementations here: https://github.com/oracle/graal/blob/master/truffle/docs/Languages.md You will find more examples.

  • dash

    Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required.

    Project mention: Can anyone explain me callback in dash | reddit.com/r/learnpython | 2022-05-10

    This is an older example, i found on github here

  • Prophet

    Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

    Project mention: Can anyone suggest me some good resources on time series analysis and forecasting? | reddit.com/r/datascience | 2022-06-24

    Try Facebook's Prophet library.

  • LightGBM

    A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

    Project mention: Search YouTube from the terminal written in python | reddit.com/r/Python | 2022-02-28

    Microsoft lightGBM. https://github.com/microsoft/LightGBM

  • ds-cheatsheets

    List of Data Science Cheatsheets to rule the world

    Project mention: ⚙️ Data Science Cheat Sheets: A collection of cheat sheets for #DataScience and problem solving. h/t @Sauain | reddit.com/r/policerewired | 2021-10-01
  • mal

    mal - Make a Lisp

    Project mention: Resources to build an interpreter or PL in Haskell? | reddit.com/r/ProgrammingLanguages | 2022-06-24
  • catboost

    A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

    Project mention: Writing the fastest GBDT libary in Rust | dev.to | 2022-01-11

    Here are our benchmarks on training time comparing Tangram's Gradient Boosted Decision Tree Library to LightGBM, XGBoost, CatBoost, and sklearn.

  • wowchemy-hugo-themes

    🔥 Hugo website builder, Hugo themes & Hugo CMS. No code, build with widgets! Create online courses, academic résumés, or startup websites.

    Project mention: wowchemy-hugo-themes VS ough-hugo - a user suggested alternative | libhunt.com/r/wowchemy-hugo-themes | 2022-04-19
  • H2O

    H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

    Project mention: A Tiny Grammar of Graphics | news.ycombinator.com | 2022-06-14
  • metaflow

    🚀 Build and manage real-life data science projects with ease!

    Project mention: AWS Summit 2022 Australia and New Zealand - Day 2, AI/ML Edition | dev.to | 2022-05-20

    As a result of their new DS framework (based on Metaflow, a DS framework built at Netflix, and AWS SageMaker Pipelines), they were able to free up their DS resources: Software Developers were now trained and equipped to tackle their normal DS projects, and about 70% of DS/ML work was now completed by developers. This leaves the remaining 30%, the meatier and more difficult problems, for the Data Scientists to tackle.

  • ggplot2

    An implementation of the Grammar of Graphics in R

    Project mention: A Tiny Grammar of Graphics | news.ycombinator.com | 2022-06-14
  • awesome-R

    A curated list of awesome R packages, frameworks and software.

    Project mention: Python vs Matlab vs R | reddit.com/r/GradSchool | 2022-02-12
  • papermill

    📚 Parameterize, execute, and analyze notebooks

    Project mention: html reports using python | reddit.com/r/learnpython | 2022-04-01

    papermill - similar to nbconvert, with parametrization and integration with cloud storage

  • dplyr

    dplyr: A grammar of data manipulation

    Project mention: Have R changed a lot in the past 10 years? | reddit.com/r/rprogramming | 2022-05-31

    Just start by memorising five Tidy verbs (mutate.(), select.(), filter.(), arrange.(), summarise()); together they cover 50% of EDA. Use .by = XYZ within these Tidy verbs for tidytable, and check some examples + cheatsheets here: https://dplyr.tidyverse.org (the website recommends dtplyr but tidytable is just better/more mature/more saturated/etc.).

  • r4ds

    R for data science: a book

    Project mention: Factor(1) | reddit.com/r/rprogramming | 2022-06-26

    Hi everyone, I was reading R for Data Science by Hadley Wickham and I came across this question:

  • wave

    Realtime Web Apps and Dashboards for Python and R (by h2oai)

    Project mention: PyScript | news.ycombinator.com | 2022-06-22
  • ML-Workspace

    🛠 All-in-one web-based IDE specialized for machine learning and data science.

    Project mention: Dynamically spin up VM (based on specific HTTPS request) and stop it once session is over? | reddit.com/r/devops | 2022-06-02

    It will be a web-based IDE dev kit (like JupyterHub or JupyterLab, if you are familiar with them).

  • rmarkdown

    Dynamic Documents for R

    Project mention: Obsidian Compatibility with R | reddit.com/r/ObsidianMD | 2022-05-31

    I would start here: https://rmarkdown.rstudio.com. Personally, I would use this instead of Obsidian if I also wanted working R code.

  • DifferentialEquations.jl

    Multi-language suite for high-performance solvers of differential equations and scientific machine learning (SciML) components

    Project mention: When is julia getting proper precompilation? | reddit.com/r/Julia | 2021-12-10

    It's not faith, and it's not all from Julia itself. https://github.com/SciML/DifferentialEquations.jl/issues/785 should reduce compile times of what OP mentioned for example.

  • m2cgen

    Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

  • datasciencecoursera

    Data Science Repo and blog for Johns Hopkins Coursera Courses. Please let me know if you have any questions.

    Project mention: datasciencecoursera: NEW Courses - star count:2053.0 | reddit.com/r/algoprojects | 2022-06-25
  • Data-science-best-resources

    Carefully curated resource links for data science in one place

    Project mention: ⚙️ Data Science Collected Resources: A trove of carefully curated resources and links (on the topics of software, platforms, language, techniques, etc.) related to #DataScience, all in one place. h/t @Sauain | reddit.com/r/policerewired | 2021-09-21
NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2022-06-26.

What are some of the best open-source R projects? This list will help you:

Rank  Project                      Stars
1     Apache Spark                 33,221
2     Graal                        17,164
3     dash                         16,700
4     Prophet                      14,576
5     LightGBM                     13,926
6     ds-cheatsheets               10,649
7     mal                          8,704
8     catboost                     6,597
9     wowchemy-hugo-themes         6,527
10    H2O                          5,856
11    metaflow                     5,735
12    ggplot2                      5,465
13    awesome-R                    5,070
14    papermill                    4,743
15    dplyr                        4,128
16    r4ds                         3,536
17    wave                         2,968
18    ML-Workspace                 2,601
19    rmarkdown                    2,471
20    DifferentialEquations.jl     2,244
21    m2cgen                       2,121
22    datasciencecoursera          2,058
23    Data-science-best-resources  1,931