Regression

Top 23 Regression Open-Source Projects

  • pycaret

    An open-source, low-code machine learning library in Python

  • Smile

    Statistical Machine Intelligence & Learning Engine

  • Project mention: The Current State of Clojure's Machine Learning Ecosystem | news.ycombinator.com | 2024-04-07

    > I don't think it's right to recommend that new users move away from the package because of licensing issues

    I was going to chime in to agree but then I saw how this was done - a completely innocuous looking commit:

    https://github.com/haifengl/smile/commit/6f22097b233a3436519...

    And literally no mention in the release notes:

    https://github.com/haifengl/smile/releases/tag/v3.0.0

    I think if you are going to change license especially in a way that makes it less permissive you need to be super open and clear about both the fact you are doing it and your reasons for that. This is done so silently as to look like it is intentionally trying to mislead and trick people.

    So maybe I wouldn't say to move away because of the specific license, but it's legitimate to avoid something when it's so clearly driven by a single entity and that entity acts in a way that isn't trustworthy.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • mlpack

    mlpack: a fast, header-only C++ machine learning library

  • Project mention: How much C++ is used when it comes to performing quant research? | /r/quant | 2023-07-03

    Does C++ have the equivalent of Pandas or Apache Spark? Are there extensive libraries that exist/are being developed that allow you to perform operations with data? Or do people just use a combination of Python & its various libraries (NumPy etc)? If we leave aside the data bit, are there libraries that allow you to develop ML models in C++ (mlpack for instance ) faster & more efficiently compared to their Python counterparts (scikit-learn)? On a more general note, how does C++ fit into the routine of a Quant Researcher? And at what scale does an organization decide they need to start switching to other languages and spend more time developing the code ?

  • tsai

    Time series Timeseries Deep Learning Machine Learning Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai

  • Project mention: Aeon: A unified framework for machine learning with time series | news.ycombinator.com | 2023-06-22

    Also https://github.com/timeseriesAI/tsai

  • orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

  • Project mention: Hierarchical Clustering | news.ycombinator.com | 2024-04-20

    I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.

    Just watch out for null character `x00` in the corpus. That always seems to kill it stone dead.

    https://orangedatamining.com/

    https://orange3.readthedocs.io/projects/orange-visual-progra...

  • FLAML

    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

  • Project mention: AutoGen: Enabling Next-Gen GPT-X Applications | news.ycombinator.com | 2023-08-22

    I really like the simplicity of this framework, and they hit on a lot of common problems found in other agent-based frameworks. Most intrigued by the RAG improvements.

    Seems like Microsoft was frustrated with the pace of movement in this space and the shitty results of agents (which admittedly kept my interest turned away from agents for the last few months). I'm interested again because it makes practical sense, and from looking at the example notebooks, seems fairly easy to integrate into existing applications.

    Maybe this is the 'low code' approach that might actually work, and bridge together engineering and non-engineering resources.

    This example was what caught my eye: https://github.com/microsoft/FLAML/blob/main/notebook/autoge...

  • MathNet

    Math.NET Numerics

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • simple-statistics

    simple statistics for node & browser javascript

  • Project mention: Simple Statistics: Statistical methods in JavaScript for browsers and servers | news.ycombinator.com | 2023-06-24

    This project is nice enough, but I wonder what is the point of some of the simpler functions.

    The maxSorted function for example is as trivial as it gets: https://github.com/simple-statistics/simple-statistics/blob/... It would be actually simpler for everyone without the abstraction.

  • lazypredict

    Lazy Predict help build a lot of basic models without much code and helps understand which models works better without any parameter tuning

  • Math PHP

    Powerful modern math library for PHP: Features descriptive statistics and regressions; Continuous and discrete probability distributions; Linear algebra with matrices and vectors, Numerical analysis; special mathematical functions; Algebra

  • RubixML

    A high-level machine learning and deep learning library for the PHP language.

  • Project mention: Machine learning and deep learning library for the PHP language | news.ycombinator.com | 2023-11-04
  • orbit

    A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood. (by uber)

  • MLJ.jl

    A Julia machine learning framework

  • MLBox

    MLBox is a powerful Automated Machine Learning python library.

  • BeautifulAlgorithms.jl

    Concise and beautiful algorithms written in Julia

  • Tribuo

    Tribuo - A Java machine learning library

  • Project mention: FLaNK Weekly 08 Jan 2024 | dev.to | 2024-01-08
  • owl

    Owl - OCaml Scientific Computing @ https://ocaml.xyz

  • Project mention: Owl project (OCaml scientific computing) formally concluded | news.ycombinator.com | 2024-02-19
  • MAPIE

    A scikit-learn-compatible module for estimating prediction intervals.

  • reassure

    Performance testing companion for React and React Native

  • AlphaPy

    Python AutoML for Trading Systems and Sports Betting

  • Project mention: AlphaPy: machine learning framework built on sklearn and pandas. Support pyfolio/xgboost/lightgmb/catboost(gradient boosting on decision tress) etc. Examples include financial market prediction/sports prediction/kaggle. Configurations are set though | /r/algoprojects | 2023-12-10
  • mlr3

    mlr3: Machine Learning in R - next generation

  • PyABSA

    Sentiment Analysis, Text Classification, Text Augmentation, Text Adversarial defense, etc.;

  • RustQuant

    Rust library for quantitative finance.

  • Project mention: RustQuant: A Library for Quantitative Finance | news.ycombinator.com | 2024-02-01
  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Regression related posts

Index

What are some of the best open-source Regression projects? This list will help you:

Project Stars
1 pycaret 8,385
2 Smile 5,921
3 mlpack 4,797
4 tsai 4,659
5 orange 4,604
6 FLAML 3,671
7 MathNet 3,383
8 simple-statistics 3,326
9 lazypredict 2,661
10 Math PHP 2,303
11 RubixML 1,972
12 orbit 1,799
13 MLJ.jl 1,720
14 MLBox 1,474
15 BeautifulAlgorithms.jl 1,305
16 Tribuo 1,218
17 owl 1,179
18 MAPIE 1,150
19 reassure 1,097
20 AlphaPy 1,049
21 mlr3 879
22 PyABSA 854
23 RustQuant 853

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com