Python scikit-learn

Open-source Python projects categorized as scikit-learn

Top 23 Python scikit-learn Projects

  • data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

  • Dask

    Parallel computing with task scheduling

    Project mention: The Distributed Tensor Algebra Compiler (2022) | news.ycombinator.com | 2023-06-15
  • mlcourse.ai

    Open Machine Learning Course

    Project mention: Open Machine Learning Course | news.ycombinator.com | 2023-10-22
  • auto-sklearn

    Automated Machine Learning with scikit-learn

  • sktime

    A unified framework for machine learning with time series

  • autogluon

    AutoGluon: Fast and Accurate ML in 3 Lines of Code

    Project mention: pip install remyxai - easiest way to create custom vision models | /r/computervision | 2023-04-25

    This seems not very convincing. There are other popular frameworks that provide AutoML with existing datasets (eg https://github.com/autogluon/autogluon)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

  • featuretools

    An open source python library for automated feature engineering

    Project mention: Featuretools – A Python Library for Automated Feature Engineering | news.ycombinator.com | 2023-09-20
  • orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

    Project mention: Orange Data Mining | news.ycombinator.com | 2024-04-15
  • superduperdb

    🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

    Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

  • flower

    Flower: A Friendly Federated Learning Framework (by adap)

    Project mention: Pyenv – lets you easily switch between multiple versions of Python | news.ycombinator.com | 2024-03-25

    We use Pyenv successfully for developing the Flower open-source project. We use a few simple Bash scripts to manage virtual environments with different Python versions via pyenv and the pyenv-virtualenv plugin.

    The main scripts are `venv-create.sh`, `venv-delete.sh` and `bootstrap.sh`. `venv-reset.sh` pulls these three scripts together to make reinstalling your venv a single command.

    Here's the link if anyone is interested: https://github.com/adap/flower/tree/main/dev

  • Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials

    A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

  • hummingbird

    Hummingbird compiles trained ML models into tensor computation for faster inference.

    Project mention: Treebomination: Convert a scikit-learn decision tree into a Keras model | news.ycombinator.com | 2023-06-11
  • igel

    a delightful machine learning tool that allows you to train, test, and use models without writing code

  • mljar-supervised

    Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation

    Project mention: Show HN: Web App with GUI for AutoML on Tabular Data | news.ycombinator.com | 2023-08-24

    Web App is using two open-source packages that I've created:

    - MLJAR AutoML - Python package for AutoML on tabular data https://github.com/mljar/mljar-supervised

    - Mercury - framework for converting Jupyter Notebooks into Web App https://github.com/mljar/mercury

    You can run Web App locally. What is more, you can adjust notebook's code for your needs. For example, you can set different validation strategies or evalutaion metrics or longer training times. The notebooks in the repo are good starting point for you to develop more advanced apps.

  • scikit-llm

    Seamlessly integrate LLMs into scikit-learn.

    Project mention: Show HN: Scikit-LLM seamlessly integrate LLMs into scikit-learn | news.ycombinator.com | 2023-08-25
  • m2cgen

    Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

    Project mention: How to use python ML script in tauri? | /r/rust | 2023-05-02

    Check out: https://github.com/BayesWitnesses/m2cgen

  • mars

    Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

  • MachineLearningStocks

    Using python and scikit-learn to make stock predictions

    Project mention: Scikit-learn Stock Prediction: using fundamental and pricing data to predict future stock returns. Sklearn's randomforest classifier is trainded and author claimed positive live trading results. Not actively mainained Other Models - star count:1520.0 | /r/algoprojects | 2023-08-28
  • AutoViz

    Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

  • jetson-containers

    Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

    Project mention: Install ros 2 humble on jetson orin | /r/ROS | 2023-06-25

    https://github.com/dusty-nv/jetson-containers This one might be helpful

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2024-04-15.

Python scikit-learn related posts

Index

What are some of the best open-source scikit-learn projects in Python? This list will help you:

Project Stars
1 data-science-ipython-notebooks 26,438
2 best-of-ml-python 15,284
3 Dask 11,965
4 mlcourse.ai 9,382
5 auto-sklearn 7,388
6 sktime 7,387
7 autogluon 7,050
8 featuretools 7,010
9 orange 4,594
10 superduperdb 4,304
11 yellowbrick 4,192
12 flower 4,116
13 Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials 3,634
14 hummingbird 3,296
15 igel 3,080
16 mljar-supervised 2,924
17 scikit-llm 2,917
18 m2cgen 2,703
19 mars 2,673
20 modAL 2,132
21 MachineLearningStocks 1,642
22 AutoViz 1,617
23 jetson-containers 1,596
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com