Python scikit-learn

Open-source Python projects categorized as scikit-learn

Top 23 Python scikit-learn Projects

  • data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

  • Dask

    Parallel computing with task scheduling

  • Project mention: The Distributed Tensor Algebra Compiler (2022) | news.ycombinator.com | 2023-06-15
  • mlcourse.ai

    Open Machine Learning Course

  • Project mention: Open Machine Learning Course | news.ycombinator.com | 2023-10-22
  • sktime

    A unified framework for machine learning with time series

  • auto-sklearn

    Automated Machine Learning with scikit-learn

  • autogluon

    AutoGluon: Fast and Accurate ML in 3 Lines of Code

  • featuretools

    An open source Python library for automated feature engineering

  • Project mention: Featuretools – A Python Library for Automated Feature Engineering | news.ycombinator.com | 2023-09-20
  • orange

    🍊 📊 💡 Orange: Interactive data analysis

  • Project mention: Hierarchical Clustering | news.ycombinator.com | 2024-04-20

    I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.

    Just watch out for the null character `\x00` in the corpus. That always seems to kill it stone dead.

    https://orangedatamining.com/

    https://orange3.readthedocs.io/projects/orange-visual-progra...
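
The null-character caveat in the comment above can be worked around by cleaning the text before it is loaded. This is a minimal sketch in plain Python, assuming the corpus is available as a list of strings; the helper name `strip_null_chars` and the sample data are hypothetical and not part of Orange's API.

```python
def strip_null_chars(documents):
    """Return a copy of each document with NUL characters removed."""
    return [doc.replace("\x00", "") for doc in documents]


# Hypothetical corpus; the second entry contains an embedded NUL character.
corpus = ["hierarchical clustering", "bad\x00 row", "orange"]
cleaned = strip_null_chars(corpus)
```

The cleaned list can then be written back to a file and imported as usual.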

  • superduperdb

    🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

  • flower

    Flower: A Friendly Federated Learning Framework (by adap)

  • Project mention: Pyenv – lets you easily switch between multiple versions of Python | news.ycombinator.com | 2024-03-25

    We use Pyenv successfully for developing the Flower open-source project. We use a few simple Bash scripts to manage virtual environments with different Python versions via pyenv and the pyenv-virtualenv plugin.

    The main scripts are `venv-create.sh`, `venv-delete.sh` and `bootstrap.sh`. `venv-reset.sh` pulls these three scripts together to make reinstalling your venv a single command.

    Here's the link if anyone is interested: https://github.com/adap/flower/tree/main/dev

  • Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials

    A comprehensive list of Deep Learning / Artificial Intelligence and Machine Learning tutorials - rapidly expanding into areas of AI/Deep Learning / Machine Vision / NLP and industry specific areas such as Climate / Energy, Automotives, Retail, Pharma, Medicine, Healthcare, Policy, Ethics and more.

  • hummingbird

    Hummingbird compiles trained ML models into tensor computation for faster inference.

  • Project mention: Treebomination: Convert a scikit-learn decision tree into a Keras model | news.ycombinator.com | 2023-06-11
  • igel

    a delightful machine learning tool that allows you to train, test, and use models without writing code

  • mljar-supervised

    Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation

  • Project mention: Show HN: Web App with GUI for AutoML on Tabular Data | news.ycombinator.com | 2023-08-24

    Web App is using two open-source packages that I've created:

    - MLJAR AutoML - Python package for AutoML on tabular data https://github.com/mljar/mljar-supervised

    - Mercury - framework for converting Jupyter Notebooks into Web App https://github.com/mljar/mercury

    You can run the Web App locally. What is more, you can adjust the notebooks' code to your needs. For example, you can set different validation strategies, evaluation metrics, or longer training times. The notebooks in the repo are a good starting point for developing more advanced apps.

  • scikit-llm

    Seamlessly integrate LLMs into scikit-learn.

  • Project mention: Show HN: Scikit-LLM seamlessly integrate LLMs into scikit-learn | news.ycombinator.com | 2023-08-25
  • m2cgen

    Transform ML models into a native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart, Haskell, Ruby, F#, Rust) with zero dependencies

  • Project mention: How to use python ML script in tauri? | /r/rust | 2023-05-02

    Check out: https://github.com/BayesWitnesses/m2cgen

  • mars

    Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.

  • MachineLearningStocks

    Using Python and scikit-learn to make stock predictions

  • Project mention: Scikit-learn Stock Prediction: using fundamental and pricing data to predict future stock returns. Sklearn's random forest classifier is trained and the author claimed positive live trading results. Not actively maintained. Other Models - star count: 1520.0 | /r/algoprojects | 2023-08-28
  • jetson-containers

    Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

  • Project mention: Install ros 2 humble on jetson orin | /r/ROS | 2023-06-25

    https://github.com/dusty-nv/jetson-containers This one might be helpful

  • AutoViz

    Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Index

What are some of the best open-source scikit-learn projects in Python? This list will help you:

Rank Project Stars
1 data-science-ipython-notebooks 26,459
2 best-of-ml-python 15,335
3 Dask 11,999
4 mlcourse.ai 9,390
5 sktime 7,404
6 auto-sklearn 7,403
7 autogluon 7,091
8 featuretools 7,017
9 orange 4,604
10 superduperdb 4,346
11 yellowbrick 4,194
12 flower 4,166
13 Artificial-Intelligence-Deep-Learning-Machine-Learning-Tutorials 3,643
14 hummingbird 3,302
15 igel 3,080
16 mljar-supervised 2,929
17 scikit-llm 2,920
18 m2cgen 2,707
19 mars 2,677
20 modAL 2,137
21 MachineLearningStocks 1,643
22 jetson-containers 1,624
23 AutoViz 1,621
