scikit-learn

Open-source projects categorized as scikit-learn

Top 23 scikit-learn Open-Source Projects

  • ML-For-Beginners

    12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

  • Project mention: Good coding groups for black women? | news.ycombinator.com | 2024-01-13

    - https://github.com/microsoft/ML-For-Beginners

    Also check out this list Pitt puts out every year:

  • 100-Days-Of-ML-Code

    100 Days of ML Coding

  • Project mention: Top 10 GitHub Repositories Every Developer Should Bookmark in 2024 | dev.to | 2024-02-07

    2) 100 Days of ML Code: Embark on a 100-day journey into the fascinating world of machine learning with this structured curriculum. Packed with bite-sized coding challenges and real-world projects, this repository will transform you from a coding novice to a confident ML enthusiast. (https://github.com/Avik-Jain/100-Days-Of-ML-Code)

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • PythonDataScienceHandbook

    Python Data Science Handbook: full text in Jupyter Notebooks

  • Project mention: About Data analyst, data scientist and data engineer, resources and experiences | dev.to | 2024-03-26

    Python Data Science Handbook

  • data-science-ipython-notebooks

    Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • handson-ml

    ⛔️ DEPRECATED – See https://github.com/ageron/handson-ml3 instead.

  • best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

  • onnxruntime

    ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

  • Project mention: Machine Learning with PHP | dev.to | 2024-04-22

    ONNX Runtime: ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • python-machine-learning-book

    The "Python Machine Learning (1st edition)" book code repository and info resource

  • Dask

    Parallel computing with task scheduling

  • Project mention: The Distributed Tensor Algebra Compiler (2022) | news.ycombinator.com | 2023-06-15
  • mlcourse.ai

    Open Machine Learning Course

  • Project mention: Open Machine Learning Course | news.ycombinator.com | 2023-10-22
  • auto-sklearn

    Automated Machine Learning with scikit-learn

  • sktime

    A unified framework for machine learning with time series

  • autogluon

    AutoGluon: Fast and Accurate ML in 3 Lines of Code

  • Project mention: pip install remyxai - easiest way to create custom vision models | /r/computervision | 2023-04-25

    This seems not very convincing. There are other popular frameworks that provide AutoML with existing datasets (eg https://github.com/autogluon/autogluon)

  • featuretools

    An open source python library for automated feature engineering

  • Project mention: Featuretools – A Python Library for Automated Feature Engineering | news.ycombinator.com | 2023-09-20
  • interpret

    Fit interpretable models. Explain blackbox machine learning.

  • skorch

    A scikit-learn compatible neural network library that wraps PyTorch

  • orange

    🍊 :bar_chart: :bulb: Orange: Interactive data analysis

  • Project mention: Hierarchical Clustering | news.ycombinator.com | 2024-04-20

    I know I've tooted its horn before, but Orange3 is a pretty neat Python-based GUI platform that makes this and a metric buttload of other statistical/ML techniques available to non-programmer types.

    Just watch out for null character `x00` in the corpus. That always seems to kill it stone dead.

    https://orangedatamining.com/

    https://orange3.readthedocs.io/projects/orange-visual-progra...

  • machine_learning_complete

    A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

  • python-machine-learning-book-3rd-edition

    The "Python Machine Learning (3rd edition)" book code repository

  • superduperdb

    🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • yellowbrick

    Visual analysis and diagnostic tools to facilitate machine learning model selection.

  • flower

    Flower: A Friendly Federated Learning Framework (by adap)

  • Project mention: Pyenv – lets you easily switch between multiple versions of Python | news.ycombinator.com | 2024-03-25

    We use Pyenv successfully for developing the Flower open-source project. We use a few simple Bash scripts to manage virtual environments with different Python versions via pyenv and the pyenv-virtualenv plugin.

    The main scripts are `venv-create.sh`, `venv-delete.sh` and `bootstrap.sh`. `venv-reset.sh` pulls these three scripts together to make reinstalling your venv a single command.

    Here's the link if anyone is interested: https://github.com/adap/flower/tree/main/dev

  • FLAML

    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.

  • Project mention: AutoGen: Enabling Next-Gen GPT-X Applications | news.ycombinator.com | 2023-08-22

    I really like the simplicity of this framework, and they hit on a lot of common problems found in other agent-based frameworks. Most intrigued by the RAG improvements.

    Seems like Microsoft was frustrated with the pace of movement in this space and the shitty results of agents (which admittedly kept my interest turned away from agents for the last few months). I'm interested again because it makes practical sense, and from looking at the example notebooks, seems fairly easy to integrate into existing applications.

    Maybe this is the 'low code' approach that might actually work, and bridge together engineering and non-engineering resources.

    This example was what caught my eye: https://github.com/microsoft/FLAML/blob/main/notebook/autoge...

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

scikit-learn related posts

Index

What are some of the best open-source scikit-learn projects? This list will help you:

Project Stars
1 ML-For-Beginners 66,806
2 100-Days-Of-ML-Code 43,200
3 PythonDataScienceHandbook 41,407
4 data-science-ipython-notebooks 26,459
5 handson-ml 25,090
6 best-of-ml-python 15,302
7 onnxruntime 12,656
8 python-machine-learning-book 12,076
9 Dask 11,982
10 mlcourse.ai 9,390
11 auto-sklearn 7,394
12 sktime 7,387
13 autogluon 7,091
14 featuretools 7,017
15 interpret 5,988
16 skorch 5,614
17 orange 4,604
18 machine_learning_complete 4,501
19 python-machine-learning-book-3rd-edition 4,386
20 superduperdb 4,327
21 yellowbrick 4,194
22 flower 4,166
23 FLAML 3,671

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com