Top 23 Python ML Projects

yolov5

129 46,921 8.8 Python

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Project mention: จำแนกสายพันธ์ุหมากับแมวง่ายๆด้วยYoLoV5 | dev.to | 2024-04-15

Ref https://www.youtube.com/watch?v=0GwnxFNfZhM https://github.com/ultralytics/yolov5 https://dev.to/gfstealer666/kaaraich-yolo-alkrithuemainkaartrwcchcchabwatthu-object-detection-3lef https://www.kaggle.com/datasets/devdgohil/the-oxfordiiit-pet-dataset/data

MindsDB

78 21,223 10.0 Python

The platform for customizing AI from enterprise data

Project mention: What’s the Difference Between Fine-tuning, Retraining, and RAG? | dev.to | 2024-04-08

Check us out on GitHub.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
MLflow

55 17,234 9.9 Python

Open source platform for the machine learning lifecycle

Project mention: My Favorite DevTools to Build AI/ML Applications! | dev.to | 2024-04-23

MLflow is an open-source platform for managing the end-to-end machine learning lifecycle. It includes features for experiment tracking, model versioning, and deployment, enabling developers to track and compare experiments, package models into reproducible runs, and manage model deployment across multiple environments.

best-of-ml-python

16 15,302 7.9 Python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
ludwig

3 10,801 9.5 Python

Low-code framework for building custom LLMs, neural networks, and other AI models

Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07

This is a great project, little bit similar to https://github.com/ludwig-ai/ludwig, but it includes testing capabilities and ablation.
questions regarding the LLM testing aspect: How extensive is the test coverage for LLM use cases, and what is the current state of this project area? Do you offer any guarantees, or is it considered an open-ended problem?
Would love to see more progress toward this area!

deeplake

13 7,690 9.8 Python

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

Project mention: FLaNK AI Weekly 25 March 2025 | dev.to | 2024-03-25

metaflow

24 7,586 9.2 Python

:rocket: Build and manage real-life ML, AI, and data science projects with ease!

Project mention: FLaNK Stack 05 Feb 2024 | dev.to | 2024-02-05

InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
CoreML-Models

2 6,221 2.3 Python

Largest list of models for Core ML (for iOS 11+)
feast

8 5,255 9.3 Python

Feature Store for Machine Learning

Project mention: What's Happening with Feast? | news.ycombinator.com | 2023-12-07

aim

70 4,782 8.0 Python

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Project mention: aim VS cascade - a user suggested alternative | libhunt.com/r/aim | 2023-12-05

superduperdb

24 4,327 9.9 Python

🔮 SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.

Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12

zenml

33 3,657 9.8 Python

ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.

Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01

awesome-mlops

7 3,555 6.8 Python

:sunglasses: A curated list of awesome MLOps tools (by kelvins)

Project mention: Choosing an Orchestrator in a green-field setup | /r/mlops | 2023-12-07

Lots of good projects on https://github.com/kelvins/awesome-mlops too

polyaxon

9 3,479 8.7 Python

MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
hub

1 3,436 3.7 Python

A library for transfer learning by reusing parts of TensorFlow models. (by tensorflow)
deepchecks

15 3,338 8.6 Python

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

Project mention: Detect, Defend, Prevail: Payments Fraud Detection using ML & Deepchecks | dev.to | 2024-01-13

Also if you have any confusion related to it. You can directly go to their discussion section in github :

giskard

7 3,111 10.0 Python

🐢 Open-Source Evaluation & Testing framework for LLMs and ML models

Project mention: Show HN: Evaluate LLM-based RAG Applications with automated test set generation | news.ycombinator.com | 2024-04-11

zvt

1 2,981 5.5 Python

modular quant framework.
deepsparse

21 2,873 9.5 Python

Sparsity-aware deep learning inference runtime for CPUs

Project mention: Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse | news.ycombinator.com | 2023-11-23

Interesting company. Yannic Kilcher interviewed Nir Shavit last year and they went into some depth: https://www.youtube.com/watch?v=0PAiQ1jTN5k DeepSparse is on GitHub: https://github.com/neuralmagic/deepsparse

RasaGPT

8 2,168 5.6 Python

💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram

Project mention: (1/2) May 2023 | /r/dailyainews | 2023-06-02

RasaGPT: headless LLM chatbot platform built on top of Rasa and Langchain (https://github.com/paulpierre/RasaGPT)

ScaledYOLOv4

10 2,017 0.0 Python

Scaled-YOLOv4: Scaling Cross Stage Partial Network
GPflow

1 1,794 5.8 Python

Gaussian processes in TensorFlow
Photonix

54 1,765 0.0 Python

A modern, web-based photo management server. Run it on your home server and it will let you find the right photo from your collection on any device. Smart filtering is made possible by object recognition, face recognition, location awareness, color analysis and other ML algorithms.
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python ML related posts

What’s the Difference Between Fine-tuning, Retraining, and RAG?
1 project | dev.to | 8 Apr 2024
Fine-tuning a Mistral Language Model with Anyscale
2 projects | dev.to | 1 Feb 2024
Launch HN: Encord (YC W21) – Unit testing for computer vision models
2 projects | news.ycombinator.com | 31 Jan 2024
Show HN: Finagg – free and nearly unlimited financial data
1 project | news.ycombinator.com | 21 Jan 2024
FLaNK 15 Jan 2024
21 projects | dev.to | 15 Jan 2024
Detect, Defend, Prevail: Payments Fraud Detection using ML & Deepchecks
1 project | dev.to | 13 Jan 2024
MindsDB Docker Extension: Build ML powered apps at a much faster pace
3 projects | dev.to | 1 Jan 2024
A note from our sponsor - InfluxDB
www.influxdata.com | 26 Apr 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source ML projects in Python? This list will help you:

	Project	Stars
1	yolov5	46,921
2	MindsDB	21,223
3	MLflow	17,234
4	best-of-ml-python	15,302
5	ludwig	10,801
6	deeplake	7,690
7	metaflow	7,586
8	CoreML-Models	6,221
9	feast	5,255
10	aim	4,782
11	superduperdb	4,327
12	zenml	3,657
13	awesome-mlops	3,555
14	polyaxon	3,479
15	hub	3,436
16	deepchecks	3,338
17	giskard	3,111
18	zvt	2,981
19	deepsparse	2,873
20	RasaGPT	2,168
21	ScaledYOLOv4	2,017
22	GPflow	1,794
23	Photonix	1,765