Top 23 Python ML Projects
-
There are several implementations of the YOLO algorithm available, but for ease-of-use, we will use the Ultralytics implementation in this guide. We will implement and test the code locally and then deploy to Koyeb's GPUs for higher inference speed.
-
MindsDB
AGI's query engine - Platform for building AI that can learn and answer questions over federated data.
This Hacktoberfest I took a big step toward contributing to open source: I contributed to MindsDB, a platform for building AI from enterprise data that enables smarter organizations.
-
Model management: MLflow and Kubeflow are used to manage the deployment and lifecycle of models.
-
Best of ml python
-
Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07
This is a great project, a little bit similar to https://github.com/ludwig-ai/ludwig, but it includes testing capabilities and ablation.
Questions regarding the LLM testing aspect: How extensive is the test coverage for LLM use cases, and what is the current state of this project area? Do you offer any guarantees, or is it considered an open-ended problem?
Would love to see more progress in this area!
-
OpenLLM
Run any open-source LLM, such as Llama or Mistral, as an OpenAI-compatible API endpoint in the cloud.
OpenLLM is a platform that lets developers put open-source large language models (LLMs) to work. It is like a Swiss Army knife for LLMs: a set of tools that helps developers overcome deployment hurdles.
-
Project mention: Show HN: Flow - A Dynamic Task Engine for AI Agents Without DAG | news.ycombinator.com | 2024-12-02
Interesting! I feel like this is a cross between https://github.com/dagworks-inc/burr (switch state for context) and https://github.com/Netflix/metaflow because the output of the "task" declares its next hop...
-
deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Finally, we stored these vectors in our chosen database: the activeloop DeepLake database. This database is open source, something near and dear to our own open-source hearts. We will cover some additional details in a further section, but it is specifically designed to handle vector data and perform efficient similarity searches, which is crucial for quick and accurate retrieval during the RAG process.
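The similarity search mentioned above can be illustrated with a minimal, library-free sketch. This is not the Deep Lake API: the `cosine_similarity` and `top_k` helpers, the `store` dictionary, and the toy three-dimensional embeddings are all hypothetical, and the brute-force scan shown here stands in for the indexed search a real vector database would perform.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(
    query: Sequence[float],
    store: Dict[str, Sequence[float]],
    k: int = 2,
) -> List[Tuple[str, float]]:
    """Score every stored vector against the query and return the k best matches."""
    scored = [(doc_id, cosine_similarity(query, vec)) for doc_id, vec in store.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings; real ones would come from an embedding model.
store = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}

# Retrieve the two documents closest to the query vector.
results = top_k([1.0, 0.0, 0.0], store, k=2)
```

In a RAG pipeline, the documents behind the returned ids would then be passed to the LLM as context.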
-
Project mention: RFC: The Feast Kubernetes Operator (The Open Source Feature Store) | news.ycombinator.com | 2024-09-24
Hey folks!
I'm a maintainer for Feast (https://github.com/feast-dev/feast) (the Open Source Feature Store) and the Feast community is working on creating a Kubernetes Operator for deploying Feast on Kubernetes and would love any feedback you have before we get started!
Here is the GitHub issue: https://github.com/feast-dev/feast/issues/4561, a design doc: https://docs.google.com/document/d/1vGKMizf3_14IyiF_W_Ik7CR0..., and a Slack channel: https://communityinviter.com/apps/feastopensource/feast-the-...!
Thanks a ton in advance for your interest/comments!
-
To instrument the app, you will use the CLI tool opentelemetry-instrument, which sets up auto-instrumentation in your code without boilerplate configuration, together with the OTel Python SDK and OpenLLMetry for manual instrumentation and for instrumentation specific to LLM SDKs, as the file ./app/telemetry.py in the example does.
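The contents of ./app/telemetry.py are not reproduced in this excerpt. As a rough idea of what manual instrumentation amounts to, here is a standard-library-only sketch: it wraps a unit of work in a span that records a name, attributes, status, and duration. It deliberately does not use the OpenTelemetry or OpenLLMetry APIs; the `span` helper, the `FINISHED_SPANS` list, and `fake_llm_call` are hypothetical names made up for illustration.

```python
import time
from contextlib import contextmanager
from typing import Any, Dict, Iterator, List

# Hypothetical in-memory sink; a real setup would export spans to a collector.
FINISHED_SPANS: List[Dict[str, Any]] = []


@contextmanager
def span(name: str, **attributes: Any) -> Iterator[Dict[str, Any]]:
    """Record the name, attributes, status, and duration of the wrapped block."""
    record: Dict[str, Any] = {
        "name": name,
        "attributes": dict(attributes),
        "status": "ok",
    }
    start = time.perf_counter()
    try:
        yield record
    except Exception as exc:
        # Mark the span as failed, then let the error propagate to the caller.
        record["status"] = "error"
        record["attributes"]["error.type"] = type(exc).__name__
        raise
    finally:
        record["duration_s"] = time.perf_counter() - start
        FINISHED_SPANS.append(record)


def fake_llm_call(prompt: str) -> str:
    """Stand-in for an LLM SDK call (hypothetical)."""
    with span("llm.completion", prompt_chars=len(prompt)) as current:
        answer = prompt.upper()
        # Attributes can be attached while the span is still open.
        current["attributes"]["completion_chars"] = len(answer)
        return answer


reply = fake_llm_call("hello")
```

Auto-instrumentation does the same kind of wrapping for supported libraries without code changes, which is why manual spans are usually reserved for application-specific steps.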
-
superduper
Superduper: Build end-to-end AI applications and agent workflows on your existing data infrastructure and preferred tools - without migrating your data.
Project mention: Build fully portable AI applications on top of Snowflake with SuperDuperDB | dev.to | 2024-06-26
Customize how AI and databases work together. Scale your AI projects to handle more data and users. Move AI projects between different environments easily. Extend the system with new AI features and database functionality. Check it out: Blog: https://blog.superduperdb.com/version-02 GitHub: https://github.com/SuperDuperDB/superduperdb (leave us a star ⭐🥳)
-
deepchecks
Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
-
RasaGPT
💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram
-
NannyML is a Python library specialized in post-deployment monitoring and maintenance of machine learning (ML) models. It enables data scientists to detect and address silent model failure, estimate model performance without immediate ground truth data, and identify data drift that might be responsible for performance degradation.
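To make the idea of data drift concrete, here is a minimal standard-library sketch of one generic drift statistic, the Population Stability Index (PSI), which compares how a feature is distributed in a reference sample and in a later sample. This is not NannyML's API and not necessarily the method it uses; the `histogram` and `psi` helpers, the bin count, and the `eps` smoothing term are assumptions chosen for illustration.

```python
import math
from typing import List, Sequence


def histogram(values: Sequence[float], edges: Sequence[float]) -> List[float]:
    """Fraction of values per bin; out-of-range values go to the first or last bin."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        idx = 0
        while idx < len(counts) - 1 and v >= edges[idx + 1]:
            idx += 1
        counts[idx] += 1
    total = len(values)
    return [c / total for c in counts]


def psi(
    reference: Sequence[float],
    current: Sequence[float],
    bins: int = 4,
    eps: float = 1e-6,
) -> float:
    """Population Stability Index of `current` against `reference`."""
    lo, hi = min(reference), max(reference)
    width = (hi - lo) / bins
    if width == 0:
        width = 1.0  # degenerate case: every reference value is identical
    edges = [lo + i * width for i in range(bins + 1)]
    ref = histogram(reference, edges)
    cur = histogram(current, edges)
    # eps keeps the logarithm finite when a bin is empty in one of the samples.
    return sum((c - r) * math.log((c + eps) / (r + eps)) for r, c in zip(ref, cur))


reference = [float(v) for v in range(100)]
shifted = [v + 50.0 for v in reference]

no_drift = psi(reference, reference)  # identical samples
drift = psi(reference, shifted)       # the whole distribution moved upward
```

A value near zero means the two samples fill the bins in the same proportions; the larger the value, the more the feature's distribution has moved since the reference period.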
-
Python ML discussion
Python ML related posts
-
Understanding the MLOps Lifecycle
-
How to Use KitOps with MLflow
-
Top 10 MLOps Tools for 2025
-
Building SDKs for MindsDB this Hacktoberfest
-
Top 8 OpenSource Tools for AI Startups
-
10 MLOps Tools That Comply With the EU AI Act
-
Top open-source repos/projects to contribute (Hacktoberfest Edition)
-
A note from our sponsor - SaaSHub
www.saashub.com | 15 Jan 2025
Index
What are some of the best open-source ML projects in Python? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | yolov5 | 51,849 |
2 | MindsDB | 27,076 |
3 | MLflow | 19,220 |
4 | best-of-ml-python | 18,704 |
5 | ludwig | 11,272 |
6 | OpenLLM | 10,368 |
7 | metaflow | 8,409 |
8 | deeplake | 8,294 |
9 | CoreML-Models | 6,518 |
10 | feast | 5,715 |
11 | aim | 5,294 |
12 | openllmetry | 5,269 |
13 | superduper | 4,907 |
14 | zenml | 4,315 |
15 | awesome-mlops | 4,223 |
16 | deepchecks | 3,666 |
17 | polyaxon | 3,592 |
18 | hub | 3,493 |
19 | zvt | 3,335 |
20 | RasaGPT | 2,350 |
21 | ScaledYOLOv4 | 2,020 |
22 | nannyml | 2,008 |
23 | GPflow | 1,856 |