InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 23 Python Machinelearning Projects
-
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
-
vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
-
clearml
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
-
-
In cases where a company possesses a strong technological foundation and faces a substantial workload demanding advanced vector search capabilities, its ideal solution lies in adopting a specialized vector database. Prominent options in this domain include Chroma (having raised $20 million), Zilliz (having raised $113 million), Pinecone (having raised $138 million), Qdrant (having raised $9.8 million), Weaviate (having raised $67.7 million), LanceDB (YC W22), Vespa, Marqo, and others. Many of these players have secured significant funding in recent years and are well-positioned to capture notable market share. These vector databases offer efficient storage, indexing, and similarity search functionalities for vectors. They often incorporate specific optimizations tailored for vector data, such as similarity search based on inverted indexes and efficient vector computations. As a result, they cater to the requirements of companies operating in areas like recommendation systems, image search, and natural language processing.
-
Project mention: Sparrow: Open-source data processing with ML, LLM and Vision LLM | news.ycombinator.com | 2025-02-17
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
igel
a delightful machine learning tool that allows you to train, test, and use models without writing code
-
-
-
-
-
Project mention: LiuAlgoTrader VS QTradeX-Algo-Trading-SDK - a user suggested alternative | libhunt.com/r/LiuAlgoTrader | 2025-05-28
-
retentioneering-tools
Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream, AB tests, machine learning, and Markov Chain simulations.
-
covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments. (by AgnostiqHQ)
-
-
ai-hub-models
The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Project mention: Recapping the AI, Machine Learning and Computer Meetup — November 14, 2024 | dev.to | 2024-11-15In this talk we address the common challenges faced by developers migrating AI workloads from the cloud to edge devices. Qualcomm aims to democratize AI at the edge, easing the transition to the edge by supporting familiar frameworks and data types. This is where Qualcomm AI Hub comes in. Developers can follow along, gaining knowledge and tools to efficiently deploy optimized models on real devices using Qualcomm AI Hub.
-
Machine-Learning-Guide
Machine learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.
-
-
-
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Python Machinelearning discussion
Python Machinelearning related posts
-
TensorFlow implementation for optimizers
-
Show HN: TensorFlow Implementation for Optimizer
-
Ask HN: What's your serverless stack for AI/LLM apps in production?
-
AI Search That Understands the Way Your Customer's Think
-
Ask HN: Is there any good semantic search GUI for images or documents?
-
Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse
-
It was not "Good First Issue"
-
A note from our sponsor - InfluxDB
www.influxdata.com | 20 Jun 2025
Index
What are some of the best open-source Machinelearning projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | horovod | 14,510 |
2 | ludwig | 11,496 |
3 | vaex | 8,390 |
4 | clearml | 6,052 |
5 | awesome-open-gpt | 5,835 |
6 | marqo | 4,890 |
7 | sparrow | 4,577 |
8 | igel | 3,112 |
9 | tslearn | 2,994 |
10 | nannyml | 2,077 |
11 | nsfw_model | 1,918 |
12 | pytorch2keras | 860 |
13 | LiuAlgoTrader | 837 |
14 | retentioneering-tools | 834 |
15 | covalent | 835 |
16 | ActionAI | 801 |
17 | ai-hub-models | 717 |
18 | Machine-Learning-Guide | 610 |
19 | dreamGPT | 576 |
20 | MetaSpore | 536 |
21 | CodeRL | 534 |
22 | hydra-zen | 385 |
23 | deep-significance | 335 |