Python Machinelearning

Open-source Python projects categorized as Machinelearning

Top 23 Python Machinelearning Projects

Machinelearning
  1. horovod

    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

  2. ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

  3. vaex

    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

  4. clearml

    ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

  5. awesome-open-gpt

    Collection of Open Source Projects Related to GPT, a curated selection 🚀🔥🔥

  6. marqo

    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

    Project mention: Why You Shouldn’t Invest In Vector Databases? | dev.to | 2025-04-24

    In cases where a company possesses a strong technological foundation and faces a substantial workload demanding advanced vector search capabilities, its ideal solution lies in adopting a specialized vector database. Prominent options in this domain include Chroma (having raised $20 million), Zilliz (having raised $113 million), Pinecone (having raised $138 million), Qdrant (having raised $9.8 million), Weaviate (having raised $67.7 million), LanceDB (YC W22), Vespa, Marqo, and others. Many of these players have secured significant funding in recent years and are well-positioned to capture notable market share. These vector databases offer efficient storage, indexing, and similarity search functionalities for vectors. They often incorporate specific optimizations tailored for vector data, such as similarity search based on inverted indexes and efficient vector computations. As a result, they cater to the requirements of companies operating in areas like recommendation systems, image search, and natural language processing.

  7. sparrow

    Data processing and instruction calling with ML, LLM and Vision LLM (by katanaml)

    Project mention: Sparrow: Open-source data processing with ML, LLM and Vision LLM | news.ycombinator.com | 2025-02-17

  8. igel

    A delightful machine learning tool that allows you to train, test, and use models without writing code

  9. tslearn

    The machine learning toolkit for time series analysis in Python

  10. nannyml

    nannyml: post-deployment data science in Python

    Project mention: Personal Picks: Data Product News (June 11, 2025) | dev.to | 2025-06-10

  11. nsfw_model

    Keras model of NSFW detector

  12. pytorch2keras

    PyTorch to Keras model converter

  13. LiuAlgoTrader

    Framework for algorithmic trading

    Project mention: LiuAlgoTrader VS QTradeX-Algo-Trading-SDK - a user suggested alternative | libhunt.com/r/LiuAlgoTrader | 2025-05-28

  14. retentioneering-tools

    Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream, AB tests, machine learning, and Markov Chain simulations.

  15. covalent

    Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments. (by AgnostiqHQ)

  16. ActionAI

    Real-Time Spatio-Temporally Localized Activity Detection by Tracking Body Keypoints

  17. ai-hub-models

    The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

    Project mention: Recapping the AI, Machine Learning and Computer Meetup — November 14, 2024 | dev.to | 2024-11-15

    In this talk we address the common challenges faced by developers migrating AI workloads from the cloud to edge devices. Qualcomm aims to democratize AI at the edge, easing the transition to the edge by supporting familiar frameworks and data types. This is where Qualcomm AI Hub comes in. Developers can follow along, gaining knowledge and tools to efficiently deploy optimized models on real devices using Qualcomm AI Hub.

  18. Machine-Learning-Guide

    Machine Learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.

  19. dreamGPT

    Leverage hallucinations from Large Language Models (LLMs) for novelty-driven explorations.

  20. MetaSpore

    A unified end-to-end machine intelligence platform

  21. CodeRL

    This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS 2022).

  22. hydra-zen

    Create powerful Hydra applications without the yaml files and boilerplate code.

  23. deep-significance

    Enabling easy statistical significance testing for deep neural networks.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Machinelearning related posts

  • TensorFlow implementation for optimizers

    1 project | dev.to | 8 May 2025
  • Show HN: TensorFlow Implementation for Optimizer

    1 project | news.ycombinator.com | 8 Apr 2025
  • Ask HN: What's your serverless stack for AI/LLM apps in production?

    1 project | news.ycombinator.com | 10 Jan 2025
  • AI Search That Understands the Way Your Customer's Think

    1 project | news.ycombinator.com | 28 May 2024
  • Ask HN: Is there any good semantic search GUI for images or documents?

    2 projects | news.ycombinator.com | 17 Jan 2024
  • Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse

    1 project | news.ycombinator.com | 23 Nov 2023
  • It was not "Good First Issue"

    1 project | dev.to | 8 Oct 2023

Index

What are some of the best open-source Machinelearning projects in Python? This list will help you:

#   Project                  Stars
1   horovod                  14,510
2   ludwig                   11,496
3   vaex                     8,390
4   clearml                  6,052
5   awesome-open-gpt         5,835
6   marqo                    4,890
7   sparrow                  4,577
8   igel                     3,112
9   tslearn                  2,994
10  nannyml                  2,077
11  nsfw_model               1,918
12  pytorch2keras            860
13  LiuAlgoTrader            837
14  retentioneering-tools    834
15  covalent                 835
16  ActionAI                 801
17  ai-hub-models            717
18  Machine-Learning-Guide   610
19  dreamGPT                 576
20  MetaSpore                536
21  CodeRL                   534
22  hydra-zen                385
23  deep-significance        335

Did you know that Python is the 2nd most popular programming language based on number of references?