Python Machinelearning

Open-source Python projects categorized as Machinelearning

Top 23 Python Machinelearning Projects

  • horovod

    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

  • ludwig

    Low-code framework for building custom LLMs, neural networks, and other AI models

  • Project mention: Show HN: Toolkit for LLM Fine-Tuning, Ablating and Testing | news.ycombinator.com | 2024-04-07

    This is a great project, little bit similar to https://github.com/ludwig-ai/ludwig, but it includes testing capabilities and ablation.

    questions regarding the LLM testing aspect: How extensive is the test coverage for LLM use cases, and what is the current state of this project area? Do you offer any guarantees, or is it considered an open-ended problem?

    Would love to see more progress toward this area!

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • vaex

    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second ๐Ÿš€

  • clearml

    ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution

  • Project mention: FLaNK Stack Weekly 12 February 2024 | dev.to | 2024-02-12
  • awesome-open-gpt

    Collection of Open Source Projects Related to GPT๏ผŒGPT็›ธๅ…ณๅผ€ๆบ้กน็›ฎๅˆ้›†๐Ÿš€ใ€็ฒพ้€‰๐Ÿ”ฅ๐Ÿ”ฅ

  • Project mention: The best free ChatGPT alternatives | /r/ArtificialInteligence | 2023-06-20

    Extract from awesome-open-gpt

  • marqo

    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

  • Project mention: Are we at peak vector database? | news.ycombinator.com | 2024-01-25

    We (Marqo) are doing a lot on 1 and 2. There is a huge amount to be done on the ML side of vector search and we are investing heavily in it. I think it has not quite sunk in that vector search systems are ML systems and everything that comes with that. I would love to chat about 1 and 2 so feel free to email me (email is in my profile). What we have done so far is here -> https://github.com/marqo-ai/marqo

  • igel

    a delightful machine learning tool that allows you to train, test, and use models without writing code

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • deepsparse

    Sparsity-aware deep learning inference runtime for CPUs

  • Project mention: Fast Llama 2 on CPUs with Sparse Fine-Tuning and DeepSparse | news.ycombinator.com | 2023-11-23

    Interesting company. Yannic Kilcher interviewed Nir Shavit last year and they went into some depth: https://www.youtube.com/watch?v=0PAiQ1jTN5k DeepSparse is on GitHub: https://github.com/neuralmagic/deepsparse

  • tslearn

    The machine learning toolkit for time series analysis in Python

  • nannyml

    nannyml: post-deployment data science in python

  • Project mention: Introduction to NannyML: Model Evaluation without labels | dev.to | 2023-12-15

    In order to try to solve this issue, NannyML was created. NannyML is an open-source Python library designed in order to make it easy to monitor drift in the distributions of our model input variables and estimate our model performance (even without labels!) thanks to the Confidence-Based Performance Estimation algorithm they developed. But first of all, why do models need to be monitored and why their performance might vary over time?

  • nsfw_model

    Keras model of NSFW detector

  • pytorch2keras

    PyTorch to Keras model convertor

  • Project mention: Help Needed: Converting PlantNet-300k Pretrained Model Weights from Tar to h5 Format Help | /r/learnpython | 2023-06-09

    It's almost certainly a pickled pytorch model so you will first need to load it using pytorch and then write it out to h5 (legacy keras format) with https://github.com/gmalivenko/pytorch2keras.

  • retentioneering-tools

    Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream, AB tests, machine learning, and Markov Chain simulations.

  • ActionAI

    Real-Time Spatio-Temporally Localized Activity Detection by Tracking Body Keypoints

  • Project mention: Show HN: Custom Action Recognition with ActionAI | news.ycombinator.com | 2023-09-23
  • covalent

    Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments. (by AgnostiqHQ)

  • Project mention: Remote execution of code | /r/Python | 2023-12-05

    Pretty interesting request, if SSH is not used, i would try using something like dask which uses tcp to connect and execute assuming your workers are in another machine.I also think something like covalent can be used to extend your own custom plugin in their ecosystem to connect how you want. We have a very custom private plugin written on top of covalent's to have a custom protocol to connect our central on-prem GPU machines to our local laptops that is rpc based, mostly for high performance as well as some mandate security from where the GPU machines are. Once done it is pretty much something like

  • LiuAlgoTrader

    Framework for algorithmic trading

  • MetaSpore

    A unified end-to-end machine intelligence platform

  • CodeRL

    This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).

  • Machine-Learning-Guide

    Machine learning Guide. Learn all about Machine Learning Tools, Libraries, Frameworks, Large Language Models (LLMs), and Training Models.

  • deep-significance

    Enabling easy statistical significance testing for deep neural networks.

  • hydra-zen

    Create powerful Hydra applications without the yaml files and boilerplate code.

  • yolo-hand-detection

    A pre-trained YOLO based hand detection network.

  • zoofs

    zoofs is a python library for performing feature selection using a variety of nature-inspired wrapper algorithms. The algorithms range from swarm-intelligence to physics-based to Evolutionary. It's easy to use , flexible and powerful tool to reduce your feature size.

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python Machinelearning related posts

Index

What are some of the best open-source Machinelearning projects in Python? This list will help you:

Project Stars
1 horovod 13,969
2 ludwig 10,845
3 vaex 8,178
4 clearml 5,279
5 awesome-open-gpt 5,092
6 marqo 4,152
7 igel 3,080
8 deepsparse 2,881
9 tslearn 2,786
10 nannyml 1,759
11 nsfw_model 1,614
12 pytorch2keras 846
13 retentioneering-tools 766
14 ActionAI 724
15 covalent 698
16 LiuAlgoTrader 672
17 MetaSpore 629
18 CodeRL 475
19 Machine-Learning-Guide 441
20 deep-significance 316
21 hydra-zen 284
22 yolo-hand-detection 259
23 zoofs 236

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com