Machine Learning

Top 23 Machine Learning Open-Source Projects

  • tensorflow

    An Open Source Machine Learning Framework for Everyone

    Project mention: Data-Oriented Programming in Python | news.ycombinator.com | 2022-11-27

    > In practice, scientific computing users rely on the NumPy family of libraries, e.g. NumPy, SciPy, TensorFlow, PyTorch, CuPy, JAX, etc.

    this is a somewhat confusing statement. most of these libraries actually don't rely on numpy. e.g. tensorflow ultimately wraps c++/eigen tensors [0] and numpy enters somewhere higher up in their python integration

    [0] https://github.com/tensorflow/tensorflow/blob/master/tensorf...

  • transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Project mention: [D] Best Practices for Training From Scratch With Large Datasets | reddit.com/r/MachineLearning | 2022-11-30

    I've been working from this as a starting point: https://github.com/huggingface/transformers/tree/main/examples/pytorch/speech-pretraining

  • Pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Project mention: Stable Diffusion with Core ML on Apple Silicon | news.ycombinator.com | 2022-12-01

    I can't fine-tune the model on Apple Silicon due to PyTorch support issues. I don't have high hopes that it will be supported.

    https://github.com/pytorch/pytorch/issues/77794

    https://github.com/pytorch/pytorch/issues/77764

  • Keras

    Deep Learning for humans

    Project mention: 65 Blog Posts to Learn Data Science | dev.to | 2022-11-30

    Hello world. This tutorial is a gentle introduction to building a modern text recognition system using deep learning in 15 minutes. It will teach you the main ideas of how to use Keras and Supervisely for this problem. This guide is for anyone who is interested in using deep learning for text recognition in images but has no idea where to start.

  • scikit-learn

    scikit-learn: machine learning in Python

    Project mention: Scaling PostgresML to 1M Requests per Second | news.ycombinator.com | 2022-11-11

    Of course. The paper is at https://arxiv.org/abs/1408.3060.

    > Our method applies to any translation invariant and any dot-product kernel, such as the popular RBF kernels and polynomial kernels. We prove that the approximation is unbiased and has low variance. Experiments show that we achieve similar accuracy to full kernel expansions and Random Kitchen Sinks while being 100x faster and using 1000x less memory. These improvements, especially in terms of memory usage, make kernel methods more practical for applications that have large training sets and/or require real-time prediction.

    Sadly Fastfood didn't quite make it into Scikit[1], but did land in scikit-learn-extra[2].

    1. https://github.com/scikit-learn/scikit-learn/pull/3665. A shame, Scikit's equivalents scale very poorly.

    2. https://scikit-learn-extra.readthedocs.io/en/stable/generate...
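The quoted abstract compares Fastfood against Random Kitchen Sinks, so a small sketch of the latter may help make the idea concrete. The code below is not Fastfood and is not taken from scikit-learn or scikit-learn-extra: it is a plain-Python illustration of random Fourier features for the RBF kernel, and the names `rbf_kernel` and `make_random_features` are hypothetical helpers chosen for this example. Fastfood, as described in the paper, speeds up the same construction by replacing the dense Gaussian weight matrix with structured products.

```python
import math
import random


def rbf_kernel(x, y, gamma):
    """Exact RBF kernel exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def make_random_features(dim, n_features, gamma, seed=0):
    """Return a function that maps a point to its random Fourier features.

    Hypothetical helper for illustration: the dot product of two feature
    vectors is an unbiased estimate of rbf_kernel(x, y, gamma).
    """
    rng = random.Random(seed)
    # For exp(-gamma * ||d||^2) the frequencies are Gaussian with
    # standard deviation sqrt(2 * gamma).
    std = math.sqrt(2.0 * gamma)
    weights = [[rng.gauss(0.0, std) for _ in range(dim)]
               for _ in range(n_features)]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_features)]
    scale = math.sqrt(2.0 / n_features)

    def transform(x):
        return [
            scale * math.cos(sum(w_i * x_i for w_i, x_i in zip(w, x)) + b)
            for w, b in zip(weights, phases)
        ]

    return transform


if __name__ == "__main__":
    transform = make_random_features(dim=3, n_features=5000, gamma=0.5, seed=1)
    x = [0.1, -0.4, 0.7]
    y = [0.3, 0.2, -0.1]
    approx = sum(a * b for a, b in zip(transform(x), transform(y)))
    print("approximate:", approx)
    print("exact:      ", rbf_kernel(x, y, 0.5))
```

The dense weight matrix here costs memory and time proportional to `n_features * dim`, which is the cost the quoted abstract says Fastfood reduces.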

  • tesseract-ocr

    Tesseract Open Source OCR Engine (main repository)

    Project mention: Help with DLLimport function | reddit.com/r/csharp | 2022-11-26

    [1] https://github.com/tesseract-ocr/tesseract/blob/main/include/tesseract/capi.h

  • cs-video-courses

    List of Computer Science courses with video lectures.

    Project mention: Looking for a YouTube playlist for learning compsci | reddit.com/r/cscareerquestions | 2022-11-10

    There's no one playlist that will have all content but here's a good resource https://github.com/Developer-Y/cs-video-courses

  • Face Recognition

    The world's simplest facial recognition api for Python and the command line

    Project mention: Any algorithm to get coordinates of classified face? | reddit.com/r/neuralnetworks | 2022-11-12

    face_recognition - fairly easy to install as long as dlib compiles for you. You should be able to get your face location OK. This is the oldest and most mature option, but it doesn't have as many features and might not have the accuracy of other models. It can also do face recognition.
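For the question of getting coordinates from a detected face, the sketch below assumes a bounding box in the (top, right, bottom, left) order that the face_recognition documentation describes for `face_locations`. The helper names `box_to_xywh` and `box_center` are hypothetical and exist only for this illustration; the sample box is made up rather than produced by a real detection.

```python
def box_to_xywh(box):
    """Convert a (top, right, bottom, left) box to x, y, width, height."""
    top, right, bottom, left = box
    return {"x": left, "y": top, "width": right - left, "height": bottom - top}


def box_center(box):
    """Return the (x, y) center of a (top, right, bottom, left) box."""
    top, right, bottom, left = box
    return ((left + right) / 2, (top + bottom) / 2)


if __name__ == "__main__":
    # Assumed sample box, not output from an actual image.
    sample_box = (50, 200, 150, 100)
    print(box_to_xywh(sample_box))
    print(box_center(sample_box))
```

With the library installed, the boxes would come from a call such as `face_recognition.face_locations(image)` on an image loaded with `face_recognition.load_image_file`.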

  • ML-For-Beginners

    12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

    Project mention: Pocetak ML karijere | reddit.com/r/programiranje | 2022-09-26

    https://github.com/microsoft/ML-For-Beginners is this what you mean?

  • faceswap

    Deepfakes Software For All

    Project mention: [D] How is it checked if models do not just memorize their training examples? | reddit.com/r/MachineLearning | 2022-04-28

    But there's a nice survey on arXiv here of various deepfake / face swap methods. Some of the methods listed in the table on page 4, such as Faceswap and Faceswap-GAN, apparently use encoder-decoder models. I think Faceswap-GAN was the one that I was thinking of in particular; apparently it adds a perceptual loss and an adversarial loss to an autoencoder.

  • TensorFlow-Examples

    TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

  • awesome-scalability

    The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

    Project mention: 100+ Must Know Github Repositories For Any Programmer | dev.to | 2022-11-17

    3. Awesome Scalability

  • julia

    The Julia Programming Language

    Project mention: What other programming language do you actively develop with productively, to complement Python? | reddit.com/r/Python | 2022-11-22

    https://julialang.org/ for non-trivial numerical calculations - mathematical syntax and very high runtime performance.

  • 100-Days-Of-ML-Code

    100 Days of ML Coding

    Project mention: ✨ 5 Best GitHub Repositories to Learn Machine Learning in 2022 for Free 💯 | reddit.com/r/learnmachinelearning | 2022-10-14

    1️⃣ 100 Days Of ML Code

  • DeepFaceLab

    DeepFaceLab is the leading software for creating deepfakes.

    Project mention: Margot Robbie - Dirndl Pantene Pro-V | reddit.com/r/SFWdeepfakes | 2022-11-15

    I am using https://github.com/iperov/DeepFaceLab and Adobe video and picture editing software. You will find all the information on GitHub. The hardware is mainly a Zotac RTX 3090 24GB combined with an AMD Ryzen 9 5950X and 32GB RAM.

  • yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

    Project mention: YOLOv5 SOTA Realtime Instance Segmentation | news.ycombinator.com | 2022-11-22

  • Caffe

    Caffe: a fast open framework for deep learning.

    Project mention: German ad: "Artificial intelligence: the 4 most used drinks will be placed on the main screen" | reddit.com/r/ProgrammerHumor | 2022-09-20

  • Made-With-ML

    Learn how to responsibly develop, deploy and maintain production machine learning applications.

    Project mention: When do you create new environments to code in? Is there a best practice for this? | reddit.com/r/pytorch | 2022-11-28

    With regard to best practices, I'd base my project workflow on the Made With ML blog, specifically the Developing sub-section within the MLOps course section. This blog is a great guideline for structuring any ML/DS project, by the way.

  • gym

    A toolkit for developing and comparing reinforcement learning algorithms.

    Project mention: Pusher task on mujoco/pybulletenv | reddit.com/r/reinforcementlearning | 2022-11-19

  • Tesseract.js

    Pure Javascript OCR for more than 100 Languages 📖🎉🖥

    Project mention: WebDev Recommendations to Mimic AWS Textract Demo Functionality | reddit.com/r/Frontend | 2022-10-03

    I've never used it, but Tesseract.js looks interesting. Perhaps you can use it as a foundation on which to build something like the AWS tool (which seems to work by overlaying an SVG on top of the image, then manipulating that SVG when the user clicks on a results tag to display the highlight box for the word or phrase)?

  • machine-learning-for-software-engineers

    A complete daily plan for studying to become a machine learning engineer.

  • google-research

    Google Research

    Project mention: Nearest-neighbor search in high-dimensional spaces | reddit.com/r/compsci | 2022-11-04

  • openpose

    OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation

    Project mention: How to get rotation (yaw/pitch/roll) from face detection keypoints? | reddit.com/r/tensorflow | 2022-10-31

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2022-12-01.

Index

What are some of the best open-source Machine Learning projects? This list will help you:

Rank Project Stars
1 tensorflow 169,420
2 transformers 75,115
3 Pytorch 60,605
4 Keras 56,758
5 scikit-learn 52,167
6 tesseract-ocr 47,653
7 cs-video-courses 47,379
8 Face Recognition 46,626
9 ML-For-Beginners 42,954
10 faceswap 42,849
11 TensorFlow-Examples 42,364
12 awesome-scalability 42,071
13 julia 41,021
14 100-Days-Of-ML-Code 39,019
15 DeepFaceLab 35,961
16 yolov5 33,311
17 Caffe 32,998
18 Made-With-ML 31,731
19 gym 29,100
20 Tesseract.js 28,761
21 machine-learning-for-software-engineers 26,323
22 google-research 26,246
23 openpose 25,469