Python Pytorch

Open-source Python projects categorized as Pytorch

Top 23 Python Pytorch Projects

  • GitHub repo transformers

    🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

    Project mention: Retrieval Augmented Generation with Huggingface Transformers and Ray | reddit.com/r/deeplearning | 2021-02-10

    Improving the scalability of RAG distributed fine-tuning

  • GitHub repo Real-Time-Voice-Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Project mention: Cost for a voice skin? | reddit.com/r/deeplearning | 2021-02-21

    I can do the coding myself. It looks like there are some FOSS projects with pretrained models that only need a few seconds of audio for new voices.

  • GitHub repo pytorch-tutorial

    PyTorch Tutorial for Deep Learning Researchers

    Project mention: [P] Probabilistic Machine Learning: An Introduction, Kevin Murphy's 2021 e-textbook is out | reddit.com/r/MachineLearning | 2021-01-01

  • GitHub repo pytorch-CycleGAN-and-pix2pix

    Image-to-Image Translation in PyTorch

    Project mention: This Wojak Does Not Exist | news.ycombinator.com | 2020-12-31

    https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix

  • GitHub repo mmdetection

    OpenMMLab Detection Toolbox and Benchmark

    Project mention: Mask RCNN implementation in python | reddit.com/r/computervision | 2021-02-11

    I’ve trained Mask RCNN in Google Colab using this Pytorch library - https://github.com/open-mmlab/mmdetection

  • GitHub repo pytorch-lightning

    The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.

    Project mention: DDP with model parallelism with multi host multi GPU system | reddit.com/r/pytorch | 2021-02-07

  • GitHub repo fairseq

    Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

    Project mention: What are some good speech recognition papers I can implement? | reddit.com/r/MLQuestions | 2021-02-01

    fairseq

  • GitHub repo horovod

    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

    Project mention: SKLean, TensorFlow, etc vs Spark ML? | reddit.com/r/apachespark | 2021-02-12

    I'm the maintainer for an open source project called Horovod that allows you to distribute deep learning training (e.g., TensorFlow) on platforms like Spark.

  • GitHub repo EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

    Project mention: Using Google's OCR API with Puppeteer for Visual Testing | dev.to | 2021-02-08

    There are multiple open-source OCR tools, like pytesseract or EasyOCR, which can be used to integrate OCR functionality into a program. However, these tools require significant configuration to get up and running and to provide results with an acceptable accuracy level.

  • GitHub repo allennlp

    An open-source NLP research library, built on PyTorch.

    Project mention: AllenNLP v2.0.0 | news.ycombinator.com | 2021-01-27

  • GitHub repo nni

    An open-source AutoML toolkit to automate the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyper-parameter tuning.

    Project mention: How we were able to achieve hyper-parameter tuning (HPT) for deep learning workflows at 1.5x faster in our clusters and 3x cheaper on AWS | reddit.com/r/learnmachinelearning | 2021-02-23

    To tackle the problem of long and expensive HPT workflows, our team at Petuum collaborated with Microsoft to integrate AdaptDL with Neural Network Intelligence (NNI). AdaptDL is an open-source tool in the CASL (Composable, Automatic, and Scalable Learning) ecosystem. AdaptDL offers adaptive resource management for distributed clusters and reduces the cost of deep learning workloads ranging from a few training/tuning trials to thousands. NNI, from the Microsoft open-source community, is a toolkit for automatic machine learning (AutoML) and hyper-parameter tuning.

  • GitHub repo yolov5

    YOLOv5 in PyTorch > ONNX > CoreML > TFLite

    Project mention: Scaled Yolo v4 and yolov5 Scaling linearly with batchsize | reddit.com/r/deeplearning | 2021-02-17

    https://github.com/ultralytics/yolov5/issues/1806#issuecomment-752837988

  • GitHub repo Bringing-Old-Photos-Back-to-Life

    Bringing Old Photos Back to Life (CVPR 2020 oral)

    Project mention: Weekly Developer Roundup #23 - Sun Nov 22 2020 | dev.to | 2020-11-21

    microsoft/Bringing-Old-Photos-Back-to-Life (Python): Bringing Old Photos Back to Life (CVPR 2020 oral)

  • GitHub repo datasets

    🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

    Project mention: Build an Embeddings index with Hugging Face Datasets | dev.to | 2021-01-28

    This article shows how txtai can index and search with Hugging Face's Datasets library. Datasets opens access to a large and growing list of publicly available datasets. Datasets has functionality to select, transform and filter data stored in each dataset.

  • GitHub repo Stanza

    Official Stanford NLP Python Library for Many Human Languages

  • GitHub repo DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

    Project mention: [D] Will Nvidia's anti-mining GPU modifications affect deep learning performance? | reddit.com/r/MachineLearning | 2021-02-20

    Maybe https://github.com/microsoft/DeepSpeed

  • GitHub repo best-of-ml-python

    🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

    Project mention: best-of-python: A ranked list of awesome Python libraries and tools | reddit.com/r/Python | 2021-01-14

    Here ya go: https://github.com/ml-tooling/best-of-ml-python/pull/47

  • GitHub repo jukebox

    Code for the paper "Jukebox: A Generative Model for Music"

    Project mention: Any of these bands like Tally Hall? | reddit.com/r/tallyhall | 2021-02-23

  • GitHub repo Kornia

    Open Source Differentiable Computer Vision Library for PyTorch

    Project mention: SpaCy v3.0 Released (Python Natural Language Processing) | news.ycombinator.com | 2021-02-01

    I haven't had a situation to use it, but I think Kornia looks cool: https://github.com/kornia/kornia

  • GitHub repo espnet

    End-to-End Speech Processing Toolkit

    Project mention: What are some good speech recognition papers I can implement? | reddit.com/r/MLQuestions | 2021-02-01

    espnet

  • GitHub repo PyTorchZeroToAll

    Simple PyTorch Tutorials Zero to ALL!

    Project mention: [D] I'm trying to do more stuff in pure Tensorflow. Is there an in-depth book that explain constructing recurrent, convolutional, graph etc layers in it? | reddit.com/r/MachineLearning | 2021-01-30

    I'm doing this rn, but with PyTorch. I look for notebooks/scripts (https://github.com/hunkim/PyTorchZeroToAll, primarily) on github read and copy them, along with the deeplearning book. Surprisingly pretty much everything just clicks now, it's my third attempt reading the text though. I don't think any book serves the purpose well, except for knowing the well established conventions of the field.

  • GitHub repo MindsDB

    Predictive AI layer for existing databases.

    Project mention: Launch HN: MindsDB (YC W20) – Machine Learning Inside Your Database | news.ycombinator.com | 2021-02-19

    Here's an issue that enumerates all pending tasks for a first iteration of this feature: https://github.com/mindsdb/mindsdb/issues/1116

  • GitHub repo segmentation_models.pytorch

    Segmentation models with pretrained backbones. PyTorch.

    Project mention: C++ trainable semantic segmentation models | reddit.com/r/computervision | 2021-02-23

    This project is under development. So far, these projects have helped a lot: official pytorch, qubvel SMP, nlohmann json.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-02-23.

Index

What are some of the best open-source Pytorch projects in Python? This list will help you:

Rank Project Stars
1 transformers 41,393
2 Real-Time-Voice-Cloning 23,069
3 pytorch-tutorial 19,726
4 pytorch-CycleGAN-and-pix2pix 14,338
5 mmdetection 13,662
6 pytorch-lightning 12,092
7 fairseq 11,270
8 horovod 10,835
9 EasyOCR 10,671
10 allennlp 9,712
11 nni 9,102
12 yolov5 8,807
13 Bringing-Old-Photos-Back-to-Life 7,545
14 datasets 6,802
15 Stanza 5,200
16 DeepSpeed 4,348
17 best-of-ml-python 4,148
18 jukebox 4,073
19 Kornia 3,613
20 espnet 3,453
21 PyTorchZeroToAll 3,443
22 MindsDB 3,436
23 segmentation_models.pytorch 2,928
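
The index is stated to be ordered by GitHub stars. As a rough illustration, the sketch below parses a few rows copied from the index above and checks that the star counts are in descending order. The names `INDEX_ROWS`, `parse_row`, and `is_ranked_by_stars` are hypothetical helpers introduced only for this example; they are not part of any listed project.

```python
# First five rows copied from the index above: rank, project, star count.
INDEX_ROWS = """\
1 transformers 41,393
2 Real-Time-Voice-Cloning 23,069
3 pytorch-tutorial 19,726
4 pytorch-CycleGAN-and-pix2pix 14,338
5 mmdetection 13,662
"""


def parse_row(line):
    """Split one index row into (rank, project, stars) with numeric fields."""
    # Split from the right so the star count is always the last field.
    head, stars = line.rsplit(maxsplit=1)
    rank, project = head.split(maxsplit=1)
    return int(rank), project, int(stars.replace(",", ""))


def is_ranked_by_stars(rows):
    """Return True if star counts never increase from one row to the next."""
    stars = [row[2] for row in rows]
    return stars == sorted(stars, reverse=True)


rows = [parse_row(line) for line in INDEX_ROWS.splitlines()]
print(is_ranked_by_stars(rows))
```

The same check can be run over all 23 rows by pasting the full index into `INDEX_ROWS`.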