Python Computer Vision

Open-source Python projects categorized as Computer Vision

Top 23 Python Computer Vision Projects

  • GitHub repo Face Recognition

    The world's simplest facial recognition api for Python and the command line

    Project mention: Need ideas regarding Face Recognition | reddit.com/r/computervision | 2021-06-02

  • GitHub repo pytorch-CycleGAN-and-pix2pix

    Image-to-Image Translation in PyTorch

    Project mention: This Wojak Does Not Exist | news.ycombinator.com | 2020-12-31

    https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix

  • GitHub repo facenet

    Face recognition using TensorFlow

    Project mention: CompreFace - Free and open-source self-hosted face recognition system from Exadel | reddit.com/r/selfhosted | 2021-05-07

    In my view, OpenFace is already outdated: its latest release was in 2016. If you are looking for a library, the easiest to use is ageitgey/face_recognition. More accurate options are davidsandberg/facenet and deepinsight/insightface.

  • GitHub repo EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

    Project mention: Best OCR for keras CNN trained handwritten character model? | reddit.com/r/tensorflow | 2021-06-03

    There are many possible approaches to OCR. You might want to take a look at the pipeline of EasyOCR at https://github.com/JaidedAI/EasyOCR.

  • GitHub repo d2l-en

    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 175 universities.

    Project mention: I created a way to learn machine learning through Jupyter | reddit.com/r/learnmachinelearning | 2021-04-30

    There are actually some online books and courses built on Jupyter Notebook ([Dive into Deep Learning](https://github.com/d2l-ai/d2l-en), for example). However, yours is more detailed and could really help beginners.

  • GitHub repo vision

    Datasets, Transforms and Models specific to Computer Vision (by pytorch)

    Project mention: The image load from PyTorch DataLoader looks different from my origin image | reddit.com/r/learnmachinelearning | 2021-02-20

    The reason your images have 3 channels is that the default loader of torchvision.datasets.ImageFolder automatically converts the images to RGB; see also the source code.
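
    To see why a grayscale file comes back with three channels, here is a minimal pure-Python sketch of what a grayscale-to-RGB conversion does to pixel data. It is illustrative only: `gray_to_rgb` is a hypothetical helper, not torchvision code. By default, torchvision's loader delegates this step to Pillow's `Image.convert("RGB")`.

```python
def gray_to_rgb(gray_rows):
    """Replicate each grayscale value into an (R, G, B) triple.

    Hypothetical helper for illustration; it mimics the effect of
    converting a single-channel image to RGB.
    """
    return [[(v, v, v) for v in row] for row in gray_rows]


# A 2x2 single-channel "image" with values in 0..255.
gray = [[0, 128], [255, 64]]
rgb = gray_to_rgb(gray)

# Every pixel now carries three identical channel values.
channels = len(rgb[0][0])
print(channels, rgb[1][0])
```

    If single-channel tensors are needed, one option is to pass a custom `loader` to `ImageFolder`, or to add `torchvision.transforms.Grayscale(num_output_channels=1)` to the transform pipeline.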

  • GitHub repo fashion-mnist

    A MNIST-like fashion product database. Benchmark 👇

    Project mention: Machine Learning Orchestration on Kubernetes using Kubeflow | dev.to | 2021-03-23

    About the dataset: Fashion-MNIST is a dataset of Zalando's article images, consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image associated with a label from 10 classes. We intend Fashion-MNIST to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms. It shares the same image size and the same structure of training and testing splits. Source: https://github.com/zalandoresearch/fashion-mnist
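
    Because Fashion-MNIST is distributed in the same IDX file format as MNIST, the image layout described above can be sketched with the standard library alone. The snippet below is a hedged illustration: `parse_idx_images` is a hypothetical helper, and the tiny synthetic buffer stands in for a real, gzip-compressed image file.

```python
import struct


def parse_idx_images(buf):
    """Parse an uncompressed IDX image buffer into per-image byte strings.

    Assumed layout: four big-endian uint32 header fields
    (magic=2051, image count, rows, cols), then one unsigned byte per pixel.
    """
    magic, count, rows, cols = struct.unpack(">IIII", buf[:16])
    if magic != 2051:
        raise ValueError(f"unexpected magic number: {magic}")
    pixels = buf[16:]
    size = rows * cols
    if len(pixels) != count * size:
        raise ValueError("pixel data does not match the header")
    images = [pixels[i * size:(i + 1) * size] for i in range(count)]
    return images, rows, cols


# Synthetic stand-in: two all-black 28x28 images.
fake = struct.pack(">IIII", 2051, 2, 28, 28) + bytes(2 * 28 * 28)
images, rows, cols = parse_idx_images(fake)
print(len(images), rows, cols, len(images[0]))
```

    With a real file, the bytes would first be read through `gzip.open(path, "rb")`; the path is left out here because it depends on where the dataset was downloaded.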

  • GitHub repo datasets

    🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

    Project mention: Build an Embeddings index with Hugging Face Datasets | dev.to | 2021-01-28

    This article shows how txtai can index and search with Hugging Face's Datasets library. Datasets opens access to a large and growing list of publicly available datasets. Datasets has functionality to select, transform and filter data stored in each dataset.

  • GitHub repo labelme

    Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

    Project mention: Convert Yolov3 annotation to labelme | reddit.com/r/deeplearning | 2021-04-25

    Ref. - https://github.com/wkentaro/labelme/

  • GitHub repo meshroom

    3D Reconstruction Software

    Project mention: Dream software can be real | reddit.com/r/accursedfarms | 2021-06-06

    There is a free alternative: Meshroom.

  • GitHub repo label-studio

    Label Studio is a multi-type data labeling and annotation tool with standardized output format

    Project mention: From where do i download prodigy??? [D], [P] | reddit.com/r/MachineLearning | 2021-04-22

    I do not believe there is a freely available version. Prodigy is great, but if you want a free tool, I would try Label Studio.

  • GitHub repo Serpent.AI

    Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!

    Project mention: I'm on a 64 bit win10 pc and want to make a tas for a unity game, that is what I have. How do I make a tas | reddit.com/r/TAS | 2021-05-27

    I can't. Is there any way https://github.com/SerpentAI/SerpentAI would work? The game is entirely mouse movements.

  • GitHub repo pix2pixHD

    Synthesizing and manipulating 2048x1024 images with conditional GANs

    Project mention: How to access a class object when I use torch.nn.DataParallel()? | reddit.com/r/pytorch | 2021-03-12

    I used the pix2pixHD implementation on GitHub, if you want to see the full code.

  • GitHub repo vit-pytorch

    Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

    Project mention: [R] Rotary Positional Embeddings - a new relative positional embedding for Transformers that significantly improves convergence (20-30%) and works for both regular and efficient attention | reddit.com/r/MachineLearning | 2021-04-21

    I've attempted it here: https://github.com/lucidrains/vit-pytorch/blob/main/vit_pytorch/rvt.py, but those who have tried it haven't seen results as striking as in the 1D case. Perhaps the axial lengths are too small to see a benefit.

  • GitHub repo scikit-image

    Image processing in Python

    Project mention: Scikit-image for Image Processing | reddit.com/r/learnpython | 2021-04-12

    Then you would need to find out what this plugin does for imshow. The first thing you can see is that "interpolation" is not "bicubic", as you used, but "nearest"… but there are other settings here that are responsible for the difference in display. (It's better to look at the source code in your own environment, as it might be slightly different.)
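
    The difference between the two interpolation settings is easier to picture with a small example. The sketch below is a pure-Python illustration of nearest-neighbour upscaling, where every output pixel copies one source pixel and edges stay blocky; bicubic interpolation instead blends neighbouring pixels into smoother transitions. `upscale_nearest` is a hypothetical helper, not scikit-image or Matplotlib code.

```python
def upscale_nearest(img, factor):
    """Upscale a 2D list of pixel values by an integer factor.

    Each source pixel is repeated `factor` times horizontally and
    vertically, which is what nearest-neighbour interpolation amounts to
    for integer scale factors.
    """
    out = []
    for row in img:
        wide = [v for v in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


small = [[0, 255], [255, 0]]
big = upscale_nearest(small, 2)
for row in big:
    print(row)
```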

  • GitHub repo Kornia

    Open Source Differentiable Computer Vision Library for PyTorch

    Project mention: Top 5 Python libraries for Computer vision | dev.to | 2021-05-06

    Kornia - Kornia is a differentiable computer vision library for PyTorch. It consists of a set of routines and differentiable modules to solve generic computer vision problems. At its core, the package uses PyTorch as its main backend both for efficiency and to take advantage of the reverse-mode auto-differentiation to define and compute the gradient of complex functions.

  • GitHub repo jina

    An easier way to build neural search on the cloud

    Project mention: My open-source project is on Github trending #1 spot. I'm elated :), AMA | reddit.com/r/github | 2021-06-15

    It's been almost 1.5 years since we launched this open-source project, Jina, a neural search framework. And today, we ended up in the #1 spot on GitHub Trending.

  • GitHub repo pysot

    SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

    Project mention: [P] Object Detection with rotated bounding boxes | reddit.com/r/MachineLearning | 2021-01-25

    SiamMask has pretty good rotated bbox code you can take a look at. https://github.com/STVIR/pysot

  • GitHub repo imagededup

    😎 Finding duplicate images made easy!

    Project mention: How to remove duplicate images from your dataset (Also CIFAR-100 has issues) | reddit.com/r/computervision | 2021-02-01

    I used pHash to find the duplicate images. The hashing algorithm is ingenious, and it was able to find (and thus remove) duplicates efficiently. imagededup has perceptual hashing.
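
    As a rough illustration of how perceptual hashing flags near-duplicates, here is a minimal sketch that uses an average hash, a simpler relative of pHash (which is based on a discrete cosine transform). It is not imagededup's implementation; `average_hash` and `hamming` are hypothetical helpers, and the 2x2 grids stand in for images already shrunk to a small grayscale thumbnail.

```python
def average_hash(pixels):
    """Return an integer hash: one bit per pixel, set if above the mean."""
    flat = [v for row in pixels for v in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for v in flat:
        bits = (bits << 1) | (1 if v > mean else 0)
    return bits


def hamming(a, b):
    """Count the bits that differ between two hashes."""
    return bin(a ^ b).count("1")


img_a = [[10, 200], [220, 30]]
img_b = [[12, 198], [225, 28]]  # slightly altered copy of img_a
img_c = [[200, 10], [30, 220]]  # different pattern

d_ab = hamming(average_hash(img_a), average_hash(img_b))
d_ac = hamming(average_hash(img_a), average_hash(img_c))
print(d_ab, d_ac)
```

    A small Hamming distance suggests a near-duplicate; the cut-off is a tuning choice that depends on the hash length and the dataset.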

  • GitHub repo ALAE

    [CVPR2020] Adversarial Latent Autoencoders

    Project mention: [D] How do I make a model which takes a bedroom image as input give an output of different design of bedroom related to input image? | reddit.com/r/MachineLearning | 2021-03-27

    source link: https://github.com/podgorskiy/ALAE

  • GitHub repo image-super-resolution

    🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

    Project mention: Totally free and unlimited upscale or superresolution AI | reddit.com/r/ArtificialInteligence | 2021-06-03

  • GitHub repo SimpleCV

    The Open Source Framework for Machine Vision

    Project mention: Top 5 Python libraries for Computer vision | dev.to | 2021-05-06

    SimpleCV - SimpleCV is a framework for Open Source Machine Vision, using OpenCV and the Python programming language. It provides a concise, readable interface for cameras, image manipulation, feature extraction, and format conversion. Our mission is to give casual users a comprehensive interface for basic machine vision functions and an elegant programming interface for advanced users.

  • GitHub repo layout-parser

    A unified toolkit for Deep Learning Based Document Image Analysis

    Project mention: Document Classification | reddit.com/r/computervision | 2021-06-08

    One project that I saw not too long ago which might be useful is this: https://github.com/Layout-Parser/layout-parser

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2021-06-15.

Index

What are some of the best open-source Computer Vision projects in Python? This list will help you:

Rank Project Stars
1 Face Recognition 40,311
2 pytorch-CycleGAN-and-pix2pix 15,177
3 facenet 11,840
4 EasyOCR 11,678
5 d2l-en 10,071
6 vision 9,237
7 fashion-mnist 9,184
8 datasets 8,352
9 labelme 6,713
10 meshroom 6,520
11 label-studio 5,966
12 Serpent.AI 5,957
13 pix2pixHD 5,288
14 vit-pytorch 4,611
15 scikit-image 4,378
16 Kornia 4,114
17 jina 3,919
18 pysot 3,764
19 imagededup 3,621
20 ALAE 3,026
21 image-super-resolution 2,830
22 SimpleCV 2,462
23 layout-parser 2,208