Python Computer Vision

Open-source Python projects categorized as Computer Vision

Top 23 Python Computer Vision Projects

  • Face Recognition

    The world's simplest facial recognition api for Python and the command line

    Project mention: Any algorithm to get coordinates of classified face? | reddit.com/r/neuralnetworks | 2022-11-12

    face_recognition - fairly easy to install as long as dlib compiles for you. You should be able to get your face location OK. This is the oldest and most mature option, but it doesn't have as many features and might not have the accuracy of other models. It can also do face recognition.
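
    To make "get your face location" concrete, here is a minimal sketch, assuming the face_recognition package (and dlib) is installed. It relies on `face_recognition.load_image_file` and `face_recognition.face_locations`, which return boxes as (top, right, bottom, left) pixel tuples. The helper names `css_box_to_xywh` and `find_faces` and the image path are hypothetical, chosen only for illustration.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]


def css_box_to_xywh(box: Box) -> Box:
    """Convert a (top, right, bottom, left) box into (x, y, width, height)."""
    top, right, bottom, left = box
    return (left, top, right - left, bottom - top)


def find_faces(image_path: str) -> List[Box]:
    """Return one (x, y, width, height) box per face detected in the image."""
    # Imported lazily so the pure helper above works without dlib installed.
    import face_recognition

    image = face_recognition.load_image_file(image_path)
    # face_locations yields (top, right, bottom, left) tuples in pixels.
    locations = face_recognition.face_locations(image)
    return [css_box_to_xywh(box) for box in locations]


# Example call (the path is a placeholder, not a file from the post):
# for x, y, w, h in find_faces("group_photo.jpg"):
#     print(f"face at x={x}, y={y}, size={w}x{h}")
```

    The conversion to (x, y, width, height) is only a convenience for drawing or cropping; the library's own ordering can be used directly.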

  • pytorch-CycleGAN-and-pix2pix

    Image-to-Image Translation in PyTorch

    Project mention: IA art | reddit.com/r/ArtHistory | 2022-09-26
  • jina

    🔮 The most advanced MLOps platform for multimodal AI on the cloud · Neural Search · Creative AI · Cloud Native

    Project mention: Have you used Jina for multi-modal applications? | dev.to | 2022-10-24

    How will you build a multi-modal application? I just noticed the release of Jina, an MLOps framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. If you have tried it before, please let me know what you think of it. Thanks!

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

    Project mention: [P]Modern open-source OCR capabilities and which model to choose | reddit.com/r/MachineLearning | 2022-11-18

    I've used EasyOCR for number recognition tasks. Works fairly well. https://github.com/JaidedAI/EasyOCR

  • d2l-en

    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 400 universities from 60 countries including Stanford, MIT, Harvard, and Cambridge.

    Project mention: How to pre-train BERT on different objective tasks using HuggingFace | reddit.com/r/deeplearning | 2022-04-10

    There might be a BERT library for pre-training BERT models in Hugging Face, but I suggest that you train a BERT model in native PyTorch to understand the details. Limu's course is recommended for you.

  • datasets

    🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

    Project mention: FauxPilot – an open-source GitHub Copilot server | news.ycombinator.com | 2022-08-02

    And then pass that my_code.json as the dataset name.

    [1] https://github.com/huggingface/datasets

  • vision

    Datasets, Transforms and Models specific to Computer Vision

    Project mention: Solution for "RuntimeError: Couldn't load custom C++ ops" | reddit.com/r/StableDiffusion | 2022-09-07

    RuntimeError: Couldn't load custom C++ ops. This can happen if your PyTorch and torchvision versions are incompatible, or if you had errors while compiling torchvision from source. For further information on the compatible versions, check https://github.com/pytorch/vision#installation for the compatibility matrix. Please check your PyTorch version with torch.__version__ and your torchvision version with torchvision.__version__ and verify if they are compatible, and if not please reinstall torchvision so that it matches your PyTorch install.
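
    The version check that this error message asks for can be scripted. The sketch below is one possible way to do it, assuming only the standard library: it reads the installed versions from package metadata, so it still works when importing torchvision itself fails. The function names `installed_version` and `report_versions` are hypothetical, and the script only reports versions; whether a given pair is compatible still has to be checked against the matrix linked above.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def report_versions() -> str:
    """Build a short report of the torch and torchvision versions."""
    lines = []
    for name in ("torch", "torchvision"):
        version = installed_version(name)
        lines.append(f"{name}: {version or 'not installed'}")
    return "\n".join(lines)


print(report_versions())
```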

  • facenet

    Face recognition using Tensorflow

  • vit-pytorch

    Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

    Project mention: What are the best resources online to learn attention and transformers? | reddit.com/r/computervision | 2022-08-23

    For code implementation, check out this git repo. It contains fairly straightforward PyTorch implementations for various ViT papers with references.

  • label-studio

    Label Studio is a multi-type data labeling and annotation tool with standardized output format

    Project mention: Best open source annotation tool for NLP? | reddit.com/r/LanguageTechnology | 2022-11-10

    Label Studio (limited colab feature) https://labelstud.io/ or Annotation lab https://www.johnsnowlabs.com/annotation-lab/

  • fashion-mnist

    A MNIST-like fashion product database. Benchmark 👇

    Project mention: MNIST classification using pytorch/I will do data science, data analysis, machine learning in python | reddit.com/r/kalivouches | 2022-10-31

    Fashion MNIST: This dataset from Zalando Research contains images of 10 classes consisting of clothing apparel and accessories like ankle boots, bags, coats, dresses, pullovers, sandals, shirts, sneakers, etc. instead of handwritten digits. The images are grayscale just like the original MNIST.
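
    As a small illustration of the 10 classes, the sketch below maps the integer labels 0 to 9 to class names. It assumes the label order published in the project's README; the names `FASHION_MNIST_LABELS` and `label_name` are hypothetical helpers, not part of the dataset.

```python
# Class names indexed by integer label, assuming the order in the
# fashion-mnist README (0 = T-shirt/top, ..., 9 = Ankle boot).
FASHION_MNIST_LABELS = (
    "T-shirt/top",
    "Trouser",
    "Pullover",
    "Dress",
    "Coat",
    "Sandal",
    "Shirt",
    "Sneaker",
    "Bag",
    "Ankle boot",
)


def label_name(index: int) -> str:
    """Translate an integer class label into its human-readable name."""
    if not 0 <= index < len(FASHION_MNIST_LABELS):
        raise ValueError(f"label must be in 0..9, got {index}")
    return FASHION_MNIST_LABELS[index]


print(label_name(9))
```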

  • labelme

    Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

    Project mention: [D] What is a free tool for generating image segmentation masks? | reddit.com/r/MachineLearning | 2022-10-18
  • ludwig

    Data-centric declarative deep learning framework

  • Meshroom

    3D Reconstruction Software

    Project mention: Best way to work with point cloud files? | reddit.com/r/architecture | 2022-11-17

    You could start by importing the point cloud model into software with native support for the file format. Some open source examples are Meshroom (https://alicevision.org) or Meshlab (https://www.meshlab.net). There are other packages from companies like Autodesk that might be more intuitive, but you'd need licenses to use them. Rhino and Blender can handle the file formats but might struggle depending on the size and complexity.

  • Kornia

    Open Source Differentiable Computer Vision Library

    Project mention: [News] Kornia 0.6.6: ParametrizedLine API, load_image support for Apple Windows Developer, integration demos with Hugging Face and many more. | reddit.com/r/MachineLearning | 2022-07-18

    👉 https://github.com/kornia/kornia/releases/tag/v0.6.6

  • RobustVideoMatting

    Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

    Project mention: CatFileCreator in Nuke | reddit.com/r/vfx | 2022-10-10

    I have done a bit of coding and I will use pretrained models only. Looking at things like depth and segmentation. Like this as an example. I am using it on a Colab now but it's so cumbersome. https://github.com/PeterL1n/RobustVideoMatting

  • BackgroundMattingV2

    Real-Time High-Resolution Background Matting

    Project mention: [D] Is this ML project safe? | reddit.com/r/MachineLearning | 2022-07-14

    I want to use Background Matting V2 professionally; it is very impressive. My concern, however, is privacy. The successor to this project, Robust Video Matting, says (on its GitHub page) that it was developed at ByteDance Inc. I know this company owns TikTok and is known for storing user data on Chinese servers. My question is, is Background Matting V2 safe to use? https://github.com/PeterL1n/BackgroundMattingV2

  • pix2pixHD

    Synthesizing and manipulating 2048x1024 images with conditional GANs

    Project mention: How do I run more than 200 epochs in training a Pix2PixHD model? | reddit.com/r/MLQuestions | 2022-08-11
  • scikit-image

    Image processing in Python

    Project mention: A CLI that does simple image processing and also generates cool patterns | reddit.com/r/rust | 2022-09-26

    Also, I don't know if you're familiar with Python, but if you need ideas to implement for future directions: https://scikit-image.org/

  • autogluon

    AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data

    Project mention: autogluon: NEW Data - star count:5070.0 | reddit.com/r/algoprojects | 2022-11-25
  • deeplake

    Data Lake for Deep Learning. Build, manage, query, version, & visualize datasets. Stream data real-time to PyTorch/TensorFlow. https://activeloop.ai

    Project mention: Launch HN: Activeloop (YC S18) – Data lake for deep learning | news.ycombinator.com | 2022-11-15

    Re: HF - we know them and admire their work (primarily, until very recently, focused on NLP, while we focus mostly on CV). As mentioned in the post, a large part of Deep Lake, including the Python-based dataloader and dataset format, is open source as well - https://github.com/activeloopai/deeplake.

    Likewise, we curate a list of large open source datasets here -> https://datasets.activeloop.ai/docs/ml/, but our main thing isn't aggregating datasets (focus for HF datasets), but rather providing people with a way to manage their data efficiently. That being said, all of the 125+ public datasets we have are available in seconds with one line of code. :)

    We haven't benchmarked against HF datasets in a while, but Deep Lake's dataloader is much, much faster in third-party benchmarks (see https://arxiv.org/pdf/2209.13705 and, for an older version that was much slower than what we have now, https://pasteboard.co/la3DmCUR2iFb.png). HF under the hood uses Git-LFS (to the best of my knowledge) and is not opinionated on formats, so LAION just dumps Parquet files on their storage.

    While your setup would work for a few TBs, scaling to PB would be tricky, including maintaining your own infrastructure. And yep, as you said, NAS/NFS would not be able to handle the scale either (especially writes with 1k workers). I am also slightly curious about your use of mmap files with image/video compressed data (as zero-copy won't happen) unless you decompress inside the GPU ;), but would love to learn more from you! Re: pricing, thanks for the feedback. Storage is one component and is custom-priced for PB-scale workloads.

  • pytorch-metric-learning

    The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

    Project mention: Similarity Learning lacks a framework. So we built one | news.ycombinator.com | 2022-07-13

    Not a full-featured framework, but pytorch-metric-learning has data loaders, losses, etc. to facilitate similarity learning: https://github.com/KevinMusgrave/pytorch-metric-learning

    Disclaimer: I've made some contributions to it.

  • imagededup

    😎 Finding duplicate images made easy!

    Project mention: Find visual similar photos | reddit.com/r/selfhosted | 2022-09-17

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020). The latest post mention was on 2022-11-25.

Index

What are some of the best open-source Computer Vision projects in Python? This list will help you:

 #  Project                        Stars
 1  Face Recognition               46,534
 2  pytorch-CycleGAN-and-pix2pix   18,779
 3  jina                           16,690
 4  EasyOCR                        16,331
 5  d2l-en                         15,632
 6  datasets                       14,783
 7  vision                         12,824
 8  facenet                        12,785
 9  vit-pytorch                    12,217
10  label-studio                   11,259
11  fashion-mnist                  10,421
12  labelme                         9,440
13  ludwig                          8,629
14  Meshroom                        8,590
15  Kornia                          7,404
16  RobustVideoMatting              6,366
17  BackgroundMattingV2             6,044
18  pix2pixHD                       5,974
19  scikit-image                    5,123
20  autogluon                       5,061
21  deeplake                        5,031
22  pytorch-metric-learning         4,874
23  imagededup                      4,261