Top 23 Python Computer Vision Projects
The world's simplest facial recognition API for Python and the command line
Project mention: OpenCV or Tensorflow or both ? | reddit.com/r/robotics | 2021-02-21
It's called face recognition. Face recognition consists of two steps: face detection and face comparison. If you don't have any background in this, I suggest you try the face_recognition Python module: https://github.com/ageitgey/face_recognition
Image-to-Image Translation in PyTorch
Project mention: This Wojak Does Not Exist | news.ycombinator.com | 2020-12-31
Face recognition using Tensorflow
Project mention: Show HN: CompreFace is a free and open-source face recognition software | news.ycombinator.com | 2021-02-21
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Project mention: Using Google's OCR API with Puppeteer for Visual Testing | dev.to | 2021-02-08
There are multiple open-source OCR tools, such as pytesseract and EasyOCR, that can be used to integrate OCR functionality into a program. However, these tools require significant configuration before they produce results with an acceptable level of accuracy.
A MNIST-like fashion product database. Benchmark 👉
Project mention: [P] Why are stacked autoencoders still a thing? | reddit.com/r/MachineLearning | 2021-01-25
Datasets, Transforms and Models specific to Computer Vision (by pytorch)
Project mention: The image load from PyTorch DataLoader looks different from my origin image | reddit.com/r/learnmachinelearning | 2021-02-20
The reason your images have 3 channels is that the default loader of torchvision.datasets.ImageFolder automatically converts the images to RGB; see also the source code.
🤗 The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Project mention: Build an Embeddings index with Hugging Face Datasets | dev.to | 2021-01-28
This article shows how txtai can index and search with Hugging Face's Datasets library. Datasets opens access to a large and growing list of publicly available datasets. Datasets has functionality to select, transform and filter data stored in each dataset.
3D Reconstruction Software
Project mention: [Question] House in Vr | reddit.com/r/Vive | 2021-02-20
Good info here and they use Meshroom (free) to generate the meshes from the photos. This gives you a photo-approximation that is... interesting. It's a bit surreal because of the mesh angles. But the thing is, it's good enough to share with people, and it's also a great start if you wanted to manually build out geometry, you can use the room mesh as a guide.
Game Agent Framework. Helping you create AIs / Bots that learn to play any game you own!
Image processing in Python
Project mention: Patented algorithm removed from scikit-image shortly before merge accept | news.ycombinator.com | 2021-01-01
SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.
Project mention: [P] Object Detection with rotated bounding boxes | reddit.com/r/MachineLearning | 2021-01-25
SiamMask has pretty good rotated bbox code you can take a look at. https://github.com/STVIR/pysot
Open Source Differentiable Computer Vision Library for PyTorch
😎 Finding duplicate images made easy!
Project mention: How to remove duplicate images from your dataset (Also CIFAR-100 has issues) | reddit.com/r/computervision | 2021-02-01
I used pHash to find the duplicate images. The hashing algorithm is ingenious, and it was able to find (and thus remove) duplicates efficiently. imagededup has perceptual hashing.
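To illustrate the idea behind perceptual hashing, here is a minimal sketch of an average hash, a simpler relative of pHash (which is based on a discrete cosine transform). It is not imagededup's API. The function names, the 4x4 toy grids, and the `max_distance` threshold are all assumptions made for illustration; real images would first be resized and converted to grayscale.

```python
from typing import List

Grid = List[List[int]]  # small grayscale image, values 0-255


def average_hash(pixels: Grid) -> int:
    """Build a bit string with one bit per pixel: 1 if the pixel is above the mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    bits = 0
    for p in flat:
        bits = (bits << 1) | (1 if p > mean else 0)
    return bits


def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions where two hashes differ."""
    return bin(a ^ b).count("1")


def is_near_duplicate(a: Grid, b: Grid, max_distance: int = 2) -> bool:
    """Treat two images as duplicates when their hashes differ in few bits.

    max_distance is an arbitrary illustrative threshold, not a recommended value.
    """
    return hamming_distance(average_hash(a), average_hash(b)) <= max_distance


# Toy data: a block pattern, a uniformly brightened copy, and a checkerboard.
original = [
    [10, 10, 200, 200],
    [10, 10, 200, 200],
    [200, 200, 10, 10],
    [200, 200, 10, 10],
]
brighter = [[min(255, p + 20) for p in row] for row in original]
different = [
    [200, 10, 200, 10],
    [10, 200, 10, 200],
    [200, 10, 200, 10],
    [10, 200, 10, 200],
]

print(is_near_duplicate(original, brighter))   # brightness shift keeps the hash
print(is_near_duplicate(original, different))  # different structure changes it
```

Because the hash depends only on which pixels sit above the image's own mean, a uniform brightness change leaves it unchanged, which is why near-duplicates can be found by comparing hashes instead of raw pixels.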
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Project mention: Short term memory solutions for video tasks? | reddit.com/r/deeplearning | 2021-01-22
An easier way to build neural search on the cloud
Project mention: Show HN: Jina – Open-source AI framework to build search for anything, fast | news.ycombinator.com | 2021-02-10
Awesome work on hand pose estimation/tracking
Project mention: Ombromanie: Creating Hand Shadow stories with Azure Speech and TensorFlow.js Handposes | dev.to | 2021-02-22
'Awesome' list for hand tracking
Unsupervised Data Augmentation (UDA)
Project mention: A Visual Survey of Data Augmentation in NLP | dev.to | 2020-08-26
The words that replace the original word are chosen by calculating TF-IDF scores of words over the whole document and taking the lowest ones. You can refer to the code implementation for this in the original paper here.
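The scoring step described in that comment can be sketched roughly as follows. This is not the UDA implementation: the function names and the toy corpus are hypothetical, documents are assumed to be pre-tokenized, and the plain `tf * log(N / df)` formula without smoothing is just one common TF-IDF variant.

```python
import math
from collections import Counter
from typing import Dict, List


def tfidf_scores(documents: List[List[str]]) -> List[Dict[str, float]]:
    """Return one {word: tf-idf score} dict per tokenized document."""
    n_docs = len(documents)
    # Document frequency: in how many documents each word appears.
    doc_freq = Counter()
    for doc in documents:
        doc_freq.update(set(doc))

    scores = []
    for doc in documents:
        counts = Counter(doc)
        scores.append({
            word: (count / len(doc)) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return scores


def lowest_scoring_words(doc_scores: Dict[str, float], k: int) -> List[str]:
    """Pick the k words with the lowest TF-IDF score (ties broken alphabetically)."""
    return sorted(doc_scores, key=lambda w: (doc_scores[w], w))[:k]


# Hypothetical toy corpus of three tokenized sentences.
corpus = [
    ["the", "movie", "was", "great"],
    ["the", "plot", "was", "dull"],
    ["the", "cast", "was", "superb"],
]
scores = tfidf_scores(corpus)
print(lowest_scoring_words(scores[0], 2))
```

Words that occur in every document get an inverse document frequency of zero, so they end up at the bottom of the ranking, which is what "taking the lowest ones" selects.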
A Python wrapper for the tesseract-ocr API
Project mention: OCR and Pytesseract | reddit.com/r/learnpython | 2021-02-16
For OCR I'd generally suggest tesserocr over pytesseract, because it's a true library binding which means you can process OpenCV images in memory instead of having to save them first, and process multiple images with the same tesseract engine instead of having to start a new one for every image.
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
Project mention: Top 10 Developer Trends, Wed Sep 02 2020 | dev.to | 2020-09-02
cleardusk / 3DDFA_V2
RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation
Project mention: Gura smug dance | reddit.com/r/Hololive | 2021-02-14
You can give my GUI implementation of RIFE a shot.
Composable GAN framework with API and user interface
Project mention: So I trained an AI to generate Pokemon sprites and this is the result | reddit.com/r/teenagers | 2021-01-21
There is something called HyperGAN which builds generative adversarial networks (GANs) and those networks take some images as input and give those as output. Here is the GitHub page for that.
A list of synthetic datasets and tools for computer vision
Project mention: Can video games help overcome the problem of 3D invariances and object permanence? | reddit.com/r/computervision | 2020-12-24
Contrastive unpaired image-to-image translation, faster and lighter training than CycleGAN (ECCV 2020, in PyTorch)
Project mention: [D] What's the successor to pix2pixHD | reddit.com/r/MachineLearning | 2021-01-26
What are some of the best open-source Computer Vision projects in Python? This list will help you: