Top 23 Computer Vision Open-Source Projects
-
Project mention: stereodemo: compare several recent stereo depth estimation methods in the wild | reddit.com/r/computervision | 2022-05-23
OpenCV stereo block matching and semi-global block matching baselines, with all their parameters
-
Project mention: How to embed a dlib based face recognition library written in python? | reddit.com/r/reactnative | 2022-04-08
I want use this library: https://github.com/ageitgey/face_recognition. Its a python library based on https://dlib.net for face recognition
-
SonarLint
Deliver Cleaner and Safer Code - Right in Your IDE of Choice!. SonarLint is a free and open source IDE extension that identifies and catches bugs and vulnerabilities as you code, directly in the IDE. Install from your favorite IDE marketplace today.
-
Project mention: 800 free Computer Science classes you can take online right now, with video lectures | reddit.com/r/machinelearningnews | 2022-05-25
-
openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
If you want something out-of-the-box, use an additional Kinect sensor. The lens itself does not have built-in algorithm for IK. You can also feed the video stream of your lens into an openpose model (https://github.com/CMU-Perceptual-Computing-Lab/openpose) and work with the results. However, I don't think the lens is able to provide a decent inference speed.
-
applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
The second repo I LOVE is Eugene Yan’s Applied ML repository. This is a brilliant idea to create and actually something I was planning on sort of casually doing in my non-existent free time… Anyhow, it is a curated list of technical posts from top engineering teams (Netflix, Amazon, Pinterest, Linkedin, etc.) detailing how they built out different types of AI/ML systems (e.g. forecasting, recommenders, search and ranking, etc.). Ofc, it focuses on AI/ML, but something similar could be made for the traditional or BI-oriented analytics stack, as well as the streaming world, super high value for practitioners! Btw-one of my favorite things at BCG used to be looking at our IT architecture team’s reference architecture diagrams… the best way to understand technologies is to look at how a ton of stuff is architected… and its fun!
-
darknet
YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet ) (by AlexeyAB)
...and since I'm here, be aware of the issues before you use YOLOv5: https://github.com/AlexeyAB/darknet/issues/5920
-
Project mention: Trying to understand PatchGAN discriminator | reddit.com/r/deeplearning | 2021-12-09
Code for https://arxiv.org/abs/1611.07004 found: https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix
-
Scout APM
Less time debugging, more time building. Scout APM allows you to find and fix performance issues with no hassle. Now with error monitoring and external services monitoring, Scout is a developer's best friend when it comes to application development.
-
Project mention: obs-backgroundremoval: An OBS plugin for removing background in portrait video | news.ycombinator.com | 2022-02-25
-
Project mention: Object detection with depth measurement using pre-trained models with OAK-D | reddit.com/r/computerscience | 2022-04-29
Code Link : https://github.com/spmallick/learnopencv/tree/master/OAK-Object-Detection-with-Depth
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Project mention: [P] Training to read PDF documents. Any ideas? | reddit.com/r/MachineLearning | 2022-05-20If all you need to do is OCR, check out https://github.com/JaidedAI/EasyOCR , it's a similar architecture to the cloud services, without all the $. You'll end up with extracted text and bounding boxes for it.
-
Project mention: Released my 2nd best open-source project out of 170 projects shipped in the past 2 yrs to make AI powered search easy | reddit.com/r/opensource | 2022-03-26
And this latest project Clip as a service has just received 10k stars on GitHub, very close to my other project Jina which has 14k stars. Clip as a service providers an easy and intuitive way to create embeddings for text and images that can be used to create semantic text and image search.
-
d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
Project mention: How to pre-train BERT on different objective tasks using HuggingFace | reddit.com/r/deeplearning | 2022-04-10There might is bert library for pre-train bert model in huggingface, But I suggestion that you train bert model in native pytorch to understand detail, Limu's course is recommended for you
-
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Project mention: Hugging Face Introduces ‘Datasets’: A Lightweight Community Library For Natural Language Processing (NLP) | reddit.com/r/artificial | 2021-11-08Code for https://arxiv.org/abs/2109.02846 found: https://github.com/huggingface/datasets
-
AirSim
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
Project mention: Currently writing out a plan for an RL based path-planning project. (I'm doing it for my Smart Vehicles course in my Master's Degree) Don't have much domain knowledge atm but looking for some advice on how to approach the problem? | reddit.com/r/reinforcementlearning | 2021-10-02AirSim: https://github.com/microsoft/AirSim
-
Awesome-pytorch-list
A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
Project mention: Similar open source long library list to TF like Pytorch "ECOSYSTEM TOOLS" | reddit.com/r/tensorflow | 2021-11-19I got the following as recombination from elsewhere - https://github.com/jtoy/awesome-tensorflow and there is one for pt as well https://github.com/bharathgs/Awesome-pytorch-list . Thx for the help :D
-
-
Project mention: [D] My experience with running PyTorch on the M1 GPU | reddit.com/r/MachineLearning | 2022-05-19
$ python vgg16-cifar10.py --device "cuda" torch 1.11.0+cu102 device cuda Downloading https://www.cs.toronto.edu/~kriz/cifar-10-python.tar.gz to data/cifar-10-python.tar.gz 170499072it [00:46, 3628424.66it/s] Extracting data/cifar-10-python.tar.gz to data Downloading: "https://github.com/pytorch/vision/archive/v0.11.0.zip" to /home/md/.cache/torch/hub/v0.11.0.zip Epoch: 001/001 | Batch 0000/1406 | Loss: 2.6563 Epoch: 001/001 | Batch 0100/1406 | Loss: 2.4686 Epoch: 001/001 | Batch 0200/1406 | Loss: 2.1224 Epoch: 001/001 | Batch 0300/1406 | Loss: 2.1879 Epoch: 001/001 | Batch 0400/1406 | Loss: 2.1733 Epoch: 001/001 | Batch 0500/1406 | Loss: 2.2413 Epoch: 001/001 | Batch 0600/1406 | Loss: 2.0518 Epoch: 001/001 | Batch 0700/1406 | Loss: 2.1621 Epoch: 001/001 | Batch 0800/1406 | Loss: 1.9033 Epoch: 001/001 | Batch 0900/1406 | Loss: 1.8379 Epoch: 001/001 | Batch 1000/1406 | Loss: 1.9572 Epoch: 001/001 | Batch 1100/1406 | Loss: 1.8823 Epoch: 001/001 | Batch 1200/1406 | Loss: 1.7978 Epoch: 001/001 | Batch 1300/1406 | Loss: 2.0239 Epoch: 001/001 | Batch 1400/1406 | Loss: 1.8389 Time / epoch without evaluation: 6.75 min
-
CycleGAN
Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.
It's image translation, check this out https://github.com/junyanz/CycleGAN
-
dlib seems cool, did not try it yet but i thought its a good idea to have a link to it here.
-
deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Project mention: Consolidated Video lectures for Machine Learning(including DL, CV, NLP, etc) | reddit.com/r/developersIndia | 2022-01-22Also this as well for whoever needs it
-
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Project mention: [P] Release the Vision Transformer Cookbook with Tensorflow ! (Thanks to @lucidrains) | reddit.com/r/MachineLearning | 2022-03-28looks great Junho! i've linked to it from https://github.com/lucidrains/vit-pytorch like you asked :)
-
Project mention: How to produce data visualizations like this? | reddit.com/r/computervision | 2022-01-05
-
Project mention: CAIRE - content aware image resizing library supporting GUI mode | reddit.com/r/golang | 2021-12-15
Computer Vision related posts
- stereodemo: compare several recent stereo depth estimation methods in the wild
- Fast, scalable, and extensive implementation of Poisson Image Editing
- [P] Training to read PDF documents. Any ideas?
- Extracting Triangular 3D Models, Materials, and Lighting from Images
- [D] My experience with running PyTorch on the M1 GPU
- Docker multi-architecture, .NET 6.0 and OpenCVSharp
- Open source software that digitalizes real life chess boards.
Index
What are some of the best open-source Computer Vision projects? This list will help you:
Project | Stars | |
---|---|---|
1 | OpenCV | 61,786 |
2 | Face Recognition | 44,304 |
3 | cs-video-courses | 36,079 |
4 | openpose | 24,227 |
5 | applied-ml | 19,702 |
6 | darknet | 19,083 |
7 | pytorch-CycleGAN-and-pix2pix | 17,583 |
8 | mediapipe | 17,296 |
9 | learnopencv | 16,349 |
10 | EasyOCR | 14,847 |
11 | jina | 14,601 |
12 | d2l-en | 13,445 |
13 | datasets | 13,385 |
14 | AirSim | 13,090 |
15 | Awesome-pytorch-list | 13,034 |
16 | facenet | 12,490 |
17 | vision | 11,654 |
18 | CycleGAN | 11,162 |
19 | Dlib | 11,149 |
20 | deep-learning-drizzle | 10,102 |
21 | vit-pytorch | 10,073 |
22 | fashion-mnist | 10,030 |
23 | caire | 9,884 |
Are you hiring? Post a new remote job listing for free.