stereo-image-generation
EasyOCR
stereo-image-generation | EasyOCR | |
---|---|---|
2 | 39 | |
33 | 22,132 | |
- | 2.3% | |
10.0 | 3.6 | |
over 1 year ago | about 2 months ago | |
Python | Python | |
- | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
stereo-image-generation
-
I have a Rokid Air and am looking for suggestions as to how to include it in a HS classroom.
In context computer science class, for example, you may consider familiarizing students with generating stereo SBS image based on images of their choosing, perhaps using stable-diffusion-webui-depthmap-script (works in A1111 UI), or to keep things more focused https://github.com/m5823779/stereo-image-generation (no UI, but very simple to use in command-line).
-
3D side by side images (cross your eyes slowly until the images superimpose and you will see in 3D)
I started from that repository but it didn't work and had to rework a lot of what was there to make it better and more optimized. I can't share my code ATM because it's in draft-state (meaning : horrible mess) and I'm still working a lot on it but I couldn't resist to share a few of my results!
EasyOCR
-
Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide
PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
- OCR a lot of hand written invoice and records?
-
[P] EasyOCR in C++!
I just uploaded my C++ implementation of EasyOCR, a well known ocr library for python. Also dusted some cobwebbs from some audio related projects as well, feel free to leave feedback or contribute! I only implemented the most salient parts, so certainly could use some community help! Cheers!
-
OCR at Edge on Cloudflare Constellation
EasyOCR is a popular project if you are in an environment where you can use run Python and PyTorch (https://github.com/JaidedAI/EasyOCR). Other open source projects of note are PaddleOCR (https://github.com/PaddlePaddle/PaddleOCR) and docTR (https://github.com/mindee/doctr).
-
Donut: OCR-Free Document Understanding Transformer
The main one was https://github.com/JaidedAI/EasyOCR, mostly because, as promised, it was pretty easy to use, and uses pytorch (which I preferred in case I wanted to tweak it). It has been updated since, but at the time it was using CRNN, which is a solid model, especially for the time - it wasn't (academic) SOTA but not far behind that. I'm sure I could've coaxed better performance than I got out of it with some retraining and hyperparameter tuning.
-
Help with OCR of pixel-y numbers
Anyways, you can give a shot to EasyOCR, pretty solid and flexible
- How to perform document OCR?
-
Python unexpectedly quits (macOS ventura, M1)
The easyocr library: https://github.com/JaidedAI/EasyOCR
- I made a website for a friend who owns a restaurant. He's wondering if there's a way to upload a picture of his menu daily. What is the best way to do this?
-
Raspberry Pi Easyocr
Not used it on a Pi but maybe a Docker version (if there is one) would run? Compose file here
What are some alternatives?
SincNet - SincNet is a neural architecture for efficiently processing raw audio samples.
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
cnnimageretrieval-pytorch - CNN Image Retrieval in PyTorch: Training and evaluating CNNs for Image Retrieval in PyTorch
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
edge-connect - EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
merged_depth - Monocular Depth Estimation - Weighted-average prediction from multiple pre-trained depth estimation models
OpenCV - Open Source Computer Vision Library
calibrated-backprojection-network - PyTorch Implementation of Unsupervised Depth Completion with Calibrated Backprojection Layers (ORAL, ICCV 2021)
awesome-colab-notebooks - Collection of google colaboratory notebooks for fast and easy experiments
tesserocr - A Python wrapper for the tesseract-ocr API
LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.