SaaSHub helps you find the best software and product alternatives Learn more →
Mmocr Alternatives
Similar projects and alternatives to mmocr
-
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
-
-
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
-
-
-
iam-crnn-ctc-recognition
Discontinued IAM Dataset Handwriting Recognition Using CRNN, CTC Loss, DeepSpeech Beam Search, And KenLM Scorer
-
mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
-
-
-
-
-
keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
-
-
-
-
BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
mmocr discussion
mmocr reviews and mentions
-
Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM
Yup! But I'm still exploring options. (any recommendations would be welcomed!) Here are some candidates I'm considering:
- https://github.com/mindee/doctr
- https://github.com/open-mmlab/mmocr
- https://github.com/PaddlePaddle/PaddleOCR (honestly I don't know Mandarin so I'm a bit stuck)
- https://github.com/clovaai/donut - While it's primarily an "OCR-free document understanding transformer," I think it's worth experimenting with. Think I can sort this out by letting the LLM reason through it multiple times (although this will impact performance)
- yesterday got a suggestion to consider https://github.com/kakaobrain/pororo - I don't think development is still active but the results are pretty great on Korean text
-
MMDeploy: Deploy All the Algorithms of OpenMMLab
MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
-
[P]Modern open-source OCR capabilities and which model to choose
Link: https://github.com/open-mmlab/mmocr
-
Text Classification Library for a Quick Baseline
For more text classification baselines (CRNN, NRTR, RubustScanner, SAR, SegOCR), checkout https://github.com/open-mmlab/mmocr They are reproducible, customizable.
-
[N] MMOCR: A Toolbox for Text Detection, Recognition, and Understanding Based on PyTorch
We just released https://github.com/open-mmlab/mmocr, a new member in OpenMMLab https://openmmlab.com/. This first release supports
- OCR Baselines Based on PyTorch
-
A note from our sponsor - SaaSHub
www.saashub.com | 7 Oct 2024
Stats
open-mmlab/mmocr is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of mmocr is Python.