Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev. Learn more →
Mmocr Alternatives
Similar projects and alternatives to mmocr
-
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
-
CRAFT-pytorch
Official implementation of Character Region Awareness for Text Detection (CRAFT)
-
Onboard AI
Learn any GitHub repo in 59 seconds. Onboard AI learns any GitHub repo in minutes and lets you chat with it to locate functionality, understand different parts, and generate new code. Use it for free at www.getonboard.dev.
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods.
-
iam-crnn-ctc-recognition
IAM Dataset Handwriting Recognition Using CRNN, CTC Loss, DeepSpeech Beam Search, And KenLM Scorer
-
keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
-
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
-
InfluxDB
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
-
-
BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
-
-
TextRecognitionDataGenerator
A synthetic data generator for text recognition
-
-
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
-
-
mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
-
-
mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
-
-
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
mmocr reviews and mentions
-
Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM
Yup! But I'm still exploring options. (any recommendations would be welcomed!) Here are some candidates I'm considering:
- https://github.com/mindee/doctr
- https://github.com/open-mmlab/mmocr
- https://github.com/PaddlePaddle/PaddleOCR (honestly I don't know Mandarin so I'm a bit stuck)
- https://github.com/clovaai/donut - While it's primarily an "OCR-free document understanding transformer," I think it's worth experimenting with. Think I can sort this out by letting the LLM reason through it multiple times (although this will impact performance)
- yesterday got a suggestion to consider https://github.com/kakaobrain/pororo - I don't think development is still active but the results are pretty great on Korean text
-
MMDeploy: Deploy All the Algorithms of OpenMMLab
MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
-
[P]Modern open-source OCR capabilities and which model to choose
Link: https://github.com/open-mmlab/mmocr
-
Text Classification Library for a Quick Baseline
For more text classification baselines (CRNN, NRTR, RubustScanner, SAR, SegOCR), checkout https://github.com/open-mmlab/mmocr They are reproducible, customizable.
-
A note from our sponsor - Onboard AI
getonboard.dev | 10 Dec 2023
Stats
open-mmlab/mmocr is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of mmocr is Python.