Top 9 Python text-detection Projects
-
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
-
comic-translate
Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.
-
Image2CAD
An application to translate a raster image of a CAD drawing sheet into a user-editable DXF format.
Project mention: [DISC] - The angel who came to pick me up is a Gal (Oneshot by Shiraishi Kouhei) | /r/manga | 2023-09-06
OCR works pretty good. ocr.space, ocr.best and cotrans.touhou.ai/ are all pretty nice.
Project mention: Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM | news.ycombinator.com | 2023-10-28
Yup! But I'm still exploring options. (any recommendations would be welcomed!) Here are some candidates I'm considering:
- https://github.com/mindee/doctr
- https://github.com/open-mmlab/mmocr
- https://github.com/PaddlePaddle/PaddleOCR (honestly I don't know Mandarin so I'm a bit stuck)
- https://github.com/clovaai/donut - While it's primarily an "OCR-free document understanding transformer," I think it's worth experimenting with. Think I can sort this out by letting the LLM reason through it multiple times (although this will impact performance)
- yesterday got a suggestion to consider https://github.com/kakaobrain/pororo - I don't think development is still active but the results are pretty great on Korean text
Project mention: Show HN: How do you OCR on a Mac using the CLI or just Python for free | news.ycombinator.com | 2024-01-02
https://github.com/mindee/doctr/issues/1049
I am looking for something this polished and reliable for handwriting, does anyone have any pointers? I want to integrate it in a workflow with my eink tablet I take notes on. A few years ago, I tried various models, but they performed poorly (around 80% accuracy) on my handwriting, which I can read almost 90% of the time.
Project mention: How can I install pytorch versions < 1.0 and torchvision==0.13 or lower? | /r/pytorch | 2023-07-16
Python text-detection related posts
- Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM
- How can I install pytorch versions < 1.0 and torchvision==0.13 or lower?
- Ask HN: Best pretrained OCR model for dashcam footage?
- Why do new architectures still use old models?
- Decimal number detection
- [N] MMOCR: A Toolbox for Text Detection, Recognition, and Understanding Based on PyTorch
- OCR Baselines Based on PyTorch
Index
What are some of the best open-source text-detection projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | manga-image-translator | 4,206 |
2 | mmocr | 4,077 |
3 | AdelaiDet | 3,324 |
4 | doctr | 3,038 |
5 | CRAFT-pytorch | 2,947 |
6 | keras-ocr | 1,331 |
7 | comic-translate | 247 |
8 | Image2CAD | 76 |
9 | vkit | 21 |