Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 21 text-recognition Open-Source Projects
-
deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
AdelaiDet
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
-
League-of-Legends-Bot
League of legends bot is a pixel bot for League Of Legends 10.19, written in C# .NET using image processing , and dependency injection (Pattern Scripting)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM | news.ycombinator.com | 2023-10-28Yup! But I'm still exploring options. (any recommendations would be welcomed!) Here are some candidates I'm considering:
- https://github.com/mindee/doctr
- https://github.com/open-mmlab/mmocr
- https://github.com/PaddlePaddle/PaddleOCR (honestly I don't know Mandarin so I'm a bit stuck)
- https://github.com/clovaai/donut - While it's primarily an "OCR-free document understanding transformer," I think it's worth experimenting with. Think I can sort this out by letting the LLM reason through it multiple times (although this will impact performance)
- yesterday got a suggestion to consider https://github.com/kakaobrain/pororo - I don't think development is still active but the results are pretty great on Korean text
Project mention: Show HN: How do you OCR on a Mac using the CLI or just Python for free | news.ycombinator.com | 2024-01-02https://github.com/mindee/doctr/issues/1049
I am looking for something this polished and reliable for handwriting, does anyone have any pointers? I want to integrate it in a workflow with my eink tablet I take notes on. A few years ago, I tried various models, but they performed poorly (around 80% accuracy) on my handwriting, which I can read almost 90% of the time.
I really recommend the usage of scene text recognition models. They are perfect for these type of usecases: https://github.com/baudm/parseq or check https://paperswithcode.com/task/scene-text-recognition make sure to check the licenses and good luck 👍🏻
text-recognition related posts
- GitHub - hypertensiune/Android-Sudoku-Solver-OCR: Android app for solving sudoku puzzles.
- Show HN: OCR Sudoku Solver for Android
- Show HN: OCR Sudoku Solver for Android
- Show HN: Android app for solving sudokus from images
- Domain adaptation text recognition/OCR dataset (MSDA) and benchmark: Multi-source domain adaptation dataset for text recognition
- What are my options for extracting text from photos? I've already got ImageMagick installed, and assume there's a handful of PHP libraries for this task? Which are most performant and most likely to be maintained?
- Processing Identity Documents in Laravel
-
A note from our sponsor - InfluxDB
www.influxdata.com | 19 Apr 2024
Index
What are some of the best open-source text-recognition projects? This list will help you:
Project | Stars | |
---|---|---|
1 | mmocr | 4,059 |
2 | deep-text-recognition-benchmark | 3,619 |
3 | AdelaiDet | 3,326 |
4 | mlkit | 3,316 |
5 | TextRecognitionDataGenerator | 3,030 |
6 | doctr | 2,973 |
7 | tesseract-ocr-for-php | 2,776 |
8 | tika-python | 1,406 |
9 | react-native-tesseract-ocr | 543 |
10 | parseq | 493 |
11 | Meta-SelfLearning | 196 |
12 | react-native-mlkit-ocr | 160 |
13 | League-of-Legends-Bot | 156 |
14 | EverTranslator | 127 |
15 | scannerate | 44 |
16 | Google-MLKit-Android-Apps | 39 |
17 | Android-Sudoku-Solver-OCR | 35 |
18 | EasyOCR-cpp | 24 |
19 | vkit | 21 |
20 | summarize-text | 6 |
21 | sight-dotty | 4 |