The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning. Learn more →
Top 23 OCR Open-Source Projects
-
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
-
siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
-
Easydict
一个简洁优雅的词典翻译 macOS App。开箱即用,支持离线 OCR 识别,支持有道词典,🍎 苹果系统词典,🍎 苹果系统翻译,ChatGPT,Gemini,DeepL,Google,Bing,腾讯,百度,阿里,小牛,彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.
-
ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
-
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
-
video-subtitle-extractor
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
-
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
You will also need to install the Tesseract OCR engine, which can be downloaded and installed from the following link: https://github.com/tesseract-ocr/tesseract
Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
Project mention: I am out of the loop. Is Next.js "the future" and something I should consider adding to my knowledge pool? | /r/webdev | 2023-07-05What do you have against tesseract.js?
Project mention: The fastest tool for measuring anything on your screen | news.ycombinator.com | 2024-03-06
Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
I steered a friend towards Paperless (and away from an LLM solution) as a way of searching/accessing GBs of architectural PDFs recently - so far, it’s apparently working well for them.
https://github.com/paperless-ngx/paperless-ngx
Try SiYuan Note. It's free and open source local-first mix of Notion and Obsidian.
https://github.com/siyuan-note/siyuan
Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.
Sorry about that! I should have checked before sharing that link.
It looks like Microsoft published the code on GitHub, so you might be able to deploy it via Azure. (I haven't tried it.)
https://github.com/Microsoft/ailab/blob/master/Sketch2Code/R...
Be careful with unstructured:
https://github.com/Unstructured-IO/unstructured/blob/d11c70c...
from: https://github.com/open-webui/open-webui/issues/687
I personally macOS's dictionary app and Easydict as the translator.
I'm part of the team that build LlamaParse. It's net improvement compare to other PDF->Structured Text extractors (I build several in the past, includig https://github.com/axa-group/Parsr).
For character extraction, LlamaParse use a mixture of OCR / character extraction from the PDF (it's the only parser I'm aware of that address some of the buggy PDF font issues, check the 'text' mode to see raw document before reconstruction), use a mixture of heuristic and Machine learning models to reconstruct the document.
Once plug with a Recursive retrieval strategy, allow you to get Sota result on question answering over complexe text (see notebook: https://github.com/run-llama/llama_parse/blob/main/examples/...).
AMA
Project mention: RAGFlow is an open-source RAG engine based on deep document understanding | news.ycombinator.com | 2024-04-01Just link them to https://github.com/infiniflow/ragflow/blob/main/rag/llm/chat... :)
maybe this is better? https://github.com/clovaai/donut
I'm not sure
Project mention: [DISC] - The angel who came to pick me up is a Gal (Oneshot by Shiraishi Kouhei) | /r/manga | 2023-09-06OCR works pretty good. ocr.space, ocr.best and cotrans.touhou.ai/ are all pretty nice.
OCR related posts
- When Will the GenAI Bubble Burst?
- Ocrs: Rust library and CLI tool for extracting text from images
- 🔍Underrated Open Source Projects You Should Know About 🧠
- TextSnatcher: Copy text from images, for the Linux Desktop
- LlamaCloud and LlamaParse
- one of the Codia AI Design technologies: OCR Technology
- Easy-to-Use Apple Vision wrapper for text extraction and clustering
-
A note from our sponsor - WorkOS
workos.com | 26 Apr 2024
Index
What are some of the best open-source OCR projects? This list will help you:
Project | Stars | |
---|---|---|
1 | tesseract-ocr | 58,022 |
2 | PaddleOCR | 38,373 |
3 | Tesseract.js | 33,498 |
4 | ShareX | 27,635 |
5 | EasyOCR | 21,882 |
6 | paperless-ngx | 16,754 |
7 | siyuan | 15,733 |
8 | OCRmyPDF | 11,936 |
9 | LaTeX-OCR | 10,711 |
10 | Bob | 8,161 |
11 | ailab | 7,627 |
12 | unstructured | 6,193 |
13 | tessdata | 5,869 |
14 | Easydict | 5,863 |
15 | Parsr | 5,645 |
16 | ragflow | 5,516 |
17 | pytesseract | 5,513 |
18 | donut | 5,264 |
19 | video-subtitle-extractor | 4,814 |
20 | SwiftOCR | 4,579 |
21 | layout-parser | 4,438 |
22 | TNN | 4,281 |
23 | manga-image-translator | 4,169 |
Sponsored