OCR

Top 23 OCR Open-Source Projects

  • tesseract-ocr

    Tesseract Open Source OCR Engine (main repository)

  • Project mention: Highlighting Image Text | dev.to | 2024-04-30

    We are going to be using an OCR (Optical Character Recognition) engine called Tesseract for the image-to-text recognition part. It is free software, released under the Apache License. Install the engine for your desired OS from their official website. I'm using Windows for this. Add the installation path to your environment variables.
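    The setup described above (install the engine, then put it on your PATH) can be checked from Python with only the standard library. This is a minimal sketch, not the post author's code: it assumes the `tesseract` binary is reachable via PATH, uses the engine's documented command-line form `tesseract <image> stdout -l <lang>`, and the helper names and the `sample.png` file name are hypothetical.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, lang="eng"):
    """Build the CLI call `tesseract <image> stdout -l <lang>`."""
    # Passing "stdout" as the output base prints the recognized text
    # instead of writing it to a .txt file.
    return ["tesseract", image_path, "stdout", "-l", lang]


def ocr_image(image_path, lang="eng"):
    """Return the recognized text, or None if tesseract is not on PATH."""
    # shutil.which searches the PATH environment variable, which is why
    # the installation directory has to be added to it first.
    if shutil.which("tesseract") is None:
        return None
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Usage (hypothetical file name):
#     text = ocr_image("sample.png")
#     print(text if text is not None else "tesseract not found on PATH")
```

    If `ocr_image` returns None, the engine is either not installed or its directory is missing from PATH.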

  • PaddleOCR

    Awesome multilingual OCR toolkit based on PaddlePaddle: a practical, ultra-lightweight OCR system that recognizes 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices.

  • Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27

    PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]

  • Tesseract.js

    Pure JavaScript OCR for more than 100 languages 📖🎉🖥

  • Project mention: I am out of the loop. Is Next.js "the future" and something I should consider adding to my knowledge pool? | /r/webdev | 2023-07-05

    What do you have against tesseract.js?

  • ShareX

    ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.

  • Project mention: From Dull to Dazzling: 3 Methods to Elevate Your Writing with Visual Content | dev.to | 2024-05-02

    For Windows: ShareX - https://github.com/ShareX/ShareX

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

  • Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27

    PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]

  • paperless-ngx

    A community-supported supercharged version of paperless: scan, index and archive all your physical documents

  • Project mention: I accidentally built a meme search engine | news.ycombinator.com | 2024-04-13

    I steered a friend towards Paperless (and away from an LLM solution) as a way of searching/accessing GBs of architectural PDFs recently - so far, it’s apparently working well for them.

    https://github.com/paperless-ngx/paperless-ngx

  • siyuan

    A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.

  • Project mention: A structured note-taking app for personal use | news.ycombinator.com | 2023-12-21

    Try SiYuan Note. It's a free and open source, local-first mix of Notion and Obsidian.

    https://github.com/siyuan-note/siyuan

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  • Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14

    Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it is absolutely brilliant.
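    Adding a searchable text layer to a scanned PDF, as described above, can be scripted around the `ocrmypdf` command line. This is a minimal sketch under stated assumptions: the `ocrmypdf` executable is installed and on PATH, the helper names and file names are hypothetical, and only the `-l` (language) and `--skip-text` (leave pages that already contain text untouched) options are used.

```python
import shutil
import subprocess


def build_ocrmypdf_command(input_pdf, output_pdf, lang="eng", skip_text=True):
    """Build the CLI call `ocrmypdf [options] <input.pdf> <output.pdf>`."""
    cmd = ["ocrmypdf", "-l", lang]
    if skip_text:
        # Do not OCR pages that already have a text layer.
        cmd.append("--skip-text")
    cmd += [input_pdf, output_pdf]
    return cmd


def add_text_layer(input_pdf, output_pdf, lang="eng"):
    """Run OCRmyPDF; return False if the executable is not on PATH."""
    if shutil.which("ocrmypdf") is None:
        return False
    # check=True raises CalledProcessError if OCRmyPDF exits with an error.
    subprocess.run(build_ocrmypdf_command(input_pdf, output_pdf, lang), check=True)
    return True


# Usage (hypothetical file names):
#     add_text_layer("scan.pdf", "scan_searchable.pdf")
```

    The output file is a new PDF; the input is left unchanged, so the call is safe to retry.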

  • LaTeX-OCR

    pix2tex: Using a ViT to convert images of equations into LaTeX code.

  • Project mention: Detexify LaTeX Handwriting Symbol Recognition | news.ycombinator.com | 2023-11-14
  • Bob

    Bob is a translation and OCR application for macOS.

  • ailab

    Experience, Learn and Code the latest breakthrough innovations with Microsoft AI

  • Project mention: AI-Powered Developer Tools | news.ycombinator.com | 2023-08-06

    Sorry about that! I should have checked before sharing that link.

    It looks like Microsoft published the code on GitHub, so you might be able to deploy it via Azure. (I haven't tried it.)

    https://github.com/Microsoft/ailab/blob/master/Sketch2Code/R...

  • unstructured

    Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

  • Project mention: LlamaCloud and LlamaParse | news.ycombinator.com | 2024-02-20

    Be careful with unstructured:

    https://github.com/Unstructured-IO/unstructured/blob/d11c70c...

    from: https://github.com/open-webui/open-webui/issues/687

  • ragflow

    RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

  • Project mention: RAGFlow is an open-source RAG engine based on deep document understanding | news.ycombinator.com | 2024-04-01

    Just link them to https://github.com/infiniflow/ragflow/blob/main/rag/llm/chat... :)

  • Easydict

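    A concise and elegant Dictionary and Translator macOS App for looking up words and translating text. Works out of the box, supports offline OCR, and supports Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, Niutrans, Caiyun and Volcano translation.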

  • Project mention: What is the best dictionary app for macOS? | /r/macapps | 2023-06-05

    I personally use macOS's dictionary app and Easydict as the translator.

  • tessdata

    Trained models with fast variant of the "best" LSTM models + legacy models

  • Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

  • Project mention: LlamaCloud and LlamaParse | news.ycombinator.com | 2024-02-20

    I'm part of the team that built LlamaParse. It's a net improvement compared to other PDF->Structured Text extractors (I built several in the past, including https://github.com/axa-group/Parsr).

    For character extraction, LlamaParse uses a mixture of OCR and character extraction from the PDF (it's the only parser I'm aware of that addresses some of the buggy PDF font issues; check the 'text' mode to see the raw document before reconstruction), and uses a mixture of heuristics and machine learning models to reconstruct the document.

    Once plugged into a recursive retrieval strategy, it allows you to get SOTA results on question answering over complex text (see notebook: https://github.com/run-llama/llama_parse/blob/main/examples/...).

    AMA

  • pytesseract

    A Python wrapper for Google Tesseract

  • donut

    Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

  • Project mention: Ask HN: Why are all OCR outputs so raw? | news.ycombinator.com | 2023-11-15

    maybe this is better? https://github.com/clovaai/donut

    I'm not sure

  • video-subtitle-extractor

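    A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep learning based framework for video subtitle extraction, covering subtitle region detection and subtitle content extraction.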

  • SwiftOCR

    Fast and simple OCR library written in Swift

  • layout-parser

    A Unified Toolkit for Deep Learning Based Document Image Analysis

  • TNN

    TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts.

  • manga-image-translator

    Translate manga/images: one-click translation of text in all kinds of images. https://cotrans.touhou.ai/

  • Project mention: [DISC] - The angel who came to pick me up is a Gal (Oneshot by Shiraishi Kouhei) | /r/manga | 2023-09-06

    OCR works pretty good. ocr.space, ocr.best and cotrans.touhou.ai/ are all pretty nice.

NOTE: The open source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

OCR related posts

  • Highlighting Image Text

    1 project | dev.to | 30 Apr 2024
  • When Will the GenAI Bubble Burst?

    1 project | news.ycombinator.com | 4 Apr 2024
  • Ocrs: Rust library and CLI tool for extracting text from images

    1 project | news.ycombinator.com | 31 Mar 2024
  • 🔍Underrated Open Source Projects You Should Know About 🧠

    9 projects | dev.to | 20 Mar 2024
  • TextSnatcher: Copy text from images, for the Linux Desktop

    7 projects | news.ycombinator.com | 14 Mar 2024
  • LlamaCloud and LlamaParse

    9 projects | news.ycombinator.com | 20 Feb 2024
  • one of the Codia AI Design technologies: OCR Technology

    1 project | dev.to | 14 Feb 2024

Index

What are some of the best open-source OCR projects? This list will help you:

Rank Project Stars
1 tesseract-ocr 58,182
2 PaddleOCR 38,704
3 Tesseract.js 33,577
4 ShareX 27,749
5 EasyOCR 22,049
6 paperless-ngx 16,882
7 siyuan 16,019
8 OCRmyPDF 12,067
9 LaTeX-OCR 10,860
10 Bob 8,161
11 ailab 7,635
12 unstructured 6,515
13 ragflow 6,507
14 Easydict 6,009
15 tessdata 5,911
16 Parsr 5,656
17 pytesseract 5,528
18 donut 5,312
19 video-subtitle-extractor 4,889
20 SwiftOCR 4,579
21 layout-parser 4,476
22 TNN 4,289
23 manga-image-translator 4,239
