Frog: OCR Tool for Linux

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

  1. normcap

    OCR powered screen-capture tool to capture information instead of images

  3. tessdata

    Trained models with fast variant of the "best" LSTM models + legacy models

    Frog appears to be a nice wrapper around Tesseract:

    https://github.com/tesseract-ocr/tessdata

    https://en.wikipedia.org/wiki/Tesseract_(software)

    The demo of course works perfectly on a Mac as this is already built into Ventura.

      In November 2020, Brewster Kahle from the Internet Archive praised Tesseract saying:

  4. doctr

    docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

    There's also DocTR which can do text detection and extraction out of the box.

    It's command line driven but can display the detected text as an overlay of the document.

    https://github.com/mindee/doctr

  5. OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  6. PaddleOCR

    Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

    I’ve had good results from PaddleOCR.

    https://github.com/PaddlePaddle/PaddleOCR

  7. flameshot

    Powerful yet simple to use screenshot software

    Cool! I've seen similar ideas before and made my own inspired by these some years ago. It's a simple bash script based on [flameshot](https://flameshot.org/) for taking the screenshot and Tesseract:

        #!/usr/bin/env bash
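
The script quoted above is cut off after its first line, so the commenter's actual implementation is not available here. As a rough illustration of the idea it describes (a flameshot screenshot passed to Tesseract), a minimal sketch might look like the following. It is not the commenter's script. The function names, the `OCR_LANG` variable, the `run` argument, and the use of `xclip` for the clipboard are all assumptions made for this example.

```shell
#!/usr/bin/env bash
# Sketch only: select a screen region, OCR it, copy the text to the clipboard.
# Assumes flameshot, tesseract and xclip are installed and an X11 session is running.
set -euo pipefail

# Tesseract language model; "eng" is an assumed default, not taken from the post.
OCR_LANG="${OCR_LANG:-eng}"

# Let the user select a region; --raw writes the PNG to stdout instead of a file.
capture() { flameshot gui --raw; }

# Read the image from stdin and print the recognized text to stdout.
recognize() { tesseract stdin stdout -l "$OCR_LANG"; }

# Put the text on the X11 clipboard (assumption: on Wayland a tool such as
# wl-copy would be needed instead).
to_clipboard() { xclip -selection clipboard; }

# The whole pipeline: screenshot -> OCR -> clipboard.
ocr_region() { capture | recognize | to_clipboard; }

# Run the pipeline only when asked, so the functions can also be sourced and reused.
if [[ "${1:-}" == "run" ]]; then
    ocr_region
fi
```

Saved under a hypothetical name such as `ocr-shot.sh`, it could be bound to a keyboard shortcut as `ocr-shot.sh run`. Keeping each stage in its own function makes it easy to swap the capture or clipboard tool without touching the rest.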


