If you want to OCR your PDF, the fastest, easiest and less buggy tool out there is "pdfsandwich"

This page summarizes the projects mentioned and recommended in the original post on /r/linux

Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  • I would recommend ocrmypdf over pdfsandwich, as latter's development seems to have stopped since 2018.

  • normcap

    OCR powered screen-capture tool to capture information instead of images

  • I've had some success with normcap, but I'm not a heavy user.

  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • TextSnatcher

    How to Copy Text from Images ? Answer is TextSnatcher !. Perform OCR operations in seconds on Linux Desktop.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts