Need to translate a 200-page book

This page summarizes the projects mentioned and recommended in the original post on /r/languagelearning.

  • tesseract-ocr

    Tesseract Open Source OCR Engine (main repository)

  • After that you would use Tesseract-OCR to OCR the pages. Tesseract is an open-source, multi-platform OCR engine. If the typeface is non-standard, you would have to train the recognition engine on your own data.

  • bergamot-translator

    Cross-platform C++ library focusing on optimized machine translation on consumer-grade devices.

  • Then you can run the OCR text through Google, DeepL, or one of the other commercial services to translate it as a first pass. They all sell API access to their engines for bulk translation. Or you can use an open-source engine like the new Bergamot engine.
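The two steps described in the comments (OCR the scanned pages, then send the text for a first-pass translation) could be sketched roughly as follows. This is only an illustration under several assumptions: the `tesseract` command-line tool is installed and on the PATH, the scans are PNG files whose names sort in page order, and `translate` is a hypothetical callable that you supply to wrap whichever service or engine you choose. The 4000-character chunk size is an arbitrary placeholder, not a documented limit of any service.

```python
import subprocess
from pathlib import Path


def build_tesseract_cmd(image_path, lang="eng"):
    """Build the command line for OCR-ing one page image.

    Passing "stdout" as the output base makes tesseract print the
    recognized text instead of writing it to a file.
    """
    return ["tesseract", str(image_path), "stdout", "-l", lang]


def ocr_page(image_path, lang="eng"):
    """Run tesseract on one image and return the recognized text."""
    result = subprocess.run(
        build_tesseract_cmd(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if tesseract fails
    )
    return result.stdout


def ocr_book(scan_dir, lang="eng", pattern="*.png"):
    """OCR every matching image in scan_dir, in filename order."""
    pages = sorted(Path(scan_dir).glob(pattern))
    return "\n\n".join(ocr_page(page, lang) for page in pages)


def split_into_chunks(text, max_chars=4000):
    """Group paragraphs into chunks of at most max_chars characters.

    A single paragraph longer than max_chars is kept whole as its own
    chunk rather than being cut in the middle.
    """
    chunks, current = [], ""
    for para in text.split("\n\n"):
        para = para.strip()
        if not para:
            continue
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks


def translate_text(text, translate, max_chars=4000):
    """Translate text chunk by chunk with a caller-supplied function.

    `translate` is a placeholder (an assumption of this sketch): any
    callable that takes a string and returns its translation.
    """
    chunks = split_into_chunks(text, max_chars)
    return "\n\n".join(translate(chunk) for chunk in chunks)
```

A call might look like `translate_text(ocr_book("scans", lang="deu"), my_translate)`, where `scans` and `my_translate` are hypothetical names. Splitting on paragraph boundaries keeps each request self-contained, which tends to matter for translation quality. Check the actual request limits of the service you pick before settling on a chunk size.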

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.
