Tesseract.js – Pure JavaScript OCR for 100 Languages

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  • Tesseract.js

    Pure JavaScript OCR for more than 100 Languages 📖🎉🖥

  • The total size of the download seems to be 3-4MB (based on https://github.com/naptha/tesseract.js/blob/master/docs/loca...), which is less than I expected.

  • scene_text

    Discontinued. Finding text in photos made simple.

  • Disappointed by classic open-source OCR, I started an attempt to package neural-net-based approaches (https://github.com/gtsoukas/scene_text; don't use it, it is crap). Then I found out that Google's ML Kit (https://developers.google.com/ml-kit/vision/text-recognition) gives quite good results, as long as the text uses a Latin-based character set.

