tessdata VS tessdata_fast

Compare tessdata and tessdata_fast and see how they differ.

tessdata

Trained models with fast variant of the "best" LSTM models + legacy models (by tesseract-ocr)

tessdata_fast

Fast integer versions of trained LSTM models (by tesseract-ocr)
Metric           tessdata              tessdata_fast
Mentions         10                    1
Stars            5,934                 442
Growth           1.8%                  2.0%
Activity         2.8                   4.6
Last commit      2 months ago          2 months ago
License          Apache License 2.0    Apache License 2.0
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

tessdata

Posts with mentions or reviews of tessdata. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-11-22.

tessdata_fast

Posts with mentions or reviews of tessdata_fast. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-06-10.
  • Has anyone used a RPI4 for OCR (Tesseract)? How did you feel about execution speed?
    2 projects | /r/raspberry_pi | 10 Jun 2022
    Then you could try different engines. --oem 0 disables the shiny new neural network stuff and uses the classic Tesseract engine; --oem 1 does the opposite. Try both, see which works best in terms of performance and accuracy for your particular use-case. You'll need to have training data for the legacy engine, though. These would work, or you could try tessdata_fast, which is specifically built for speed.
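
The engine comparison suggested in the post above can be sketched in Python as follows. This is a minimal sketch, not a definitive benchmark: it assumes the tesseract command-line tool is installed and on PATH, and the helper names build_command and time_engine are hypothetical. It relies only on the documented --oem, --tessdata-dir and -l options, with "stdout" as the output base so the recognized text is printed instead of written to a file.

```python
import subprocess
import time


def build_command(image_path, oem, tessdata_dir=None, lang="eng"):
    """Return the tesseract argument list for one engine mode.

    oem 0 selects the legacy engine, oem 1 the LSTM (neural network) engine.
    """
    if oem not in (0, 1):
        raise ValueError("this sketch only compares --oem 0 and --oem 1")
    # "stdout" as the output base makes tesseract print the recognized text.
    cmd = ["tesseract", image_path, "stdout", "--oem", str(oem), "-l", lang]
    if tessdata_dir is not None:
        # Directory holding the .traineddata files, e.g. a checkout of
        # tessdata or tessdata_fast (the path is the caller's assumption).
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


def time_engine(image_path, oem, tessdata_dir=None, lang="eng"):
    """Run tesseract once and return (elapsed_seconds, recognized_text)."""
    cmd = build_command(image_path, oem, tessdata_dir, lang)
    start = time.perf_counter()
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return time.perf_counter() - start, result.stdout
```

For example, time_engine("page.png", 0, "./tessdata") and time_engine("page.png", 1, "./tessdata_fast") would time the two engines on the same image, where page.png and both directory paths are placeholders. Going by the repository descriptions above, tessdata includes the legacy models that --oem 0 needs, while tessdata_fast is described as containing LSTM models only, so it is paired with --oem 1 here.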

What are some alternatives?

When comparing tessdata and tessdata_fast you can also consider the following projects:

tesseract-ocr-for-php - A wrapper to work with Tesseract OCR inside PHP.

Reichsanzeiger - Software and data related to "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger"

tesseract-ocr - Tesseract Open Source OCR Engine (main repository)

OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Tesseract.js - Pure Javascript OCR for more than 100 Languages 📖🎉🖥

doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

siyuan - A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

greenshot - Greenshot for Windows - Report bugs & features go here: https://greenshot.atlassian.net or look for information on:

gosseract - Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

Keycloak - Open Source Identity and Access Management For Modern Applications and Services

Paperless - Scan, index, and archive all of your paper documents