Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →
Top 23 Tesseract Open-Source Projects
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
siyuan
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
-
J.A.R.V.I.S
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
-
Tesseract4Android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
-
BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
We are going to be using an OCR (Optical Character Recognition) engine called Tesseract for the image-to-text recognition part. It is free software, released under the Apache License. Install the engine for your desired OS from their official website. I'm using Windows for this. Add the installation path to your environment variables.
Project mention: I am out of the loop. Is Next.js "the future" and something I should consider adding to my knowledge pool? | /r/webdev | 2023-07-05What do you have against tesseract.js?
Try SiYuan Note. It's free and open source local-first mix of Notion and Obsidian.
https://github.com/siyuan-note/siyuan
Project mention: TextSnatcher: Copy text from images, for the Linux Desktop | news.ycombinator.com | 2024-03-14Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it absolutely brilliant.
Project mention: Honour mode tip: Use a macro tool to save build/Tav presets | /r/BaldursGate3 | 2023-12-10I came across a post where a player was asking Larian for a way to save character presets for Honour Mode since we may want to run the same character over and over. While we await that feature, I'll share what I've been doing to "save" my presets: I use a macro (you can choose any macro software you prefer, i use this one) to record my mouse and keyboard movements and clicks while creating my Tav.
Project mention: 🔥 600+ 🌟 and 140+ Forks to J.A.R.V.I.S 🚀, Added Dynamic Face Recognition to J.A.R.V.I.S 🤖 | dev.to | 2023-05-14[GitHub Code](https://github.com/GauravSingh9356/J.A.R.V.I.S
you can train tesseract models as described here, but you would need to create a dataset first: https://github.com/tesseract-ocr/tesstrain
Project mention: Svelte Native: The Svelte Mobile Development Experience | news.ycombinator.com | 2024-01-29It's being used here: https://github.com/Akylas/OSS-DocumentScanner
Tesseract related posts
-
Highlighting Image Text
-
one of the Codia AI Design technologies: OCR Technology
-
DpScreenOCR – cross-platform OCR tool
-
OCR text to speech for disability
-
Honour mode tip: Use a macro tool to save build/Tav presets
-
Marker: Convert PDF to Markdown quickly with high accuracy
-
How to Read Text From an Image with Python
-
A note from our sponsor - InfluxDB
www.influxdata.com | 6 May 2024
Index
What are some of the best open-source Tesseract projects? This list will help you:
Project | Stars | |
---|---|---|
1 | tesseract-ocr | 58,182 |
2 | Tesseract.js | 33,577 |
3 | siyuan | 16,019 |
4 | OCRmyPDF | 12,067 |
5 | tessdata | 5,911 |
6 | TagUI | 5,351 |
7 | RPA-Python | 4,555 |
8 | PyMuPDF | 4,103 |
9 | tesseract-ocr-for-php | 2,788 |
10 | gosseract | 2,497 |
11 | tesserocr | 1,936 |
12 | textshot | 1,677 |
13 | PuloversMacroCreator | 1,524 |
14 | lambda-packs | 1,106 |
15 | J.A.R.V.I.S | 786 |
16 | ccextractor | 670 |
17 | Tesseract4Android | 651 |
18 | tesstrain | 569 |
19 | react-native-tesseract-ocr | 547 |
20 | OSS-DocumentScanner | 497 |
21 | tessdata_fast | 441 |
22 | BetterOCR | 389 |
23 | android-ocr | 331 |
Sponsored