VTK
tesseract-ocr
Metric | VTK | tesseract-ocr
---|---|---
Mentions | 5 | 109
Stars | 2,165 | 51,279
Growth | 2.0% | 1.3%
Activity | 9.9 | 7.7
Last commit | 3 days ago | 11 days ago
Language | C++ | C++
License | GNU General Public License v3.0 or later | Apache License 2.0
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
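As a rough illustration of how the growth percentage relates to the raw star count, here is a minimal sketch. It assumes that "month over month growth" means (stars now - stars a month ago) / stars a month ago; the page does not state the exact formula, and the function name is made up for this example.

```python
def stars_gained_last_month(stars_now: int, growth_pct: float) -> int:
    """Estimate how many stars were added over the last month.

    Assumption (not confirmed by the page): growth_pct is
    (stars_now - stars_prev) / stars_prev * 100.
    """
    stars_prev = stars_now / (1 + growth_pct / 100)
    return round(stars_now - stars_prev)


if __name__ == "__main__":
    # Star counts and growth figures taken from the table above.
    for name, stars, growth in [("VTK", 2165, 2.0), ("tesseract-ocr", 51279, 1.3)]:
        print(name, stars_gained_last_month(stars, growth))
```

Under that assumption, a lower growth percentage on a much larger star count can still mean far more new stars in absolute terms.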
VTK
-
[Discussion] What are some old C++ open source projects you wish were still active?
You are referring to the Visualization Toolkit, right? Looking at it, it appears to still be actively maintained, I think. https://github.com/Kitware/VTK
tesseract-ocr
-
Github packages/Apps that are must have for Physicists using Linux
I have recently discovered a few very helpful GitHub packages which help me make notes while listening to lectures. These would be 1. pix2tex (allows you to scan an equation and convert it to LaTeX) 2. pix2text (allows you to scan an equation with words in it and converts it to LaTeX and text) 3. Tesseract (not really a physics-related package, but it does allow me to copy notes from transcripts easily) 4. Mathpix, an app that performs all of the above operations better than the packages above, but one which ain't free.
-
Does anyone here has Statement of Purpose or Motivation letter dataset ??
I suggest manually creating a dataset using scribd.com. It offers a free trial period of 30 days, but I am uncertain whether it covers unlimited documents or not. Nevertheless, there are over one million statements of purpose (SOPs) available on the site. You could also use the Scribd downloader. Some documents may be composed of a bunch of images, so you will have to use something like Tesseract OCR.
-
Exploring OCR and text-to-speech in FFMPEG...
The ocr filter in ffmpeg is powered by the Tesseract library. As you will often find in ffmpeg, the build within ffmpeg has only a subset of the functionality of the original library - at least, for the moment. There's always the possibility of APIs being expanded in later ffmpeg releases. And it is open source of course, so there's the option of instigating those changes yourself - or using the original library in conjunction with ffmpeg if that suits your needs better.
-
Need to translate a 200 page book
After that you would use Tesseract-OCR to OCR the pages. Tesseract is an open-source, multiplatform OCR engine. If the typeface is non-standard, you would have to train the recognition engine on your own data.
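A minimal sketch of that OCR step, assuming the book's pages have already been scanned to PNG files and that the `tesseract` command-line program is installed and on PATH. The helper names and the directory layout are hypothetical; the only Tesseract interface used is the documented `tesseract IMAGE OUTPUTBASE -l LANG` invocation.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image, output_base, lang="eng"):
    """Return the argv list for one `tesseract IMAGE OUTPUTBASE -l LANG` call."""
    return ["tesseract", str(image), str(output_base), "-l", lang]


def ocr_pages(page_dir, out_dir, lang="eng"):
    """OCR every PNG in page_dir and write one .txt file per page to out_dir.

    page_dir and out_dir are hypothetical paths chosen by the caller.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("the tesseract executable was not found on PATH")
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for image in sorted(Path(page_dir).glob("*.png")):
        # Tesseract appends ".txt" to the output base name itself.
        output_base = out_dir / image.stem
        subprocess.run(build_tesseract_command(image, output_base, lang), check=True)
        written.append(Path(str(output_base) + ".txt"))
    return written
```

The `lang` argument names the language model to load, so a model trained on a non-standard typeface would be selected by passing its name there instead of `eng`.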
-
Are there any OCR and Speech-to-Text services that are privacy friendly?
Decent OCR: https://github.com/tesseract-ocr/tesseract
-
[D] Can I use ML/AI to read the back panels of electronic components?
tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository)
-
Help with DLLimport function
[1] https://github.com/tesseract-ocr/tesseract/blob/main/include/tesseract/capi.h
-
Is there "Text Extractor" tool (from Windows Powertoys) equivalent in linux?
-
PDF processing and analysis with open-source tools
> Would love to find a cheaper (local) option vs AWS
How about tesseract (https://github.com/tesseract-ocr/tesseract)
-
✨ Best Computer Vision Projects with Source Code 🚀
🔗 https://github.com/tesseract-ocr/tesseract
What are some alternatives?
PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
OpenCV - Open Source Computer Vision Library
pytesseract - A Python wrapper for Google Tesseract
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
ITK - Insight Toolkit (ITK) -- Official Repository. ITK builds on a proven, spatially-oriented architecture for processing, segmentation, and registration of scientific images in two, three, or more dimensions.
SVG++ - C++ SVG library
Face Recognition - The world's simplest facial recognition api for Python and the command line
Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
deep-license-plate-recognition - Automatic License Plate Recognition (ALPR) or Automatic Number Plate Recognition (ANPR) software that works with any camera.
libvips - A fast image processing library with low memory needs.
Mayan EDMS - Free Open Source Document Management System (mirror, no pull request or issues)