Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more β
Top 23 optical-character-recognition Open-Source Projects
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
J.A.R.V.I.S
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
-
Tesseract4Android
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
signature_extractor
A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.
-
edenai-apis
Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines
-
OS-Bot-COLOR
A lightweight desktop client & toolkit for writing, controlling and monitoring color-based automation scripts.
-
Orchestra
Orchestra is a sheet music reader (optical music recognition (OMR) system) that converts sheet music to a machine-readable version.
-
formkiq-core
A full-featured Document Layer for your application, providing the functionality of a flexible document management system, including storage, discovery, processing, and retrieval. Deploys directly into your Amazon Web Services Cloud. π Star to support our work!
-
DocumentLab
OCR using tesseract, ImageMagick, EmguCV, an advanced query language and a fluent query interface for C#
-
image-to-sound-python-
A python project for converting an Image into audible sound using OCR and speech synthesis
-
Typewriter-OCR-TintypeText
This typewriter OCR code can convert JPEG typewritten text images into RTF documents, while removing typos for you!
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
I steered a friend towards Paperless (and away from an LLM solution) as a way of searching/accessing GBs of architectural PDFs recently - so far, itβs apparently working well for them.
https://github.com/paperless-ngx/paperless-ngx
Project mention: Show HN: How do you OCR on a Mac using the CLI or just Python for free | news.ycombinator.com | 2024-01-02https://github.com/mindee/doctr/issues/1049
I am looking for something this polished and reliable for handwriting, does anyone have any pointers? I want to integrate it in a workflow with my eink tablet I take notes on. A few years ago, I tried various models, but they performed poorly (around 80% accuracy) on my handwriting, which I can read almost 90% of the time.
Project mention: π₯ 600+ π and 140+ Forks to J.A.R.V.I.S π, Added Dynamic Face Recognition to J.A.R.V.I.S π€ | dev.to | 2023-05-14[GitHub Code](https://github.com/GauravSingh9356/J.A.R.V.I.S
I really recommend the usage of scene text recognition models. They are perfect for these type of usecases: https://github.com/baudm/parseq or check https://paperswithcode.com/task/scene-text-recognition make sure to check the licenses and good luck ππ»
It's active field of research. e.g. here https://github.com/amaljoseph/Signature-Verification_System_using_YOLOv5-and-CycleGAN or here https://github.com/ahmetozlu/signature_extractor
Project mention: We're Building an Open-Source LLM/AI API Wrapper: Here's Why | news.ycombinator.com | 2023-08-28HackerNoon featured our latest article in the "Future of AI" category
We explain how Eden AI contributes to the AI ecosystem in structuring AI and LLM APIs by creating the most accomplished Open-Source wrapper possible.
You can support us in reaching 1000 stars on Github here: https://github.com/edenai/edenai-apis
Project mention: A Clutter-Free Life: Going Paperless with Paperless-Ngx | news.ycombinator.com | 2023-10-07We may want to get in touch with each other. We have an Open Core document management platform that runs in AWS; I'm not sure about your roadmap, but there may be something there that's of use: https://github.com/formkiq/formkiq-core
Project mention: I want to make a small scripting language for my graduation project. | /r/compsci | 2023-06-26I've done a few languages in my time, here's a simple one that translates C like syntax into selenium operations: https://github.com/karisigurd4/SeleniumScript And a more novel query language for ocr'd document data: https://github.com/karisigurd4/DocumentLab
optical-character-recognition related posts
-
OCR at Edge on Cloudflare Constellation
-
Tesserocr
-
New Eco-Friendly Indigo Typewriter Ink (Recipe Included!)
-
Digitalizing typewritten text
-
Python Testing 1
-
How to make Brilliant Blue FCF (blue food dye)-glycerine erasable typewriter ink
-
Make Your Own Gamebook
-
A note from our sponsor - InfluxDB
www.influxdata.com | 3 May 2024
Index
What are some of the best open-source optical-character-recognition projects? This list will help you:
Project | Stars | |
---|---|---|
1 | EasyOCR | 21,953 |
2 | paperless-ngx | 16,882 |
3 | SwiftOCR | 4,579 |
4 | doctr | 3,038 |
5 | tesserocr | 1,930 |
6 | J.A.R.V.I.S | 786 |
7 | Tesseract4Android | 651 |
8 | kraken | 643 |
9 | react-native-tesseract-ocr | 547 |
10 | parseq | 500 |
11 | signature_extractor | 426 |
12 | edenai-apis | 360 |
13 | OS-Bot-COLOR | 229 |
14 | ssocr | 193 |
15 | handprint | 157 |
16 | Orchestra | 96 |
17 | formkiq-core | 91 |
18 | Easter2 | 73 |
19 | DocumentLab | 69 |
20 | image-to-sound-python- | 55 |
21 | EasyOCR-cpp | 27 |
22 | OCR-PDF-Action | 11 |
23 | Typewriter-OCR-TintypeText | 10 |
Sponsored