Python optical-character-recognition

Open-source Python projects categorized as optical-character-recognition

Top 15 Python optical-character-recognition Projects

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

  • Project mention: Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide | dev.to | 2023-12-27

    PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]

  • paperless-ngx

    A community-supported supercharged version of paperless: scan, index and archive all your physical documents

  • Project mention: I accidentally built a meme search engine | news.ycombinator.com | 2024-04-13

    I steered a friend towards Paperless (and away from an LLM solution) as a way of searching/accessing GBs of architectural PDFs recently - so far, it’s apparently working well for them.

    https://github.com/paperless-ngx/paperless-ngx

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • doctr

    docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

  • Project mention: Show HN: How do you OCR on a Mac using the CLI or just Python for free | news.ycombinator.com | 2024-01-02

    https://github.com/mindee/doctr/issues/1049

    I am looking for something this polished and reliable for handwriting, does anyone have any pointers? I want to integrate it in a workflow with my eink tablet I take notes on. A few years ago, I tried various models, but they performed poorly (around 80% accuracy) on my handwriting, which I can read almost 90% of the time.

  • tesserocr

    A Python wrapper for the tesseract-ocr API

  • J.A.R.V.I.S

    Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.

  • Project mention: πŸ”₯ 600+ 🌟 and 140+ Forks to J.A.R.V.I.S πŸš€, Added Dynamic Face Recognition to J.A.R.V.I.S πŸ€– | dev.to | 2023-05-14

    [GitHub Code](https://github.com/GauravSingh9356/J.A.R.V.I.S

  • kraken

    OCR engine for all the languages (by mittagessen)

  • parseq

    Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

  • Project mention: need help for license plate number segmentation | /r/deeplearning | 2023-05-31

    I really recommend the usage of scene text recognition models. They are perfect for these type of usecases: https://github.com/baudm/parseq or check https://paperswithcode.com/task/scene-text-recognition make sure to check the licenses and good luck πŸ‘πŸ»

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  • signature_extractor

    A super lightweight image processing algorithm for detection and extraction of overlapped handwritten signatures on scanned documents using OpenCV and scikit-image.

  • edenai-apis

    Eden AI: simplify the use and deployment of AI technologies by providing a unique API that connects to the best possible AI engines

  • Project mention: We're Building an Open-Source LLM/AI API Wrapper: Here's Why | news.ycombinator.com | 2023-08-28

    HackerNoon featured our latest article in the "Future of AI" category

    We explain how Eden AI contributes to the AI ecosystem in structuring AI and LLM APIs by creating the most accomplished Open-Source wrapper possible.

    You can support us in reaching 1000 stars on Github here: https://github.com/edenai/edenai-apis

  • OS-Bot-COLOR

    A lightweight desktop client & toolkit for writing, controlling and monitoring color-based automation scripts.

  • handprint

    Apply different text recognition services to images of handwritten documents.

  • Orchestra

    Orchestra is a sheet music reader (optical music recognition (OMR) system) that converts sheet music to a machine-readable version.

  • image-to-sound-python-

    A python project for converting an Image into audible sound using OCR and speech synthesis

  • Typewriter-OCR-TintypeText

    This typewriter OCR code can convert JPEG typewritten text images into RTF documents, while removing typos for you!

  • Braille-OCR-e-Braille-Tales

    This braille OCR code can convert JPEG braille text images into RTF documents, while removing typos for you!

  • SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Python optical-character-recognition related posts

  • OCR at Edge on Cloudflare Constellation

    3 projects | news.ycombinator.com | 3 Jul 2023
  • Tesserocr

    1 project | /r/pycharm | 25 Jan 2023
  • New Eco-Friendly Indigo Typewriter Ink (Recipe Included!)

    1 project | /r/typewriters | 30 Dec 2022
  • Digitalizing typewritten text

    1 project | /r/typewriters | 5 Dec 2022
  • Python Testing 1

    1 project | /r/Testing_MR_Bot | 9 Nov 2022
  • How to make Brilliant Blue FCF (blue food dye)-glycerine erasable typewriter ink

    1 project | /r/typewriters | 6 May 2022
  • Make Your Own Gamebook

    2 projects | /r/gamebooks | 8 Apr 2022
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 4 May 2024
    Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more β†’

Index

What are some of the best open-source optical-character-recognition projects in Python? This list will help you:

Project Stars
1 EasyOCR 21,953
2 paperless-ngx 16,882
3 doctr 3,075
4 tesserocr 1,930
5 J.A.R.V.I.S 786
6 kraken 643
7 parseq 500
8 signature_extractor 426
9 edenai-apis 360
10 OS-Bot-COLOR 229
11 handprint 157
12 Orchestra 96
13 image-to-sound-python- 55
14 Typewriter-OCR-TintypeText 10
15 Braille-OCR-e-Braille-Tales 2

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com