layout-parser vs EasyOCR

layout-parser

A Unified Toolkit for Deep Learning Based Document Image Analysis (by Layout-Parser)

Source Code

layout-parser.github.io

Suggest alternative

Edit details

EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc. (by JaidedAI)

Computer Vision OCR Deep Learning crnn Pytorch Lstm Machine Learning scene-text scene-text-recognition optical-character-recognition Cnn Data Mining Image processing Python easyocr information-retrieval

Source Code

jaided.ai

Suggest alternative

Edit details

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

layout-parser		EasyOCR
	Project
6	Mentions	38
4,438	Stars	21,882
3.3%	Growth	3.1%
0.0	Activity	4.6
about 2 months ago	Latest Commit	27 days ago
Python	Language	Python
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

layout-parser

Posts with mentions or reviews of layout-parser. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-06.

Crates for converting PDF's into Markdown
2 projects | /r/rust | 6 Jan 2023

I built my own solution using a combination of Tesseract and OpenCV (in python). But even though the source PDF content is computer generated, I still get sporadic OCR errors. After writing my solution, I came across this https://github.com/Layout-Parser/layout-parser which might be a better starting point for dealing with PDFs but I haven't tried it yet.
OCR help required
1 project | /r/Python | 18 Oct 2022

This sound more like a layout parking issue. Look at Layout Parser, it has helped me on many occasions when I was battling to extract info from PDF documents.
Amateur programmer here. Will Rust be used in backend for software in the future?
2 projects | /r/rust | 27 May 2022
Extract text from PDF
7 projects | /r/Python | 2 Nov 2021

One of the tools I'm excited about (but haven't used in production) is LayoutParser. It's open-source, and can do some document image analysis especially on non-generic docs.
Document Classification
2 projects | /r/computervision | 8 Jun 2021

One project that I saw not to long ago which might be useful is this: https://github.com/Layout-Parser/layout-parser
A Python Library for Document Layout Understanding
1 project | news.ycombinator.com | 8 Apr 2021

EasyOCR

Posts with mentions or reviews of EasyOCR. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-27.

Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide
5 projects | dev.to | 27 Dec 2023

PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
OCR a lot of hand written invoice and records?
1 project | /r/selfhosted | 7 Dec 2023
[P] EasyOCR in C++!
2 projects | /r/MachineLearning | 2 Dec 2023

I just uploaded my C++ implementation of EasyOCR, a well known ocr library for python. Also dusted some cobwebbs from some audio related projects as well, feel free to leave feedback or contribute! I only implemented the most salient parts, so certainly could use some community help! Cheers!
OCR at Edge on Cloudflare Constellation
3 projects | news.ycombinator.com | 3 Jul 2023

EasyOCR is a popular project if you are in an environment where you can use run Python and PyTorch (https://github.com/JaidedAI/EasyOCR). Other open source projects of note are PaddleOCR (https://github.com/PaddlePaddle/PaddleOCR) and docTR (https://github.com/mindee/doctr).
Donut: OCR-Free Document Understanding Transformer
4 projects | news.ycombinator.com | 29 May 2023

The main one was https://github.com/JaidedAI/EasyOCR, mostly because, as promised, it was pretty easy to use, and uses pytorch (which I preferred in case I wanted to tweak it). It has been updated since, but at the time it was using CRNN, which is a solid model, especially for the time - it wasn't (academic) SOTA but not far behind that. I'm sure I could've coaxed better performance than I got out of it with some retraining and hyperparameter tuning.
Help with OCR of pixel-y numbers
1 project | /r/computervision | 4 Apr 2023

Anyways, you can give a shot to EasyOCR, pretty solid and flexible
How to perform document OCR?
1 project | /r/computervision | 14 Mar 2023
Python unexpectedly quits (macOS ventura, M1)
1 project | /r/learnpython | 7 Mar 2023

The easyocr library: https://github.com/JaidedAI/EasyOCR
I made a website for a friend who owns a restaurant. He's wondering if there's a way to upload a picture of his menu daily. What is the best way to do this?
2 projects | /r/learnprogramming | 15 Jan 2023
Raspberry Pi Easyocr
1 project | /r/RASPBERRY_PI_PROJECTS | 7 Jan 2023

Not used it on a Pi but maybe a Docker version (if there is one) would run? Compose file here

What are some alternatives?

When comparing layout-parser and EasyOCR you can also consider the following projects:

py-pdf-parser - A Python tool to help extracting information from structured PDFs.

PaddleOCR - Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

tesseract-ocr - Tesseract Open Source OCR Engine (main repository)

BCNet - Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

doctr - docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

ssd_keras - A Keras port of Single Shot MultiBox Detector

OpenCV - Open Source Computer Vision Library

simpletransformers - Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

awesome-colab-notebooks - Collection of google colaboratory notebooks for fast and easy experiments

shabby-pages - ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents.

tesserocr - A Python wrapper for the tesseract-ocr API

layout-parser vs py-pdf-parser EasyOCR vs PaddleOCR layout-parser vs tika-python EasyOCR vs tesseract-ocr layout-parser vs BCNet EasyOCR vs doctr layout-parser vs ssd_keras EasyOCR vs OpenCV layout-parser vs simpletransformers EasyOCR vs awesome-colab-notebooks layout-parser vs shabby-pages EasyOCR vs tesserocr

Compare layout-parser vs EasyOCR and see what are their differences.

layout-parser

EasyOCR

layout-parser

EasyOCR

What are some alternatives?