layout-parser VS shabby-pages

Compare layout-parser vs shabby-pages and see what are their differences.

shabby-pages

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to original denoised documents. (by sparkfish)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
layout-parser shabby-pages
6 1
4,453 42
3.6% -
0.0 3.7
about 2 months ago 3 days ago
Python Jupyter Notebook
Apache License 2.0 MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

layout-parser

Posts with mentions or reviews of layout-parser. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-06.

shabby-pages

Posts with mentions or reviews of shabby-pages. We have used some of these posts to build our list of alternatives and similar projects.
  • [P][R] Announcing: Dataset & Denoising Shabby Pages Competition
    1 project | /r/MachineLearning | 6 Apr 2022
    Into machine learning? Want a chance to earn a new MacBook Pro? Check out the Denoising ShabbyPages competition! The ShabbyPages dataset is being produced as a way to help train, test, and calibrate computer vision machine learning algorithms designed for working with documents. Enter the competition by training a model to remove the noise, and be awarded a MacBook Pro or some swag in the process! Check out the short paper introducing the dataset, and learn more about the competition at denoising-shabby.com.

What are some alternatives?

When comparing layout-parser and shabby-pages you can also consider the following projects:

EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Data-Science-Cheatsheet - A helpful 5-page machine learning cheatsheet to assist with exam reviews, interview prep, and anything in-between.

py-pdf-parser - A Python tool to help extracting information from structured PDFs.

HugsVision - HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision

tika-python - Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

cia - 🐱‍💻 CIA Factbook data analysis and dataset reconstruction, modification, and tuning go here.

BCNet - Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers [CVPR 2021]

label-studio - Label Studio is a multi-type data labeling and annotation tool with standardized output format

ssd_keras - A Keras port of Single Shot MultiBox Detector

simpletransformers - Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

pdf-extract - A rust library for extracting content from pdfs

GDR-Net - GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation. (CVPR 2021)