Multi-Type-TD-TSR vs CascadeTabNet

Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: (by Psarpei)

Source Code

Suggest alternative

Edit details

This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents" (by DevashishPrasad)

table-recognition table-structure-recognition table-detection table-detection-using-deep-learning

Source Code

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

Multi-Type-TD-TSR		CascadeTabNet
	Project
4	Mentions	1
236	Stars	1,397
-	Growth	-
0.0	Activity	0.0
over 1 year ago	Latest Commit	over 2 years ago
Jupyter Notebook	Language	Python
MIT License	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

Multi-Type-TD-TSR

Posts with mentions or reviews of Multi-Type-TD-TSR. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-08-23.

[D] Getting super-level table extraction
3 projects | /r/MachineLearning | 23 Aug 2022

Recently, I've been researching extracting tables from image documents. First I tried with pdfs, however, the data extraction libraries like camelot are inconsistent. I found a deep learning model called CascadeTabNet. The detection results are okay but cell recognition is poor. I even found Multi-Type-TD-TSR for table extraction. It uses image processing techniques to find the grids. It performs well on structured and bordered tables. However, it messes up if the cell is not properly aligned. Even if extraction is successful, aggregation of multi-line cells, i.e post-processing, is not very obvious.
Multi-Type-TD-TSR - Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition (State of the art approach for table structure recognition published on KI2021 - 44th German Conference on Artificial Intelligence)
1 project | /r/coolgithubprojects | 18 Dec 2021
Multi-Type-TD-TSR - Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations
1 project | /r/github | 31 May 2021

Check it out on my Github: https://github.com/Psarpei/Multi-Type-TD-TSR
Multi-Type-TD-TSR - Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations (New state-of-the-art approach for table structure recognition)
1 project | /r/coolgithubprojects | 31 May 2021

CascadeTabNet

Posts with mentions or reviews of CascadeTabNet. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-08-23.

[D] Getting super-level table extraction
3 projects | /r/MachineLearning | 23 Aug 2022

Recently, I've been researching extracting tables from image documents. First I tried with pdfs, however, the data extraction libraries like camelot are inconsistent. I found a deep learning model called CascadeTabNet. The detection results are okay but cell recognition is poor. I even found Multi-Type-TD-TSR for table extraction. It uses image processing techniques to find the grids. It performs well on structured and bordered tables. However, it messes up if the cell is not properly aligned. Even if extraction is successful, aggregation of multi-line cells, i.e post-processing, is not very obvious.

What are some alternatives?

When comparing Multi-Type-TD-TSR and CascadeTabNet you can also consider the following projects:

donut - Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

deepdoctection - A Repo For Document AI

MetalTranslate - Customizable machine translation in C++

table-transformer - Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

Recognition-of-logical-document-structures - First approach for recognizing logical document structures like texts, sentences, segments, words, chars and sentence/segment depth based on recurrent neural network grammars.

oemer - End-to-end Optical Music Recognition (OMR) system. Transcribe phone-taken music sheet image into MusicXML, which can be edited and converted to MIDI.

Real-time-Object-Detection-for-Autonomous-Driving-using-Deep-Learning - My Computer Vision project from my Computer Vision Course (Fall 2020) at Goethe University Frankfurt, Germany. Performance comparison between state-of-the-art Object Detection algorithms YOLO and Faster R-CNN based on the Berkeley DeepDrive (BDD100K) Dataset.

ITC - Computer Science coursework and projects at Tec de Monterrey 👨‍🎓

elastic_transformers - Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers

deathcounter_ocr - A python script which detects death messages by using OCR and displays a corrosponding deathcounter. Preconfigured for Elden Ring

docutron - Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.

Multi-Type-TD-TSR vs donut CascadeTabNet vs deepdoctection Multi-Type-TD-TSR vs MetalTranslate CascadeTabNet vs table-transformer Multi-Type-TD-TSR vs Recognition-of-logical-document-structures CascadeTabNet vs donut Multi-Type-TD-TSR vs oemer Multi-Type-TD-TSR vs Real-time-Object-Detection-for-Autonomous-Driving-using-Deep-Learning Multi-Type-TD-TSR vs ITC Multi-Type-TD-TSR vs elastic_transformers Multi-Type-TD-TSR vs deathcounter_ocr Multi-Type-TD-TSR vs docutron

Compare Multi-Type-TD-TSR vs CascadeTabNet and see what are their differences.

Multi-Type-TD-TSR

CascadeTabNet

Multi-Type-TD-TSR

CascadeTabNet

What are some alternatives?