oemer VS Multi-Type-TD-TSR

Compare oemer and Multi-Type-TD-TSR to see how they differ.

oemer               Multi-Type-TD-TSR
4                   4
119                 174
-                   -
5.8                 2.3
2 months ago        3 months ago
Jupyter Notebook    Jupyter Notebook
MIT License         MIT License
  [D] Getting super-level table extraction
    3 projects | reddit.com/r/MachineLearning | 23 Aug 2022
    Recently, I've been researching how to extract tables from document images. I first tried PDFs, but data extraction libraries like Camelot are inconsistent. I found a deep learning model called CascadeTabNet; the detection results are okay, but cell recognition is poor. I also found Multi-Type-TD-TSR for table extraction. It uses image processing techniques to find the grids. It performs well on structured, bordered tables, but it breaks down when cells are not properly aligned. Even when extraction succeeds, aggregating multi-line cells (i.e., post-processing) is not straightforward.
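
The multi-line cell aggregation mentioned in the post can be approached in several ways. The following is only a minimal sketch of one heuristic and is not taken from Multi-Type-TD-TSR or any other project named here. It assumes the extracted table is already a rectangular list of rows of strings, and that a row with an empty key column continues the row above it. The function name `merge_multiline_rows` and the `key_col` parameter are hypothetical.

```python
from typing import List


def merge_multiline_rows(rows: List[List[str]], key_col: int = 0) -> List[List[str]]:
    """Fold continuation rows into the row above them.

    Assumption (not from any library): a row whose key column is empty,
    but which has text elsewhere, continues the previous row.
    Rows are assumed to all have the same number of cells.
    """
    merged: List[List[str]] = []
    for row in rows:
        cells = [cell.strip() for cell in row]
        is_continuation = bool(merged) and cells[key_col] == "" and any(cells)
        if is_continuation:
            previous = merged[-1]
            # Append each non-empty fragment to the matching cell above.
            for i, text in enumerate(cells):
                if text:
                    previous[i] = f"{previous[i]} {text}".strip()
        else:
            # A filled key column (or a fully empty row) starts a new row.
            merged.append(cells)
    return merged


rows = [
    ["1", "Widget", "Small blue"],
    ["", "", "plastic part"],
    ["2", "Gadget", "Metal"],
]
result = merge_multiline_rows(rows)
```

The heuristic fails when the key column itself wraps across lines, so real documents would likely need extra signals such as the vertical spacing between text lines.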

