img2table
ExtractTable-py
img2table | ExtractTable-py | |
---|---|---|
1 | 1 | |
378 | 241 | |
- | - | |
7.8 | 0.0 | |
6 days ago | 12 months ago | |
Python | Python | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
img2table
-
Extract Tables From Images in Python
Img2Table is a straightforward, user-friendly Python library for table extraction and identification that is based on OpenCV image processing and supports PDF files in addition to the majority of popular image file formats.
ExtractTable-py
-
Camelot VS ExtractTable-py - a user suggested alternative
2 projects | 2 Feb 2022
What are some alternatives?
caer - High-performance Vision library in Python. Scale your research, not boilerplate.
pdfplumber - Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
IkomiaApi - Deploy Computer Vision solutions with a few lines of code.
Camelot - A Python library to extract tabular data from PDFs
PySceneDetect - :movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
tabnet - PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf
fishington.io-bot - Fishington.io bot with OpenCV and NumPy
visidata - A terminal spreadsheet multitool for discovering and arranging data
vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
PyMuPDF - PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.