ExtractTable-py
img2table
ExtractTable-py | img2table | |
---|---|---|
1 | 1 | |
241 | 387 | |
- | - | |
0.0 | 7.8 | |
almost 1 year ago | 1 day ago | |
Python | Python | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ExtractTable-py
-
Camelot VS ExtractTable-py - a user suggested alternative
2 projects | 2 Feb 2022
img2table
-
Extract Tables From Images in Python
Img2Table is a straightforward, user-friendly Python library for table extraction and identification that is based on OpenCV image processing and supports PDF files in addition to the majority of popular image file formats.
What are some alternatives?
pdfplumber - Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
caer - High-performance Vision library in Python. Scale your research, not boilerplate.
Camelot - A Python library to extract tabular data from PDFs
IkomiaApi - Deploy Computer Vision solutions with a few lines of code.
tabnet - PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf
PySceneDetect - :movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
visidata - A terminal spreadsheet multitool for discovering and arranging data
fishington.io-bot - Fishington.io bot with OpenCV and NumPy
vaex - Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
PyMuPDF - PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.