pdftabextract VS google_drive_ocr

Compare pdftabextract and google_drive_ocr and see how they differ.

pdftabextract

A set of tools for extracting tables from PDF files, helping to do data mining on (OCR-processed) scanned documents. (by WZBSocialScienceCenter)
                pdftabextract        google_drive_ocr
Mentions        -                    2
Stars           2,152                31
Growth          1.1%                 -
Activity        0.0                  0.0
Last commit     almost 2 years ago   almost 2 years ago
Language        Python               Python
License         Apache License 2.0   GNU General Public License v3.0 or later

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
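
As a rough illustration of the Growth figure, month-over-month growth is conventionally computed as the percentage change between two consecutive monthly star counts. The sketch below assumes that conventional definition; the function name and the star counts are hypothetical, and the site's exact calculation is not documented here.

```python
def month_over_month_growth(previous_stars: int, current_stars: int) -> float:
    """Return the percentage change in stars from one month to the next.

    Assumes the conventional definition of month-over-month growth;
    the tracking site may compute its figure differently.
    """
    if previous_stars <= 0:
        raise ValueError("previous_stars must be positive to compute a growth rate")
    return (current_stars - previous_stars) / previous_stars * 100.0


# Hypothetical star counts, chosen only for illustration.
growth = month_over_month_growth(previous_stars=2000, current_stars=2022)
print(f"{growth:.1f}%")
```

A project with no stars in the previous month has no defined growth rate under this definition, which is why the sketch raises an error in that case.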

pdftabextract

Posts with mentions or reviews of pdftabextract. We have used some of these posts to build our list of alternatives and similar projects.

We haven't tracked posts mentioning pdftabextract yet.
Tracking mentions began in Dec 2020.

google_drive_ocr

Posts with mentions or reviews of google_drive_ocr. We have used some of these posts to build our list of alternatives and similar projects.

What are some alternatives?

When comparing pdftabextract and google_drive_ocr you can also consider the following projects:

PDFMiner - Python PDF Parser (Not actively maintained). Check out pdfminer.six.

normcap - OCR powered screen-capture tool to capture information instead of images

Camelot - A Python library to extract tabular data from PDFs

OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched [Moved to: https://github.com/ocrmypdf/OCRmyPDF]

PyPDF2 - A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

WeasyPrint - The awesome document factory

textshot - Python tool for grabbing text via screenshot

ReportLab

Zdrive - Seamless download/upload contents via Google Drive 📂

pymorphy2 - Morphological analyzer / inflection engine for Russian and Ukrainian languages.

borb - borb is a library for reading, creating and manipulating PDF files in python.