Top 3 Python pdf-parsing Projects
- PyPDF2
  A pure-Python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files.
- pdfplumber
  Plumb a PDF for detailed information about each char, rectangle, line, et cetera, and easily extract text and tables.
  Project mention: Running OCR against PDFs and images directly in the browser | news.ycombinator.com | 2024-03-30
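As a rough illustration of what the two most-starred libraries above are used for, the sketch below splits a PDF into fixed-size chunks with PyPDF2 and pulls text and tables out of each page with pdfplumber. It assumes PyPDF2 3.x (the `PdfReader` / `PdfWriter` names) and a recent pdfplumber release. The helper names `chunk_page_ranges`, `split_pdf`, and `extract_text_and_tables`, along with the output file naming, are hypothetical and chosen only for this example.

```python
from pathlib import Path


def chunk_page_ranges(n_pages, chunk_size):
    """Return (start, stop) zero-based page ranges covering n_pages in chunks."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    return [
        (start, min(start + chunk_size, n_pages))
        for start in range(0, n_pages, chunk_size)
    ]


def split_pdf(src_path, chunk_size, out_dir):
    """Split src_path into several PDFs of at most chunk_size pages each."""
    # Imported here so the pure helper above works without PyPDF2 installed.
    from PyPDF2 import PdfReader, PdfWriter  # assumes PyPDF2 3.x naming

    reader = PdfReader(src_path)
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    outputs = []
    for start, stop in chunk_page_ranges(len(reader.pages), chunk_size):
        writer = PdfWriter()
        for index in range(start, stop):
            writer.add_page(reader.pages[index])
        # Hypothetical naming scheme: one-based, inclusive page numbers.
        target = out_dir / f"pages_{start + 1}-{stop}.pdf"
        with open(target, "wb") as handle:
            writer.write(handle)
        outputs.append(target)
    return outputs


def extract_text_and_tables(src_path):
    """Collect the text and any detected tables for every page of a PDF."""
    import pdfplumber  # third-party; imported lazily for the same reason

    results = []
    with pdfplumber.open(src_path) as pdf:
        for page in pdf.pages:
            results.append(
                {
                    "page": page.page_number,
                    # extract_text() may return None for pages with no text.
                    "text": page.extract_text() or "",
                    "tables": page.extract_tables(),
                }
            )
    return results
```

Calling `split_pdf("report.pdf", 10, "chunks")` would write one file per ten pages into a `chunks` directory, where `report.pdf` is a placeholder path. Table detection in pdfplumber depends heavily on how the PDF was produced, so results on scanned or unruled tables may need tuning.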
NOTE:
The open-source projects on this list are ordered by number of GitHub stars.
The number of mentions indicates repo mentions in the last 12 months or
since we started tracking (Dec 2020).
The latest post mention was on 2024-03-30.
Python pdf-parsing related posts
- Running OCR against PDFs and images directly in the browser
- Parsing dates with PDFminer
- How to Extract Data from Tables in a Public Record PDF
- Code to extract text from pdf to excel
- I need to parse unstructured tables from a pdf into a json, what can I do
- Advanced PDF to Excel with documents and example code
- how do I automate extracting data from two pdfs and input into an excel sheet according to an order number
Index
What are some of the best open-source pdf-parsing projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | PyPDF2 | 7,359 |
2 | pdfplumber | 5,468 |
3 | py-pdf-parser | 332 |