Python pdf-documents

Open-source Python projects categorized as pdf-documents

Top 4 Python pdf-document Projects

  • PyPDF2

    A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

  • Project mention: Yara scanning PDF files | /r/computerforensics | 2023-06-01
  • PyMuPDF

    PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

  • Project mention: FLaNK Stack for 04 December 2023 | dev.to | 2023-12-04
  • WorkOS

    The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

    WorkOS logo
  • pypdfium2

    Python bindings to PDFium

  • Project mention: Suggestions for container-based document conversion service? | /r/opensource | 2023-08-04

    I can't think of a proper API but you should check that library for parsing & dumping PDFs: https://github.com/pypdfium2-team/pypdfium2

  • pdfalyzer

    Analyze PDFs. With colors. And Yara.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Index

What are some of the best open-source pdf-document projects in Python? This list will help you:

Project Stars
1 PyPDF2 7,396
2 PyMuPDF 4,002
3 pypdfium2 264
4 pdfalyzer 220

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com