pdf2doi VS PyPDF2

Compare pdf2doi vs PyPDF2 and see what are their differences.

pdf2doi

A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file. (by MicheleCotrufo)

PyPDF2

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files (by py-pdf)
Our great sponsors
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • WorkOS - The modern identity platform for B2B SaaS
  • SaaSHub - Software Alternatives and Reviews
pdf2doi PyPDF2
2 30
84 7,359
- 3.6%
4.4 9.5
about 2 months ago 6 days ago
Python Python
- BSD 3-Clause
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

pdf2doi

Posts with mentions or reviews of pdf2doi. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-04-22.

PyPDF2

Posts with mentions or reviews of PyPDF2. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-24.

What are some alternatives?

When comparing pdf2doi and PyPDF2 you can also consider the following projects:

arxiv-vanity - Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.

PDFMiner - Python PDF Parser (Not actively maintained). Check out pdfminer.six.

textract - extract text from any document. no muss. no fuss.

ReportLab

arxiv-latex-cleaner - arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

pdfplumber - Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

pdftitle - a utility to extract the title from a PDF file

WeasyPrint - The awesome document factory

pubs - Your bibliography on the command line

Camelot - A Python library to extract tabular data from PDFs

cobib - Console Bibliography

borb - borb is a library for reading, creating and manipulating PDF files in python.