-
PyPDF2
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Are they scanned or computer-prepared? If they're computer-prepared, you could use PyPDF2, and compare the contents of each page to all the others. Open the file, open each page and compare it to all the pages that follow it using things like extractText() and getContents().
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.