Reading PDF with Python

This page summarizes the projects mentioned and recommended in the original post on /r/learnpython

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • pdfplumber

    Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

  • pdfplumber ... I don't personally do a lot of image processing with pdfs but I have a few scripts set up to auto extract sections of research articles for translation purposes. In my case, it works a charm. The documentation is a bit hard to follow sometimes but the GitHub page has some good examples.

  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  • ocrmypdf https://github.com/ocrmypdf/OCRmyPDF

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Code to extract text from pdf to excel

    2 projects | /r/Python | 2 Jun 2023
  • Advanced PDF to Excel with documents and example code

    2 projects | /r/learnpython | 1 May 2023
  • how do I automate extracting data from two pdfs and input into an excel sheet according to an order number

    2 projects | /r/learnpython | 24 Apr 2023
  • When Will the GenAI Bubble Burst?

    1 project | news.ycombinator.com | 4 Apr 2024
  • A better document viewer

    1 project | /r/linux4noobs | 13 Sep 2023