[Discussion] [Project] Text extraction from PDF files

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • A better document viewer

    1 project | /r/linux4noobs | 13 Sep 2023
  • OCR for a full pdf on Neoreader

    1 project | /r/Onyx_Boox | 25 Jun 2023
  • ELI5: why is PDF such a widespread text format, instead of a format that's actually easier to edit?

    1 project | /r/explainlikeimfive | 3 Jun 2023
  • [Free-Post Friday!] Recommendations for high volume document scanners

    1 project | /r/DataHoarder | 19 May 2023
  • OCR pdf and just keep the OCR text

    1 project | /r/AskTechnology | 6 May 2023