Our great sponsors
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
This script uses the tesseract-ocr engine and some pip libraries. I've made it to be as user-friendly as I could and (theoretically) could translate from and to any language. It works with any PDF file, whether it is generated with any word proccessing software (MS Word, libreoffice writer...) or from a scanned document.
PDFtoTXT.py
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.