[P] Training to read PDF documents. Any ideas?

This page summarizes the projects mentioned and recommended in the original post on /r/MachineLearning.

  • unilm

    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

  • There are a few good models out there that take this a few steps further and take care of a lot of the work for you. Check out LayoutLM: https://github.com/microsoft/unilm

  • EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.

  • If all you need is OCR, check out https://github.com/JaidedAI/EasyOCR. It uses an architecture similar to the cloud services, without the cost. You'll end up with the extracted text and a bounding box for each piece of it.

  • OCR_tablenet

    TableNet implementation in PyTorch

  • If you want to extract structured content such as tables from PDFs, TableNet may also be worth checking out: https://github.com/tomassosorio/OCR_tablenet
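
As a rough illustration of the EasyOCR suggestion above, the sketch below runs OCR on a page image and groups the returned text and bounding boxes into reading-order lines. It assumes the PDF page has already been rendered to an image (that step is not shown), and the helper names `ocr_page` and `hits_to_lines`, the confidence threshold, and the line tolerance are hypothetical choices for this example, not part of EasyOCR or of the original thread.

```python
from typing import List, Sequence, Tuple

# One OCR hit in the shape EasyOCR's readtext() returns by default:
# (four [x, y] corner points, recognized text, confidence between 0 and 1).
OcrHit = Tuple[Sequence[Sequence[float]], str, float]


def ocr_page(image_path: str, languages: Sequence[str] = ("en",)) -> List[OcrHit]:
    """Run EasyOCR on one page image (hypothetical wrapper for this sketch)."""
    import easyocr  # third-party: pip install easyocr

    reader = easyocr.Reader(list(languages))  # loads detection + recognition models
    return reader.readtext(image_path)


def hits_to_lines(
    hits: Sequence[OcrHit], min_conf: float = 0.3, line_tol: float = 10.0
) -> List[str]:
    """Group hits into text lines, top to bottom, then left to right.

    min_conf and line_tol (in pixels) are assumed values; tune them per document.
    """
    # Drop low-confidence hits, then sort by the top edge of each box.
    kept = sorted(
        (h for h in hits if h[2] >= min_conf),
        key=lambda h: min(p[1] for p in h[0]),
    )
    rows: List[Tuple[float, List[OcrHit]]] = []
    for hit in kept:
        top = min(p[1] for p in hit[0])
        # A hit joins the current row if its top edge is close to the row's first hit.
        if rows and abs(top - rows[-1][0]) <= line_tol:
            rows[-1][1].append(hit)
        else:
            rows.append((top, [hit]))
    # Within each row, order hits by their left edge and join the text.
    return [
        " ".join(h[1] for h in sorted(row, key=lambda h: min(p[0] for p in h[0])))
        for _, row in rows
    ]
```

Calling `hits_to_lines(ocr_page("page.png"))` would then give one string per detected line; `"page.png"` is a placeholder path. Grouping by a fixed pixel tolerance is a simple heuristic and will likely struggle with multi-column layouts, which is where layout-aware models such as LayoutLM come in.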

