DeepDoctection

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com.

  1. deepdoctection

    A Repo For Document AI

  2. doctr

    docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

    Last I checked, I saw a grocery bill example using https://github.com/mindee/doctr, and it was fairly accurate. Bear in mind that was last year; hopefully it has gotten even better, or there are other libraries by now.

  3. table-transformer

    Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.

  4. llama

    Inference code for Llama models

    I think the SOTA for local models is llama, which has a 2048-token context[1].

    [1] https://github.com/facebookresearch/llama/issues/16
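
The 2048 context mentioned for llama means that long OCR output may not fit into a single prompt. A minimal sketch of one way to handle that, splitting extracted text into pieces that stay under a fixed budget, might look like the following. The function name `chunk_text`, the `reserved_tokens` parameter, and the use of whitespace-separated words as a stand-in for tokens are all assumptions made for illustration; a real tokenizer usually produces more tokens than words, so the budget here is only approximate.

```python
def chunk_text(text: str, context_tokens: int = 2048, reserved_tokens: int = 512) -> list[str]:
    """Split text into pieces that fit a fixed context budget.

    Assumption: whitespace-separated words stand in for tokens. A real
    tokenizer would be needed for an exact count.
    """
    # Leave room in the context window for the instructions and the reply.
    budget = context_tokens - reserved_tokens
    if budget <= 0:
        raise ValueError("reserved_tokens must be smaller than context_tokens")

    words = text.split()
    # Take consecutive slices of at most `budget` words each.
    return [" ".join(words[i:i + budget]) for i in range(0, len(words), budget)]


if __name__ == "__main__":
    # Stand-in for OCR output: 4000 words of placeholder text.
    ocr_text = "item price " * 2000
    chunks = chunk_text(ocr_text)
    print(len(chunks), [len(chunk.split()) for chunk in chunks])
```

Each chunk could then be sent to the model separately. Splitting on word boundaries ignores document structure such as table rows or line items, so a layout-aware split may work better for receipts and invoices.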

NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

  • 📝✨ClearText

    3 projects | dev.to | 15 Jan 2025
  • OCR a lot of hand written invoice and records?

    1 project | /r/selfhosted | 7 Dec 2023
  • [P] EasyOCR in C++!

    2 projects | /r/MachineLearning | 2 Dec 2023
  • Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM

    8 projects | news.ycombinator.com | 28 Oct 2023
  • OCR at Edge on Cloudflare Constellation

    3 projects | news.ycombinator.com | 3 Jul 2023
