yolo-doclaynet

YOLO models trained on DocLayNet - power your document intelligence with layout analysis (by ppaanngggg)

Yolo-doclaynet Alternatives

Similar projects and alternatives to yolo-doclaynet

  1. PDF.js

    90 yolo-doclaynet VS PDF.js

    PDF Reader in JavaScript

  3. ultralytics

    Ultralytics YOLO11 🚀

  4. tiktoken

    tiktoken is a fast BPE tokeniser for use with OpenAI's models.

  5. docling

    Get your documents ready for gen AI

  6. DocLayNet

    DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

  7. vehicle_detection_tracker

    🚗 VehicleDetectionTracker: Real-time vehicle detection and tracking powered by YOLO. 🚙🚕 A personal Proof of Concept (POC) aimed at exploring the capabilities of real-time vehicle tracking, precision, and adaptability in computer vision projects. This is a testbed for learning and experimentation with YOLO and vehicle detection techniques.

  8. MinerU

    A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.

  10. puremd-mcp

    Unblock, scrape, and search tools for MCP clients

  11. AlcheMark

    Your files ready for Gen AI ✨🚀 AlcheMark is a lightweight PDF to Markdown, alchemical-inspired toolkit that transmutes PDF documents into structured Markdown pages—complete with rich metadata and named‐entity annotations—empowering you to uncover insights page by page.

  12. yolov3

    4 yolo-doclaynet VS yolov3

    YOLOv3 in PyTorch > ONNX > CoreML > TFLite

  13. AS-One

    Easy & Modular Computer Vision Detectors, Trackers & SAM - Run YOLOv9,v8,v7,v6,v5,R,X in under 10 lines of code.

  14. forest

    My mathematical Zettelkasten, created using forester. (by jonsterling)

  15. pdftotext

    Simple PDF text extraction

  16. yolov5

    YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better yolo-doclaynet alternative or higher similarity.

yolo-doclaynet reviews and mentions

Posts with mentions or reviews of yolo-doclaynet. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-05-13.
  • PDF to Text, a Challenging Problem
    9 projects | news.ycombinator.com | 13 May 2025
    I've worked on this in my day job: extracting _all_ relevant information from a financial services PDF for a BERT-based search engine.

    The only way to solve that is with a segmentation model followed by a regular OCR model. VLMs aren't ready for prime time and won't be for a decade or more.

    What worked was just using DocLayNet-trained YOLO models: https://github.com/DS4SD/DocLayNet . If you don't care about images or tables, you can feed the results into Tesseract (but for the love of god read the manual). Congratulations, you're done.

    Here are some pre-trained models that work OK out of the box: https://github.com/ppaanngggg/yolo-doclaynet . I found that we needed to increase the resolution from ~700px to ~2100px horizontal for financial data segmentation.

    VLMs, on the other hand, still choke on long text and hallucinate unpredictably. Worse, they can't understand nested data. If you give _any_ current model nothing harder than three nested rectangles with text under each, it will not extract the text correctly. Given that nested rectangles describe every table, no VLM can currently extract data from anything but the most straightforward of tables.
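
    The two practical details in the comment above (rendering pages at ~2100px wide, and skipping images and tables before OCR) can be sketched in plain Python. This is only an illustration under stated assumptions: the Region record, the function names, and the naive top-to-bottom ordering are hypothetical and do not come from yolo-doclaynet or Tesseract. The detector and OCR calls themselves are left out. The label names "Picture" and "Table" are two of DocLayNet's layout classes.

```python
from dataclasses import dataclass

POINTS_PER_INCH = 72.0  # PDF page sizes are expressed in points, 72 per inch

# DocLayNet classes to skip when the crops go straight to a text OCR engine.
SKIP_LABELS = frozenset({"Picture", "Table"})


@dataclass(frozen=True)
class Region:
    """Hypothetical detection record: class label plus pixel box corners."""
    label: str
    x1: float
    y1: float
    x2: float
    y2: float


def dpi_for_width(page_width_pt: float, target_width_px: int) -> float:
    """Return the render DPI that makes a page target_width_px pixels wide."""
    if page_width_pt <= 0:
        raise ValueError("page width must be positive")
    return target_width_px * POINTS_PER_INCH / page_width_pt


def regions_for_ocr(regions):
    """Drop pictures and tables, then order the rest top-to-bottom.

    The ordering is a naive assumption: it sorts by the top edge, then the
    left edge, and does not handle multi-column layouts.
    """
    kept = [r for r in regions if r.label not in SKIP_LABELS]
    return sorted(kept, key=lambda r: (r.y1, r.x1))
```

    For a US Letter page (612 points wide), dpi_for_width(612, 2100) comes to roughly 247 DPI, against roughly 82 DPI for a 700px render. Each region returned by regions_for_ocr would then be cropped from the rendered page and passed to the OCR engine.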

  • YOLOv12: The Next Evolution in Document Layout Analysis
    2 projects | dev.to | 7 Apr 2025
    The project uses my codebase, yolo-doclaynet. You can find all free models on Hugging Face, while the largest model is available here (trained using rented GPU resources).
  • YOLO models trained on DocLayNet, support document analytic intelligency
    1 project | news.ycombinator.com | 5 Aug 2024
  • How to analyze document layout by YOLO
    3 projects | dev.to | 13 Jun 2024
    You can find my solution in yolo-doclaynet. After examining several models and datasets, I've chosen YOLO as the base model and DocLayNet as the training data. Let's delve into more details.

Stats

Basic yolo-doclaynet repo stats
4
106
6.8
2 months ago
