SaaSHub helps you find the best software and product alternatives Learn more →
Docling Alternatives
Similar projects and alternatives to docling
-
ollama
Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
-
-
-
-
PaddleOCR
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
-
-
-
kreuzberg
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.
-
-
-
-
-
-
-
-
-
-
-
docling discussion
docling reviews and mentions
-
My Local RAG article went viral. The product it promoted sold 1 copy in 6 months.
A comment thread that actually went somewhere — someone suggested docling, someone else brought up the EU AI Act
-
MarkItDown vs Docling vs Marker: PDF to Markdown for LLMs
Docling is IBM Research's MIT-licensed converter, currently at v2.92.0 (released April 29, 2026, four days before this post). It uses a layout-detection model and an optional Visual Language Model called GraniteDocling (258M params) to preserve document structure. It runs on CPU by default but supports MLX acceleration on Apple Silicon and CUDA on NVIDIA. Output is a structured DoclingDocument you can export to Markdown, JSON, or HTML.
-
Building docling-server: a one-command document API for our AI pipeline
If you have not seen docling yet, it is IBM's document processing library. PDF, DOCX, PPTX, scanned images, tables, the whole lot — out comes structured output. Very good at its job. The problem is not docling. The problem is everything around it.
- GLiNER2: Unified Schema-Based Information Extraction
-
The Curse of Context Window
OCR was the obvious option and with so many opensource libraries available, we were spoilt for choices. I wanted to use Docling as my prior experience with it has been good so far (I shall write a separate blog on those use-cases) but we were constrained by the infra.
-
How to build a knowledge graph for AI
Open source libraries: Kreuzberg, Docling, Marker
-
Docling is a Game-Changer for RAG Systems
Check out the Docling GitHub repository for documentation and examples
- Launch HN: Pulse (YC S24) – Production-grade unstructured document extraction
- Docling
-
My hands-on experience with Qdrant and Docling (and Ollama)
Docling documentation: https://docling-project.github.io/docling/
-
A note from our sponsor - SaaSHub
www.saashub.com | 8 Jun 2026
Stats
docling-project/docling is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of docling is Python.