surya

OCR, layout analysis, reading order, table recognition in 90+ languages (by VikParuchuri)

Surya Alternatives

Similar projects and alternatives to surya

  1. Medusa

    225 surya VS Medusa

    The world's most flexible commerce platform.

  2. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
  3. Alpaca-API

    The Alpaca API is a developer interface for trading operations and market data reception through the Alpaca platform.

  4. OCRmyPDF

    82 surya VS OCRmyPDF

    OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  5. paip-lisp

    76 surya VS paip-lisp

    Lisp code for the textbook "Paradigms of Artificial Intelligence Programming"

  6. PaddleOCR

    69 surya VS PaddleOCR

    Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)

  7. beekeeper-studio

    64 surya VS beekeeper-studio

    Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.

  8. realtime

    59 surya VS realtime

    Broadcast, Presence, and Postgres Changes via WebSockets

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. unilm

    44 surya VS unilm

    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

  11. EasyOCR

    41 surya VS EasyOCR

    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

  12. marker

    29 surya VS marker

    Convert PDF to markdown + JSON quickly with high accuracy

  13. docling

    19 surya VS docling

    Get your documents ready for gen AI

  14. zerox

    15 surya VS zerox

    OCR & Document Extraction using vision models

  15. llmsherpa

    6 surya VS llmsherpa

    Developer APIs to Accelerate LLM Projects

  16. llama_cloud_services

    Knowledge Agents and Management in the Cloud

  17. PaddlePaddle

    9 surya VS PaddlePaddle

    PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

  18. unstract

    12 surya VS unstract

    No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

  19. llama-hub

    5 surya VS llama-hub

    Discontinued A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain

  20. deepdoctection

    8 surya VS deepdoctection

    A Repo For Document AI

  21. Parsr

    8 surya VS Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

  22. document-ai-samples

    7 surya VS document-ai-samples

    Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud

  23. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better surya alternative or higher similarity.

surya discussion

Log in or Post with

surya reviews and mentions

Posts with mentions or reviews of surya. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2025-02-16.
  • Ask HN: What is the best method for turning a scanned book as a PDF into text?
    13 projects | news.ycombinator.com | 16 Feb 2025
    I have tried a bunch of things. This is what worked best for me: Surya [0]. It can run fully local on your laptop. I also tried EasyOCR [1], which is also quite good. I haven't tried this myself, but I will look at Paddle [2] if the previous two don't float your boat.

    All of these are OSS, and you don't need to pay a dime to anyone.

    [0]: https://github.com/VikParuchuri/surya

    [1]: https://github.com/JaidedAI/EasyOCR

    [2]: https://github.com/PaddlePaddle/Paddle

  • Show HN: Kreuzberg – Modern async Python library for document text extraction
    8 projects | news.ycombinator.com | 15 Feb 2025
    pypdfium2 is a great choice and a solid piece of software!

    You might want to look into https://github.com/VikParuchuri/surya as an alternative to tesseract. Yes, it's associated with a commercial company, but as you long as you aren't a company with 5M in ARR or $5M in funding it's free to use.

  • Nvidia-Ingest: Multi-modal data extraction
    4 projects | news.ycombinator.com | 10 Jan 2025
    Surya is a great open source toolkit for table parsing, layout analysis and OCR: https://github.com/VikParuchuri/surya
  • Ask HN: Who is hiring? (January 2025)
    18 projects | news.ycombinator.com | 2 Jan 2025
    Datalab | NYC | Full-time | Software Engineer and Head of Business Ops | $250k-$350k + 1.5-3% equity | https://www.datalab.to

    A significant % of useful data is locked away in tough-to-parse formats like PDFs. We build tools to extract it, like https://github.com/VikParuchuri/surya (15k Github stars), and https://github.com/VikParuchuri/marker (19k stars). We also run an inference API and product.

    We do meaningful research (we’ve trained several SoTA models), ship product, and contribute to open source. We’re hiring for 2 roles to help us scale:

    Senior fullstack software engineer

    - work across our open source repos, inference api, and frontend product

  • Show HN: Lessons learned from a big OCR project
    2 projects | news.ycombinator.com | 17 Dec 2024
    I’ve used Surya (https://github.com/VikParuchuri/surya) before. It is very good (on par with Google Vision, potentially better layout analysis), but yours is a challenging use case. I wonder if it would be useful.
  • Show HN: LLM Aided OCR (Correcting Tesseract OCR Errors with LLMs)
    17 projects | news.ycombinator.com | 9 Aug 2024
    Hi, I'm the author of surya (https://github.com/VikParuchuri/surya) - working on improving speed and accuracy now. Happy to collaborate if you have specific page types it's not working on. For modern/clean documents it benchmarks very similarly to Google Cloud, but working on supporting older documents better now.
  • Decoding OCR: A Comprehensive Guide
    3 projects | dev.to | 7 Aug 2024
    For a deeper dive into Surya-OCR, an advanced OCR system, enthusiasts and developers can explore its extensive components on GitHub. This open-source project is readily accessible for those eager to understand its mechanics or contribute to its evolution. Visit Surya-OCR on GitHub to explore the documentation, source code, and more.
  • From GPT-4 to AGI: Counting the OOMs
    2 projects | news.ycombinator.com | 9 Jul 2024
  • Ask HN: How to OCR a PDF and preserve whitespace?
    2 projects | news.ycombinator.com | 8 Jun 2024
  • A Picture Is Worth 170 Tokens: How Does GPT-4o Encode Images?
    5 projects | news.ycombinator.com | 7 Jun 2024
    checkout https://github.com/mindee/doctr or https://github.com/VikParuchuri/surya for something practical

    multimodal llm would of course blow it all out the water, so some llama3-like model is probably SOTA in terms of what you can run yourself. something like https://huggingface.co/blog/idefics2

  • A note from our sponsor - SaaSHub
    www.saashub.com | 27 Mar 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Stats

Basic surya repo stats
16
16,920
9.7
7 days ago

VikParuchuri/surya is an open source project licensed under GNU General Public License v3.0 only which is an OSI approved license.

The primary programming language of surya is Python.


Sponsored
CodeRabbit: AI Code Reviews for Developers
Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
coderabbit.ai

Did you know that Python is
the 2nd most popular programming language
based on number of references?