SaaSHub helps you find the best software and product alternatives Learn more →
Surya Alternatives
Similar projects and alternatives to surya
-
-
CodeRabbit
CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.
-
Alpaca-API
The Alpaca API is a developer interface for trading operations and market data reception through the Alpaca platform.
-
-
-
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
-
beekeeper-studio
Modern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
-
-
-
-
-
PaddlePaddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
-
-
llama-hub
Discontinued A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
-
-
-
document-ai-samples
Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
surya discussion
surya reviews and mentions
-
Ask HN: What is the best method for turning a scanned book as a PDF into text?
I have tried a bunch of things. This is what worked best for me: Surya [0]. It can run fully local on your laptop. I also tried EasyOCR [1], which is also quite good. I haven't tried this myself, but I will look at Paddle [2] if the previous two don't float your boat.
All of these are OSS, and you don't need to pay a dime to anyone.
[0]: https://github.com/VikParuchuri/surya
[1]: https://github.com/JaidedAI/EasyOCR
[2]: https://github.com/PaddlePaddle/Paddle
-
Show HN: Kreuzberg – Modern async Python library for document text extraction
pypdfium2 is a great choice and a solid piece of software!
You might want to look into https://github.com/VikParuchuri/surya as an alternative to tesseract. Yes, it's associated with a commercial company, but as you long as you aren't a company with 5M in ARR or $5M in funding it's free to use.
-
Nvidia-Ingest: Multi-modal data extraction
Surya is a great open source toolkit for table parsing, layout analysis and OCR: https://github.com/VikParuchuri/surya
-
Ask HN: Who is hiring? (January 2025)
Datalab | NYC | Full-time | Software Engineer and Head of Business Ops | $250k-$350k + 1.5-3% equity | https://www.datalab.to
A significant % of useful data is locked away in tough-to-parse formats like PDFs. We build tools to extract it, like https://github.com/VikParuchuri/surya (15k Github stars), and https://github.com/VikParuchuri/marker (19k stars). We also run an inference API and product.
We do meaningful research (we’ve trained several SoTA models), ship product, and contribute to open source. We’re hiring for 2 roles to help us scale:
Senior fullstack software engineer
- work across our open source repos, inference api, and frontend product
-
Show HN: Lessons learned from a big OCR project
I’ve used Surya (https://github.com/VikParuchuri/surya) before. It is very good (on par with Google Vision, potentially better layout analysis), but yours is a challenging use case. I wonder if it would be useful.
-
Show HN: LLM Aided OCR (Correcting Tesseract OCR Errors with LLMs)
Hi, I'm the author of surya (https://github.com/VikParuchuri/surya) - working on improving speed and accuracy now. Happy to collaborate if you have specific page types it's not working on. For modern/clean documents it benchmarks very similarly to Google Cloud, but working on supporting older documents better now.
-
Decoding OCR: A Comprehensive Guide
For a deeper dive into Surya-OCR, an advanced OCR system, enthusiasts and developers can explore its extensive components on GitHub. This open-source project is readily accessible for those eager to understand its mechanics or contribute to its evolution. Visit Surya-OCR on GitHub to explore the documentation, source code, and more.
- From GPT-4 to AGI: Counting the OOMs
- Ask HN: How to OCR a PDF and preserve whitespace?
-
A Picture Is Worth 170 Tokens: How Does GPT-4o Encode Images?
checkout https://github.com/mindee/doctr or https://github.com/VikParuchuri/surya for something practical
multimodal llm would of course blow it all out the water, so some llama3-like model is probably SOTA in terms of what you can run yourself. something like https://huggingface.co/blog/idefics2
-
A note from our sponsor - SaaSHub
www.saashub.com | 27 Mar 2025
Stats
VikParuchuri/surya is an open source project licensed under GNU General Public License v3.0 only which is an OSI approved license.
The primary programming language of surya is Python.