PaddleOCR
Pytorch
Our great sponsors
PaddleOCR | Pytorch | |
---|---|---|
60 | 333 | |
37,652 | 76,925 | |
4.0% | 2.6% | |
8.6 | 10.0 | |
4 days ago | 3 days ago | |
Python | Python | |
Apache License 2.0 | BSD 1-Clause License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
PaddleOCR
-
Leveraging GPT-4 for PDF Data Extraction: A Comprehensive Guide
PyTesseract Module [ Github ] EasyOCR Module [ Github ] PaddlePaddle OCR [ Github ]
-
Show HN: BetterOCR combines and corrects multiple OCR engines with an LLM
Yup! But I'm still exploring options. (any recommendations would be welcomed!) Here are some candidates I'm considering:
- https://github.com/mindee/doctr
- https://github.com/open-mmlab/mmocr
- https://github.com/PaddlePaddle/PaddleOCR (honestly I don't know Mandarin so I'm a bit stuck)
- https://github.com/clovaai/donut - While it's primarily an "OCR-free document understanding transformer," I think it's worth experimenting with. Think I can sort this out by letting the LLM reason through it multiple times (although this will impact performance)
- yesterday got a suggestion to consider https://github.com/kakaobrain/pororo - I don't think development is still active but the results are pretty great on Korean text
-
How would you go about driving contextual data from images?
For images with text, if you want to do visual qa, document classification, table/key information extraction, checkout https://huggingface.co/blog/document-ai https://github.com/philschmid/document-ai-transformers https://github.com/google-research/pix2struct https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/ppstructure/README.md
-
OCR at Edge on Cloudflare Constellation
EasyOCR is a popular project if you are in an environment where you can use run Python and PyTorch (https://github.com/JaidedAI/EasyOCR). Other open source projects of note are PaddleOCR (https://github.com/PaddlePaddle/PaddleOCR) and docTR (https://github.com/mindee/doctr).
-
How do you parse tables in PDF with langchain? Especially, the context which is few lines above and below the table.
https://huggingface.co/blog/document-ai https://github.com/microsoft/table-transformer https://github.com/google-research/pix2struct https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/ppstructure/table/README.md
-
Donut: OCR-Free Document Understanding Transformer
When I was evaluating options a few months ago I found https://github.com/PaddlePaddle/PaddleOCR to be a very strong contender for my use case (reading product labels), but you'll definitely want to put together some representative docs/images and test a bunch of solutions to see what works for you.
- [Python] [OCR] Un nouvel outil OCR avec une meilleure reconnaissance de texte pour les documents et les cartes.
-
[D] Can I use ML/AI to read the back panels of electronic components?
PaddlePaddle/PaddleOCR: Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
-
Frog: OCR Tool for Linux
I’ve had good results from paddle ocr.
-
[OCR] The 24k star repo about OCR with 30+ languages supported including Chinese, Japanese .. and image conversion to excel file supported.
And you can find a lot of corpus and dictionaries in the pinned issue Multilingual OCR Development Plan from the community.
Pytorch
-
The Elements of Differentiable Programming
Sure, right here: https://github.com/pytorch/pytorch/blob/main/torch/autograd/...
Here's the documentation: https://pytorch.org/tutorials/intermediate/forward_ad_usage....
> When an input, which we call “primal”, is associated with a “direction” tensor, which we call “tangent”, the resultant new tensor object is called a “dual tensor” for its connection to dual numbers[0].
-
Dot vs Matrix vs Element-wise multiplication in PyTorch
In PyTorch with @, dot() or matmul():
-
Open Source Ascendant: The Transformation of Software Development in 2024
AI's Open Embrace Artificial intelligence (AI) and machine learning (ML) are increasingly leveraging open-source frameworks like TensorFlow [https://www.tensorflow.org/] and PyTorch [https://pytorch.org/]. This democratization of AI tools is driving innovation and lowering entry barriers across industries.
-
Best AI Tools for Students Learning Development and Engineering
Which label applies to a tool sometimes depends on what you do with it. For example, PyTorch or TensorFlow can be called a library, a toolkit, or a machine-learning framework.
-
Element-wise vs Matrix vs Dot multiplication
In PyTorch with * or mul(). ` or mul()` can multiply 0D or more D tensors by element-wise multiplication:
-
Bash Debugging
When I was at Facebook, I wrote a Python script to extract shell scripts from GitHub Actions workflows, so we could run them all through ShellCheck: https://github.com/pytorch/pytorch/blob/69e0bda9996865e319db...
-
Releasing The Force Of Machine Learning: A Novice’s Guide 😃
PyTorch: An open-source deep learning framework that facilitates dynamic computational graphs, making it flexible and efficient for research and production.
-
How To Implement Data Streaming In PyTorch From A Remote Database
In this blog post, we will go through a full example and setup a data stream to PyTorch from a playground dataset on a remote database.
-
Introducing Flama for Robust Machine Learning APIs
PyTorch
-
Beyond Backpropagation - Higher Order, Forward and Reverse-mode Automatic Differentiation for Tensorken
This post describes how I added automatic differentiation to Tensorken. Tensorken is my attempt to build a fully featured yet easy-to-understand and hackable implementation of a deep learning library in Rust. It takes inspiration from the likes of PyTorch, Tinygrad, and JAX.
What are some alternatives?
EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
tesseract-ocr - Tesseract Open Source OCR Engine (main repository)
mmocr - OpenMMLab Text Detection, Recognition and Understanding Toolbox
Tesseract.js - Pure Javascript OCR for more than 100 Languages 📖🎉🖥
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Flux.jl - Relax! Flux is the ML library that doesn't make you tensor
mediapipe - Cross-platform, customizable ML solutions for live and streaming media.
Apache Spark - Apache Spark - A unified analytics engine for large-scale data processing
flax - Flax is a neural network library for JAX that is designed for flexibility.
tinygrad - You like pytorch? You like micrograd? You love tinygrad! ❤️ [Moved to: https://github.com/tinygrad/tinygrad]
keras-ocr - A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
Pandas - Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more