Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge. Learn more →
PaddleOCR Alternatives
Similar projects and alternatives to PaddleOCR
-
EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
-
-
InfluxDB
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
-
-
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
-
-
Pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
-
keras-ocr
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
-
donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
-
doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
-
normcap
OCR powered screen-capture tool to capture information instead of images
-
scantailor-advanced
ScanTailor Advanced is the version that merges the features of the ScanTailor Featured and ScanTailor Enhanced versions, brings new ones and fixes.
-
-
-
devops-exercises
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
-
-
flameshot
Powerful yet simple to use screenshot software :desktop_computer: :camera_flash:
-
manim
A community-maintained Python framework for creating mathematical animations. (by ManimCommunity)
-
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
-
Mergify
Updating dependencies is time-consuming.. Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.
PaddleOCR reviews and mentions
-
How would you go about driving contextual data from images?
For images with text, if you want to do visual qa, document classification, table/key information extraction, checkout https://huggingface.co/blog/document-ai https://github.com/philschmid/document-ai-transformers https://github.com/google-research/pix2struct https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/ppstructure/README.md
-
OCR at Edge on Cloudflare Constellation
EasyOCR is a popular project if you are in an environment where you can use run Python and PyTorch (https://github.com/JaidedAI/EasyOCR). Other open source projects of note are PaddleOCR (https://github.com/PaddlePaddle/PaddleOCR) and docTR (https://github.com/mindee/doctr).
-
How do you parse tables in PDF with langchain? Especially, the context which is few lines above and below the table.
https://huggingface.co/blog/document-ai https://github.com/microsoft/table-transformer https://github.com/google-research/pix2struct https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.6/ppstructure/table/README.md
-
Donut: OCR-Free Document Understanding Transformer
When I was evaluating options a few months ago I found https://github.com/PaddlePaddle/PaddleOCR to be a very strong contender for my use case (reading product labels), but you'll definitely want to put together some representative docs/images and test a bunch of solutions to see what works for you.
- [Python] [OCR] Un nouvel outil OCR avec une meilleure reconnaissance de texte pour les documents et les cartes.
-
[D] Can I use ML/AI to read the back panels of electronic components?
PaddlePaddle/PaddleOCR: Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
-
Frog: OCR Tool for Linux
I’ve had good results from paddle ocr.
-
[OCR] The 24k star repo about OCR with 30+ languages supported including Chinese, Japanese .. and image conversion to excel file supported.
And you can find a lot of corpus and dictionaries in the pinned issue Multilingual OCR Development Plan from the community.
If you encounter any problems including usage problems or badcase, please pull an issue here and let us know.
-
A note from our sponsor - InfluxDB
www.influxdata.com | 29 Sep 2023
Stats
PaddlePaddle/PaddleOCR is an open source project licensed under Apache License 2.0 which is an OSI approved license.
The primary programming language of PaddleOCR is Python.