awesome-document-understanding
awesome-ocr
Our great sponsors
awesome-document-understanding | awesome-ocr | |
---|---|---|
4 | 1 | |
1,115 | 778 | |
- | - | |
4.5 | 4.0 | |
11 months ago | 4 months ago | |
- | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-document-understanding
-
[R] Are there any open-source implementations of Document Understanding pipelines?
I have worked on several Document Understanding (DU) projects for my company during the last year. We've mainly used UiPath and Google's DocumentAI.
-
Pdfsandwich
While trying to find a specific project I recalled, I encountered this list of projects which might be of interest: https://github.com/tstanislawek/awesome-document-understandi...
The project I had in mind was similar to this one but I can't remember the name currently: https://github.com/tabulapdf/tabula
However, if you're looking for a ML-based, invoice-specific project looks like the other comment to your reply might be more useful.
-
Extract informations from invoices with machine learning
Check out this repository for inspiration: https://github.com/tstanislawek/awesome-document-understanding
-
[P] Curated List of Document Understanding (DU) Papers & Resources.
In the last few years, I spent a lot of time working on automate business processes of big companies and seeing rising interest in DU topics (especially from Key Information Extraction field). Therefore, I create a list https://github.com/tstanislawek/awesome-document-understanding of resources to make easier to track all the papers out there which are relevant to this topic.
awesome-ocr
-
HTR/OCR for Handwritten Text?
Other than that, I've found this list of HTR (handwritten text recognition) options: https://github.com/zacharywhitley/awesome-ocr#handwritten. I'll be working my way through that. Do you know any of the options there, and should I focus on or skip them?
What are some alternatives?
InvoiceNet - Deep neural network to extract intelligent information from invoice documents.
deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
unstructured - Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
awesome-jax - JAX - A curated list of resources https://github.com/google/jax
Awesome-pytorch-list - A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
awesome-production-machine-learning - A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
awesome-document-understandi
awesome-project-ideas - Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas
awesome-huggingface - 🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
Awesome-Out-Of-Distribution-Detection - A professionally curated list of papers, tutorials, books, videos, articles and open-source libraries etc for Out-of-distribution detection, robustness, and generalization
tocPDF - Generates bookmarks from the table of contents already available at the beginning of pdf files.
datascience - Curated list of Python resources for data science.