InvoiceNet
awesome-document-understandi
Our great sponsors
InvoiceNet | awesome-document-understandi | |
---|---|---|
4 | 1 | |
2,389 | - | |
- | - | |
3.9 | - | |
about 2 months ago | - | |
Python | ||
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
InvoiceNet
-
How would you annotate resumes for object detection?
You can also possibly look at invoice extraction tools such as https://github.com/naiveHobo/InvoiceNet. They solve a similar issue and are researched fairly well, since there is a big market for that.
- Pdfsandwich
-
Extract informations from invoices with machine learning
Also, I would suggest you to use this codebase: https://github.com/naiveHobo/InvoiceNet
-
P Information Extraction From A Document
You can check out this repository. It contains an implementation of some recent research in deep learning for information extraction on invoices. https://github.com/naiveHobo/InvoiceNet
awesome-document-understandi
-
Pdfsandwich
While trying to find a specific project I recalled, I encountered this list of projects which might be of interest: https://github.com/tstanislawek/awesome-document-understandi...
The project I had in mind was similar to this one but I can't remember the name currently: https://github.com/tabulapdf/tabula
However, if you're looking for a ML-based, invoice-specific project looks like the other comment to your reply might be more useful.
What are some alternatives?
GLOM-TensorFlow - An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data
awesome-document-understanding - A curated list of resources for Document Understanding (DU) topic
pytorch2keras - PyTorch to Keras model convertor
tabula - Tabula is a tool for liberating data tables trapped inside PDF files
OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Mask-RCNN-TF2 - Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow 2.0
ripgrep-all - rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc.
coral-ordinal - Tensorflow Keras implementation of ordinal regression using consistent rank logits (CORAL) by Cao et al. (2019)
pycm - Multi-class confusion matrix library in Python
szabadfogasu-maszk - A face mask detection system using Tensorflow/Keras and OpenCV, for the "<19 Szabadfogású Számítógép" competition in 2020.