awesome-bioie
awesome-document-understanding
awesome-bioie | awesome-document-understanding | |
---|---|---|
1 | 4 | |
300 | 1,136 | |
- | - | |
2.1 | 4.5 | |
about 1 year ago | 12 months ago | |
Creative Commons Zero v1.0 Universal | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
awesome-bioie
-
Snomed CT Entity Linking Challenge
> The objective of this competition is to link spans of text in clinical notes with specific topics in the SNOMED CT clinical terminology. Participants will train models based on real-world doctor's notes which have been de-identified and annotated with SNOMED CT concepts by medically trained professionals. This is the largest publicly available dataset of labelled clinical notes, and you can be one of the first to use it!
NER: Named Entity Recognition: https://en.wikipedia.org/wiki/Named-entity_recognition
awsome-medical-coding-nlp: https://github.com/acadTags/Awesome-medical-coding-NLP
awesome-ehr-deep-learning: https://github.com/hurcy/awesome-ehr-deeplearning
awesome-ner: https://github.com/smiyawaki0820/awesome-ner
awesome-bioie > Research groups: https://github.com/caufieldjh/awesome-bioie#groups-active-in...
SNOMED-CT as RDF: https://sphn-semantic-framework.readthedocs.io/en/latest/ext...
awesome-document-understanding
-
[R] Are there any open-source implementations of Document Understanding pipelines?
I have worked on several Document Understanding (DU) projects for my company during the last year. We've mainly used UiPath and Google's DocumentAI.
-
Pdfsandwich
While trying to find a specific project I recalled, I encountered this list of projects which might be of interest: https://github.com/tstanislawek/awesome-document-understandi...
The project I had in mind was similar to this one but I can't remember the name currently: https://github.com/tabulapdf/tabula
However, if you're looking for a ML-based, invoice-specific project looks like the other comment to your reply might be more useful.
-
Extract informations from invoices with machine learning
Check out this repository for inspiration: https://github.com/tstanislawek/awesome-document-understanding
-
[P] Curated List of Document Understanding (DU) Papers & Resources.
In the last few years, I spent a lot of time working on automate business processes of big companies and seeing rising interest in DU topics (especially from Key Information Extraction field). Therefore, I create a list https://github.com/tstanislawek/awesome-document-understanding of resources to make easier to track all the papers out there which are relevant to this topic.
What are some alternatives?
InvoiceNet - Deep neural network to extract intelligent information from invoice documents.
unstructured - Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Awesome-pytorch-list - A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.
awesome-ocr
awesome-document-understandi
awesome-huggingface - 🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
tocPDF - Generates bookmarks from the table of contents already available at the beginning of pdf files.
odinson - Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.
tabula - Tabula is a tool for liberating data tables trapped inside PDF files