docutron
German-NER-BERT
docutron | German-NER-BERT | |
---|---|---|
2 | 2 | |
17 | 7 | |
- | - | |
5.8 | 0.0 | |
7 months ago | almost 2 years ago | |
Jupyter Notebook | Jupyter Notebook | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
docutron
German-NER-BERT
-
[P] German NER on Legal Data using BERT
Link to project: https://github.com/harshildarji/German-NER-BERT/
What are some alternatives?
deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
malaya - Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
unstructured - Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
bert-sklearn - a sklearn wrapper for Google's BERT model
genalog - Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
document-ai-samples - Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
videocr-PaddleOCR - Extract hardcoded subtitles from videos using machine learning
ocrpy - OCR, Archive, Index and Search: Implementation agnostic OCR framework.
Multi-Type-TD-TSR - Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: