genalog
docutron
genalog | docutron | |
---|---|---|
1 | 2 | |
295 | 17 | |
1.4% | - | |
0.0 | 5.8 | |
3 months ago | 6 months ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
genalog
-
Microsoft Unveils Genalog: An Open Source, AI Cross-Platform Python Package For Generating Document Images With Synthetic Noise
Github: https://github.com/microsoft/genalog
docutron
What are some alternatives?
deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
SDV - Synthetic data generation for tabular data
unstructured - Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
synthetic-data-genomics - Proof of concept code from Gretel.ai and Illumina using generative neural networks to create synthetic versions of mouse genotype and phenotype data.
document-ai-samples - Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
Copulas - A library to model multivariate data using copulas.
videocr-PaddleOCR - Extract hardcoded subtitles from videos using machine learning
ML-For-Beginners - 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
ocrpy - OCR, Archive, Index and Search: Implementation agnostic OCR framework.