Calliar
deep-text-recognition-benchmark
Calliar | deep-text-recognition-benchmark | |
---|---|---|
1 | 1 | |
138 | 278 | |
2.2% | - | |
2.2 | 2.2 | |
about 1 year ago | about 1 month ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
Calliar
-
[R] Calliar: An Online Handwritten Dataset for Arabic Calligraphy
Dataset (JSON) and visualizations in their github repo: https://github.com/ARBML/Calliar
deep-text-recognition-benchmark
-
Building an Internet Scale Meme Search Engine
https://github.com/roatienza/deep-text-recognition-benchmark (available weights are for tasks that seem similar to OCR so there is a good chance you can use it out of the box). With a good gpu it should process hundreds to thousands image per seconds, so you likely can build your index in less than a day. (Maybe you can even port it to your iphone stack :) )
https://github.com/microsoft/GenerativeImage2Text (You'll probably have to train on your custom dataset that you have constituted)
There are tons of other freely available solutions that you can get with a search for things with keywords like "image to text ocr" "transformers" "visual transformers"...
What are some alternatives?
postcss-rtl - PostCSS plugin for RTL-adaptivity
GenerativeImage2Text - GIT: A Generative Image-to-text Transformer for Vision and Language
deep-text-recognition-benchmark - Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Transformer-Explainability - [CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
document-ai-samples - Sample applications and demos for Document AI, the end-to-end document processing platform on Google Cloud
sonic - 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
ocrpy - OCR, Archive, Index and Search: Implementation agnostic OCR framework.
ocrit - Simple command-line utility for performing OCR using Apple's Vision framework
macOCR - Get any text on your screen into your clipboard.
Transformers-Tutorials - This repository contains demos I made with the Transformers library by HuggingFace.