Our great sponsors
-
sonic
🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
-
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR) (by roatienza)
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
If you don't need advanced search features, you can use Sonic (https://github.com/valeriansaliou/sonic). It's blazing fast and you can save lot of money on servers.
https://github.com/roatienza/deep-text-recognition-benchmark (available weights are for tasks that seem similar to OCR so there is a good chance you can use it out of the box). With a good gpu it should process hundreds to thousands image per seconds, so you likely can build your index in less than a day. (Maybe you can even port it to your iphone stack :) )
https://github.com/microsoft/GenerativeImage2Text (You'll probably have to train on your custom dataset that you have constituted)
There are tons of other freely available solutions that you can get with a search for things with keywords like "image to text ocr" "transformers" "visual transformers"...
https://github.com/roatienza/deep-text-recognition-benchmark (available weights are for tasks that seem similar to OCR so there is a good chance you can use it out of the box). With a good gpu it should process hundreds to thousands image per seconds, so you likely can build your index in less than a day. (Maybe you can even port it to your iphone stack :) )
https://github.com/microsoft/GenerativeImage2Text (You'll probably have to train on your custom dataset that you have constituted)
There are tons of other freely available solutions that you can get with a search for things with keywords like "image to text ocr" "transformers" "visual transformers"...
There's ocrit, a CLI utility using Apple's Vision framework for OCR: https://github.com/insidegui/ocrit
Pretty insane. If you don’t want to use iPhones, I made a while back macOCR which uses the same vision APIs, with a very simple CLI interface. See: https://github.com/schappim/macOCR
Related posts
- sonic: Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM.
- Sonic, An alternative to Elasticsearch that runs on a few MBs of RAM
- An alternative to Elasticsearch that runs on a few MBs of RAM
- An alternative to Elasticsearch that runs on a few MBs of RAM
- An alternative to Elasticsearch that runs on a few MBs of RAM