GenerativeImage2Text

GIT: A Generative Image-to-text Transformer for Vision and Language (by microsoft)

GenerativeImage2Text Alternatives

Similar projects and alternatives to GenerativeImage2Text

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better GenerativeImage2Text alternative or higher similarity.

GenerativeImage2Text reviews and mentions

Posts with mentions or reviews of GenerativeImage2Text. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-01-11.
  • Building an Internet Scale Meme Search Engine
    5 projects | news.ycombinator.com | 11 Jan 2023
    https://github.com/roatienza/deep-text-recognition-benchmark (the available weights are for tasks that seem similar to OCR, so there is a good chance you can use it out of the box). With a good GPU it should process hundreds to thousands of images per second, so you can likely build your index in less than a day. (Maybe you can even port it to your iPhone stack :) )

    https://github.com/microsoft/GenerativeImage2Text (you'll probably have to train it on a custom dataset that you put together yourself)

    There are tons of other freely available solutions that you can find by searching for keywords like "image to text ocr", "transformers", "visual transformers"...

Stats

Basic GenerativeImage2Text repo stats
Mentions: 1
Stars: 520
Activity: 4.9
Last commit: 6 months ago

microsoft/GenerativeImage2Text is an open source project licensed under the MIT License, which is an OSI-approved license.

The primary programming language of GenerativeImage2Text is Python.

