[D] LLM or model that does image -> prompt?

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

MiniGPT-4

37 24,859 9.4 Python

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
TaskMatrix

10 34,525 7.3 Python

Visual ChatGPT (now renamed as TaskMatrix https://github.com/microsoft/TaskMatrix likely as a result of OpenAI trying to regulate the use of the name GPT. Same happened for GPT-Eval -> G-Eval).

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
CLIP

103 22,051 1.2 Jupyter Notebook

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

CLIP might work for your needs.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project