Image to text models

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

clip-glass

13 177 0.0 Python

Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"

After a cursory search I found CLIP-GLaSS and CLIP-cap. I've used CLIP-GLaSS in a previous experiment, but found the captions for digital/CG images quite underwhelming. This is understandable since this is not what the model was trained on, but still I'd like to use a better model.

CLIP_prefix_caption

2 1,204 0.0 Jupyter Notebook

Simple image captioning model

After a cursory search I found CLIP-GLaSS and CLIP-cap. I've used CLIP-GLaSS in a previous experiment, but found the captions for digital/CG images quite underwhelming. This is understandable since this is not what the model was trained on, but still I'd like to use a better model.

WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Building LinkedIn Elevator Pitch Generator with Lyzr SDK
1 project | dev.to | 28 Apr 2024
Show HN: Create typed declarative API clients quickly and easily (Python)
1 project | news.ycombinator.com | 28 Apr 2024
What are LLMs? An intro into AI, models, tokens, parameters, weights, quantization and more
4 projects | dev.to | 28 Apr 2024
LTK is a little toolkit for writing UIs in PyScript
1 project | news.ycombinator.com | 28 Apr 2024
Block* and AgentFormer – Playing with blocks and Transformers (yay)
1 project | news.ycombinator.com | 28 Apr 2024

This page summarizes the projects mentioned and recommended in the original post on /r/MediaSynthesis Post date: 16 Jan 2022

clip-glass

CLIP_prefix_caption

WorkOS

Related posts