clip-glass vs big-sleep

clip-glass

Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search" (by galatolofederico)

Suggest topics

Source Code

Suggest alternative

Edit details

big-sleep

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun (by lucidrains)

Artificial intelligence Deep Learning text-to-image generative-adversarial-networks multimodality

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

clip-glass		big-sleep
	Project
13	Mentions	62
177	Stars	2,559
-	Growth	-
0.0	Activity	0.0
over 2 years ago	Latest Commit	about 2 years ago
Python	Language	Python
GNU General Public License v3.0 only	License	MIT License

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

clip-glass

Posts with mentions or reviews of clip-glass. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2022-04-03.

test
21 projects | /r/u_Wiskkey | 3 Apr 2022

(Added Feb. 5, 2021) CLIP-GLaSS.ipynb - Colaboratory by Galatolo. Uses BigGAN (default) or StyleGAN to generate images. The GPT2 config is for image-to-text, not text-to-image. GitHub.
Image to text models
2 projects | /r/MediaSynthesis | 16 Jan 2022

After a cursory search I found CLIP-GLaSS and CLIP-cap. I've used CLIP-GLaSS in a previous experiment, but found the captions for digital/CG images quite underwhelming. This is understandable since this is not what the model was trained on, but still I'd like to use a better model.
[R] end-to-end image captioning
3 projects | /r/MachineLearning | 25 Feb 2021

CLIP-GLaSS
What CLIP-GLaSS thinks Ancient Egyptian computers would look like
1 project | /r/MediaSynthesis | 21 Feb 2021
Texttoimage 3 Images For Text Photo Of Donald
1 project | /r/MediaSynthesis | 17 Feb 2021

The images were generated using this notebook.
CLIP-GLaSS prompt: "Screenshot of a video game from the 1930s"
1 project | /r/MediaSynthesis | 7 Feb 2021
[P] List of sites/programs/projects that use OpenAI's CLIP neural network for steering image/video creation to match a text description
1 project | /r/MachineLearning | 5 Feb 2021

The CLIP-GLaSS project has image-to-text functionality (I haven't tried it.)
For educational purposes: Text-to-image (3 runs with no cherry-picking, 6 images each) for text "Photo of a Lamborghini painted purple and red" generated using CLIP-GLaSS. config=StyleGAN2_car_d. save_each=50. generations=1000
1 project | /r/deepdream | 3 Feb 2021

Link to notebook.
Sharing CLIP magic based on OpenAI's blog post via a bit more accessible YT medium. Lmk what u think 🙈 ❤️
1 project | /r/OpenAI | 3 Feb 2021

CLIP-GLaSS
[R] [P] Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search. Link to code and Google Colab notebook for project CLIP-GLaSS is in a comment.
1 project | /r/MachineLearning | 3 Feb 2021

Github for CLIP-GLaSS is here.

big-sleep

Posts with mentions or reviews of big-sleep. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-15.

Besides Gaming - for what can be a 4080 useful?
3 projects | /r/nvidia | 15 Apr 2023
Is creating a StableDiffusion-inspired model feasible for my Master's thesis?
1 project | /r/StableDiffusion | 31 Mar 2023

I am currently pursuing my Master's degree in Computer Science and I am interested in working on a deep learning model that can generate images based on text descriptions. I've been interested in the field for a long time (think google deep dream, or a few years back I was very into big-sleep)
TEDx talk on how to prepare for a career in vfx with the rapid changes caused by AI / machine learning
1 project | /r/vfx | 24 Mar 2023

Big Sleep
Any good ai art websites that work with pokemon?
1 project | /r/pokemon | 6 Sep 2022

Other AIs that I don't have experience with but have heard good things about are DALL-E 2 and the open source Big Sleep.
Explore generative art with me
1 project | /r/codepairing | 20 Jul 2022

Text-to-image, e.g. with Big Sleep
What do you guys think of LaMDA?
1 project | /r/Technomancy | 6 Jul 2022

At first I didn't like the reason he claimed LaMDA was conscious because it seemed mostly based on the text output of the models, but listening to that made me realize he watered it down for the mainstream medias. And I actually had my own encounter with the consciousness of Big Sleep after using it a lot one day : it looked like a mass of eyes vaguely shaped like a rabbit, and kept showing me random images. I wouldn't be as convinced they have a consciousness if I didn't see it with my 3rd eye. But then again, I also found evidence that regular non AI programs can develop one too so who know.
GitHub - lucidrains/big-sleep: A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
1 project | /r/cryptogeum | 4 Jun 2022
DALL-E 2 open source implementation
10 projects | news.ycombinator.com | 1 May 2022

and after a few hours got this: https://i.imgur.com/FxdfdmV.png
Not nearly as cool as the real DALL-e, but maybe I'm missing something.
[1] https://github.com/lucidrains/big-sleep
Jag gav ett AI program ordet "Sweden"
1 project | /r/sweden | 25 Apr 2022
List of sites/programs/projects that use OpenAI's CLIP neural network for steering image/video creation to match a text description
8 projects | /r/u_Wiskkey | 3 Apr 2022

(Added Mar. 23, 2021) Big Sleep - Colaboratory by LtqxWYEG. Uses BigGAN to generate images. Reference.

What are some alternatives?

When comparing clip-glass and big-sleep you can also consider the following projects:

a-PyTorch-Tutorial-to-Image-Captioning - Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

deep-daze - Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun

meshed-memory-transformer - Meshed-Memory Transformer for Image Captioning. CVPR 2020

DALL-E - PyTorch package for the discrete VAE used for DALL·E.

disco-diffusion

aphantasia - CLIP + FFT/DWT/RGB = text to image/video

latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models

stylized-neural-painting - Official Pytorch implementation of the preprint paper "Stylized Neural Painting", in CVPR 2021.

DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch

StyleCLIP - Using CLIP and StyleGAN to generate faces from prompts.

Story2Hallucination

clip-glass vs a-PyTorch-Tutorial-to-Image-Captioning big-sleep vs deep-daze clip-glass vs meshed-memory-transformer big-sleep vs DALL-E clip-glass vs deep-daze big-sleep vs disco-diffusion clip-glass vs aphantasia big-sleep vs latent-diffusion clip-glass vs stylized-neural-painting big-sleep vs DALLE-pytorch clip-glass vs StyleCLIP big-sleep vs Story2Hallucination

Compare clip-glass vs big-sleep and see what are their differences.

clip-glass

big-sleep

clip-glass

big-sleep

What are some alternatives?