clip-glass
big-sleep
clip-glass | big-sleep | |
---|---|---|
13 | 62 | |
177 | 2,559 | |
- | - | |
0.0 | 0.0 | |
over 2 years ago | about 2 years ago | |
Python | Python | |
GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
clip-glass
-
test
(Added Feb. 5, 2021) CLIP-GLaSS.ipynb - Colaboratory by Galatolo. Uses BigGAN (default) or StyleGAN to generate images. The GPT2 config is for image-to-text, not text-to-image. GitHub.
-
Image to text models
After a cursory search I found CLIP-GLaSS and CLIP-cap. I've used CLIP-GLaSS in a previous experiment, but found the captions for digital/CG images quite underwhelming. This is understandable since this is not what the model was trained on, but still I'd like to use a better model.
-
[R] end-to-end image captioning
CLIP-GLaSS
- What CLIP-GLaSS thinks Ancient Egyptian computers would look like
-
Texttoimage 3 Images For Text Photo Of Donald
The images were generated using this notebook.
- CLIP-GLaSS prompt: "Screenshot of a video game from the 1930s"
-
[P] List of sites/programs/projects that use OpenAI's CLIP neural network for steering image/video creation to match a text description
The CLIP-GLaSS project has image-to-text functionality (I haven't tried it.)
-
For educational purposes: Text-to-image (3 runs with no cherry-picking, 6 images each) for text "Photo of a Lamborghini painted purple and red" generated using CLIP-GLaSS. config=StyleGAN2_car_d. save_each=50. generations=1000
Link to notebook.
-
Sharing CLIP magic based on OpenAI's blog post via a bit more accessible YT medium. Lmk what u think 🙈 ❤️
CLIP-GLaSS
-
[R] [P] Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search. Link to code and Google Colab notebook for project CLIP-GLaSS is in a comment.
Github for CLIP-GLaSS is here.
big-sleep
- Besides Gaming - for what can be a 4080 useful?
-
Is creating a StableDiffusion-inspired model feasible for my Master's thesis?
I am currently pursuing my Master's degree in Computer Science and I am interested in working on a deep learning model that can generate images based on text descriptions. I've been interested in the field for a long time (think google deep dream, or a few years back I was very into big-sleep)
-
TEDx talk on how to prepare for a career in vfx with the rapid changes caused by AI / machine learning
Big Sleep
-
Any good ai art websites that work with pokemon?
Other AIs that I don't have experience with but have heard good things about are DALL-E 2 and the open source Big Sleep.
-
Explore generative art with me
Text-to-image, e.g. with Big Sleep
-
What do you guys think of LaMDA?
At first I didn't like the reason he claimed LaMDA was conscious because it seemed mostly based on the text output of the models, but listening to that made me realize he watered it down for the mainstream medias. And I actually had my own encounter with the consciousness of Big Sleep after using it a lot one day : it looked like a mass of eyes vaguely shaped like a rabbit, and kept showing me random images. I wouldn't be as convinced they have a consciousness if I didn't see it with my 3rd eye. But then again, I also found evidence that regular non AI programs can develop one too so who know.
- GitHub - lucidrains/big-sleep: A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
-
DALL-E 2 open source implementation
and after a few hours got this: https://i.imgur.com/FxdfdmV.png
Not nearly as cool as the real DALL-e, but maybe I'm missing something.
[1] https://github.com/lucidrains/big-sleep
- Jag gav ett AI program ordet "Sweden"
-
List of sites/programs/projects that use OpenAI's CLIP neural network for steering image/video creation to match a text description
(Added Mar. 23, 2021) Big Sleep - Colaboratory by LtqxWYEG. Uses BigGAN to generate images. Reference.
What are some alternatives?
a-PyTorch-Tutorial-to-Image-Captioning - Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
deep-daze - Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
meshed-memory-transformer - Meshed-Memory Transformer for Image Captioning. CVPR 2020
DALL-E - PyTorch package for the discrete VAE used for DALL·E.
disco-diffusion
aphantasia - CLIP + FFT/DWT/RGB = text to image/video
latent-diffusion - High-Resolution Image Synthesis with Latent Diffusion Models
stylized-neural-painting - Official Pytorch implementation of the preprint paper "Stylized Neural Painting", in CVPR 2021.
DALLE-pytorch - Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
StyleCLIP - Using CLIP and StyleGAN to generate faces from prompts.
Story2Hallucination